Hello, we don't have enough knowledge if this is a kernel issue or test case issue. ========================================================================================= tbox_group/testcase/rootfs/kconfig/compiler/nr_threads/disk/testtime/fs/test/cpufreq_governor: lkp-icl-2sp4/stress-ng/debian-12-x86_64-20240206.cgz/x86_64-rhel-9.4/gcc-12/100%/1HDD/60s/xfs/fpunch/performance 3efe7571c3ae2b64 245618f8e45ff4f79327627b474 ---------------- --------------------------- fail:runs %reproduction fail:runs | | | :6 100% 6:6 stress-ng.fpunch.fail since the failure is persistent, just report what we observed in our tests FYI. kernel test robot noticed "stress-ng.fpunch.fail" on: commit: 245618f8e45ff4f79327627b474b563da71c2c75 ("block: protect wbt_lat_usec using q->elevator_lock") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master [test failed on linus/master b36ddb9210e6812eb1c86ad46b66cc46aa193487] [test failed on linux-next/master 8566fc3b96539e3235909d6bdda198e1282beaed] [test failed on fix commit 9730763f4756e32520cb86778331465e8d063a8f] in testcase: stress-ng version: stress-ng-x86_64-1c71921fd-1_20250212 with following parameters: nr_threads: 100% disk: 1HDD testtime: 60s fs: xfs test: fpunch cpufreq_governor: performance config: x86_64-rhel-9.4 compiler: gcc-12 test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz (Ice Lake) with 128G memory (please refer to attached dmesg/kmsg for entire log/backtrace) If you fix the issue in a separate patch/commit (i.e. not just a new version of the same patch/commit), kindly add following tags | Reported-by: kernel test robot <oliver.sang@xxxxxxxxx> | Closes: https://lore.kernel.org/oe-lkp/202505221030.760980df-lkp@xxxxxxxxx 2025-03-20 08:33:52 mkdir -p /mnt/stress-ng 2025-03-20 08:33:52 mount /dev/sdc1 /mnt/stress-ng 2025-03-20 08:33:52 cd /mnt/stress-ng File: "/mnt/stress-ng" ID: 82100000000 Namelen: 255 Type: xfs Block size: 4096 Fundamental block size: 4096 Blocks: Total: 78604800 Free: 78518242 Available: 78518242 Inodes: Total: 157286400 Free: 157286397 2025-03-20 08:33:52 stress-ng --timeout 60 --times --verify --metrics --no-rand-seed --fpunch 128 stress-ng: info: [4680] setting to a 1 min run per stressor stress-ng: info: [4680] dispatching hogs: 128 fpunch stress-ng: info: [4680] note: /proc/sys/kernel/sched_autogroup_enabled is 1 and this can impact scheduling throughput for processes not attached to a tty. Setting this to 0 may improve performance metrics stress-ng: warn: [4680] metrics-check: all bogo-op counters are zero, data may be incorrect stress-ng: metrc: [4680] stressor bogo ops real time usr time sys time bogo ops/s bogo ops/s CPU used per RSS Max stress-ng: metrc: [4680] (secs) (secs) (secs) (real time) (usr+sys time) instance (%) (KB) stress-ng: metrc: [4680] fpunch 0 557.92 0.40 19.56 0.00 0.00 0.03 3180 stress-ng: metrc: [4680] miscellaneous metrics: stress-ng: metrc: [4680] fpunch 2049.12 extents per file (geometric mean of 128 instances) stress-ng: info: [4680] for a 620.45s run time: stress-ng: info: [4680] 79418.05s available CPU time stress-ng: info: [4680] 0.40s user time ( 0.00%) stress-ng: info: [4680] 19.59s system time ( 0.02%) stress-ng: info: [4680] 19.99s total time ( 0.03%) stress-ng: info: [4680] load average: 250.69 349.62 213.80 stress-ng: info: [4680] skipped: 0 stress-ng: info: [4680] passed: 128: fpunch (128) stress-ng: info: [4680] failed: 0 stress-ng: info: [4680] metrics untrustworthy: 0 stress-ng: info: [4680] successful run completed in 10 mins, 20.45 secs we don't observe any abnormal output in dmesg. below is an example from parent commit. 2025-03-20 09:12:39 mkdir -p /mnt/stress-ng 2025-03-20 09:12:39 mount /dev/sdc1 /mnt/stress-ng 2025-03-20 09:12:39 cd /mnt/stress-ng File: "/mnt/stress-ng" ID: 82100000000 Namelen: 255 Type: xfs Block size: 4096 Fundamental block size: 4096 Blocks: Total: 78604800 Free: 78518242 Available: 78518242 Inodes: Total: 157286400 Free: 157286397 2025-03-20 09:12:39 stress-ng --timeout 60 --times --verify --metrics --no-rand-seed --fpunch 128 stress-ng: info: [4689] setting to a 1 min run per stressor stress-ng: info: [4689] dispatching hogs: 128 fpunch stress-ng: info: [4689] note: /proc/sys/kernel/sched_autogroup_enabled is 1 and this can impact scheduling throughput for processes not attached to a tty. Setting this to 0 may improve performance metrics stress-ng: metrc: [4689] stressor bogo ops real time usr time sys time bogo ops/s bogo ops/s CPU used per RSS Max stress-ng: metrc: [4689] (secs) (secs) (secs) (real time) (usr+sys time) instance (%) (KB) stress-ng: metrc: [4689] fpunch 1166 60.31 0.11 34.66 19.33 33.54 0.45 3164 stress-ng: metrc: [4689] miscellaneous metrics: stress-ng: metrc: [4689] fpunch 2051.97 extents per file (geometric mean of 128 instances) stress-ng: info: [4689] for a 60.91s run time: stress-ng: info: [4689] 7796.93s available CPU time stress-ng: info: [4689] 0.11s user time ( 0.00%) stress-ng: info: [4689] 34.68s system time ( 0.44%) stress-ng: info: [4689] 34.79s total time ( 0.45%) stress-ng: info: [4689] load average: 325.78 93.83 32.28 stress-ng: info: [4689] skipped: 0 stress-ng: info: [4689] passed: 128: fpunch (128) stress-ng: info: [4689] failed: 0 stress-ng: info: [4689] metrics untrustworthy: 0 stress-ng: info: [4689] successful run completed in 1 min from above, parent can finish run in 1 min, then has "bogo ops" and "bogo ops/s" for 245618f8e4, the test seems run much longer, and the results for "bogo ops" and "bogo ops/s" are all 0. The kernel config and materials to reproduce are available at: https://download.01.org/0day-ci/archive/20250522/202505221030.760980df-lkp@xxxxxxxxx -- 0-DAY CI Kernel Test Service https://github.com/intel/lkp-tests/wiki