Hi Ulf, On Thu, 14 Aug 2025 at 17:50, Ulf Hansson <ulf.hansson@xxxxxxxxxx> wrote: > On Wed, 13 Aug 2025 at 13:58, Geert Uytterhoeven <geert@xxxxxxxxxxxxxx> wrote: > > On Tue, 12 Aug 2025 at 12:01, Ulf Hansson <ulf.hansson@xxxxxxxxxx> wrote: > > > On Thu, 7 Aug 2025 at 11:38, Geert Uytterhoeven <geert@xxxxxxxxxxxxxx> wrote: > > > > On Wed, 30 Jul 2025 at 12:29, Ulf Hansson <ulf.hansson@xxxxxxxxxx> wrote: > > > > > On Wed, 30 Jul 2025 at 11:56, Geert Uytterhoeven <geert@xxxxxxxxxxxxxx> wrote: > > > > > > On Wed, 9 Jul 2025 at 13:31, Ulf Hansson <ulf.hansson@xxxxxxxxxx> wrote: > > > > > > > On Tue, 1 Jul 2025 at 13:47, Ulf Hansson <ulf.hansson@xxxxxxxxxx> wrote: > > > > > > > > Changes in v3: > > > > > > > > - Added a couple of patches to adress problems on some Renesas > > > > > > > > platforms. Thanks Geert and Tomi for helping out! > > > > > > > > - Adressed a few comments from Saravanna and Konrad. > > > > > > > > - Added some tested-by tags. > > > > > > > > > > > > > > I decided it was time to give this a try, so I have queued this up for > > > > > > > v6.17 via the next branch at my pmdomain tree. > > > > > > > > > > > > > > If you encounter any issues, please let me know so I can help to fix them. > > > > > > > > > > > > Thanks for your series! Due to holidays, I only managed to test > > > > > > this very recently. > > > > > > > > > > > > Unfortunately I have an issue with unused PM Domains no longer being > > > > > > disabled on R-Car: > > > > > > - On R-Car Gen1/2/3, using rcar-sysc.c, unused PM Domains are never > > > > > > disabled. > > > > > > - On R-Car Gen4, using rcar-gen4-sysc.c, unused PM Domains are > > > > > > sometimes not disabled. > > > > > > At first, I noticed the IOMMU driver was not enabled in my config, > > > > > > and enabling it did fix the issue. However, after that I still > > > > > > encountered the issue in a different config that does have the > > > > > > IOMMU driver enabled... > > > > > > > > > > > > FTR, unused PM Domains are still disabled correctly on R/SH-Mobile > > > > > > (using rmobile-sysc.c) and on BeagleBone Black. Note that these use > > > > > > of_genpd_add_provider_simple(), while all R-Car drivers use > > > > > > of_genpd_add_provider_onecell(). Perhaps there is an issue with > > > > > > the latter? If you don't have a clue, I plan to do some more > > > > > > investigation later... > > > > > > > > of_genpd_add_provider_onecell() has: > > > > > > > > if (!dev) > > > > sync_state = true; > > > > else > > > > dev_set_drv_sync_state(dev, genpd_sync_state); > > > > > > > > for (i = 0; i < data->num_domains; i++) { > > > > ... > > > > if (sync_state && !genpd_is_no_sync_state(genpd)) { > > > > genpd->sync_state = GENPD_SYNC_STATE_ONECELL; > > > > device_set_node(&genpd->dev, fwnode); > > > > sync_state = false; > > > > ^^^^^^^^^^^^^^^^^^^ > > > > } > > > > ... > > > > } > > > > > > > > As the R-Car SYSC drivers are not platform drivers, dev is NULL, and > > > > genpd->sync_state is set to GENPD_SYNC_STATE_ONECELL for the first PM > > > > Domain only. All other domains have the default value of sync_state > > > > (0 = GENPD_SYNC_STATE_OFF). Hence when genpd_provider_sync_state() > > > > is called later, it ignores all but the first domain. > > > > Apparently this is intentional, as of_genpd_sync_state() tries to > > > > power off all domains handled by the same controller anyway (see below)? > > > > > > Right, this is intentional and mainly because of how fw_devlink works. > > > > > > fw_devlink is limited to use only the first device - if multiple > > > devices share the same fwnode. In principle, we could have picked any > > > of the devices in the array of genpds here - and reached the same > > > result. > > > > OK, just like I already assumed... > > > > > > > > BTW, the "pending due to"-messages look weird to me. > > > > > > On R-Car M2-W (r8a7791.dtsi) I see e.g.: > > > > > > > > > > > > genpd_provider ca15-cpu0: sync_state() pending due to e6020000.watchdog > > > > > > renesas-cpg-mssr e6150000.clock-controller: sync_state() pending > > > > > > due to e6020000.watchdog > > > > > > > > > > > > ca15-cpu0 is the PM Domain holding the first CPU core, while > > > > > > the watchdog resides in the always-on Clock Domain, and uses the > > > > > > clock-controller for PM_CLK handling. > > > > > > > > Unfortunately the first PM Domain is "ca15-cpu0", which is blocked on > > > > these bogus pending states, and no PM Domain is powered off. > > > > > > I see, thanks for the details. I am looking closer at this. > > > > > > In any case, this is the main issue, as it prevents the ->sync_state() > > > callback to be called. Hence the "genpd->stay_on" will also *not* be > > > cleared for any of the genpd's for the genpd-provider. > > > > I was under the impression there is a time-out, after which the > > .sync_state() callback would be called anyway, just like for probe > > deferral due to missing optional providers like DMACs and IOMMUs. > > Apparently that is not the case? > > The behaviour is configurable, so it depends. The current default > behaviour does *not* enforce the ->sync_state() callbacks to be > called, even after a time-out. > > You may set CONFIG_FW_DEVLINK_SYNC_STATE_TIMEOUT to achieve the above > behavior or use the fw_devlink command line parameters to change it. > Like setting "fw_devlink.sync_state=timeout". > > I guess it can be debated what the default behaviour should be. > Perhaps we should even allow the default behaviour to be dynamically > tweaked on a per provider device/driver basis? The domains are indeed powered off like before when passing "fw_devlink.sync_state=timeout", so that fixes the regression. But it was not needed before... I could add "select FW_DEVLINK_SYNC_STATE_TIMEOUT" to the SYSC_RCAR and SYSC_RCAR_GEN4 Kconfig options, but that would play badly with multi-platform kernels. As the fw_devlink_sync_state flag is static, the R-Car SYSC drivers can't just auto-enable the flag at runtime. Any other options? Perhaps a device-specific flag to be set by the PM Domain driver, and to be checked by fw_devlink_dev_sync_state()? > > > > If I remove the "sync_state = false" above, genpd_provider_sync_state() > > > > considers all domains, and does power down all unused domains (even > > > > multiple times, as expected). > > > > > > I think those are getting called because with the change above, there > > > is no device_link being tracked. As stated above, fw_devlink is > > > limited to use only one device - if multiple devices share the same > > > fwnode. > > > > Indeed. > > > > > In other words, the ->sync_state() callbacks are called even if the > > > corresponding consumer devices have not been probed yet. > > > > Hence shouldn't there be a timeout, as the kernel may not even have > > a driver for one or more consumer devices? > > See above. > > > > > Upon closer look, all "pending due to" messages I see claim that the > > > > first (index 0) PM Domain is pending on some devices, while all of > > > > these devices are part of a different domain (usually the always-on > > > > domain, which is always the last (32 or 64) on R-Car). > > > > > > > > So I think there are two issues: > > > > 1. Devices are not attributed to the correct PM Domain using > > > > fw_devlink sync_state, > > > > 2. One PM Domain of a multi-domain controller being blocked should > > > > not block all other domains handled by the same controller. > > > > > > Right, that's a current limitation with fw_devlink. To cope with this, > > > it's possible to enforce the ->sync_state() callback to be invoked > > > from user-space (timeout or explicitly) for a device. That needs explicit handling, which was not needed before. Perhaps the fw_devlink creation should be removed again from of_genpd_add_provider_onecell(), as it is not correct, except for the first domain? > > > Another option would be to allow an opt-out behavior for some genpd's > > > that are powered-on at initialization. Something along the lines of > > > the below. > > > > > > From: Ulf Hansson <ulf.hansson@xxxxxxxxxx> > > > Date: Tue, 29 Jul 2025 14:27:22 +0200 > > > Subject: [PATCH] pmdomain: core: Allow powered-on PM domains to be powered-off > > > during boot > > > > [...] > > > > I gave this a try (i.e. "| GENPD_FLAG_NO_STAY_ON" in rcar-sysc.c), but > > this doesn't make any difference. I assume this would only work when > > actively calling genpd_power_off() (i.e. not from of_genpd_sync_state() > > or genpd_provider_sync_state())? > > Right. Thanks for testing! > > So, we may need to restore some part of the genpd_power_off_unused() > when CONFIG_PM_GENERIC_DOMAINS_OF is set. Without clearing > "genpd->stay_on". > > I can extend the patch, if you think it would make sense for you? I would applaud anything that would fix these regressions. Thanks! Gr{oetje,eeting}s, Geert -- Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- geert@xxxxxxxxxxxxxx In personal conversations with technical people, I call myself a hacker. But when I'm talking to journalists I just say "programmer" or something like that. -- Linus Torvalds