Re: [PATCH] nfsrahead: modify get_device_info logic

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



The success is detected in the loop in get_device_info, in the
stopping clause: if the call to get_mountinfo succeeds, the loop
stops, and the function returns success. If the call fails for the
number of retries plus one, the last error is returned, detected in
the caller, which logs an error.

The 50000 us is arbitrary. We need to give the kernel some time to
update mountinfo, and 50 ms is a value that works well, reducing the
number of attempts to 2 to succeed. The current implementation tries
for 50 times and fails because it executes too fast. Using a larger
value doesn't improve the success rate, and a smaller value increases
the number of retries (at least on my PC).

-- Thiago


On Mon, Aug 18, 2025 at 10:03 AM Steve Dickson <steved@xxxxxxxxxx> wrote:
>
> Hello,
>
> On 8/12/25 5:38 PM, tbecker@xxxxxxxxxx wrote:
> > From: Thiago Becker <tbecker@xxxxxxxxxx>
> >
> > There are a few reports of failures by nfsrahead to set the read ahead
> > when the nfs mount information is not available when the udev event
> > fires. This was alleviated by retrying to read mountinfo multiple times,
> > but some cases where still failing to find the device information. To
> > further alleviate this issue, this patch adds a 50ms delay between
> > attempts. To not incur into unecessary delays, the logic in
> > get_device_info is reworked.
> >
> > While we are in this, remove a second loop of reptitions of
> > get_device_info.
> >
> > Signed-off-by: Thiago Becker <tbecker@xxxxxxxxxx>
> > ---
> >   tools/nfsrahead/main.c | 14 ++++++--------
> >   1 file changed, 6 insertions(+), 8 deletions(-)
> >
> > diff --git a/tools/nfsrahead/main.c b/tools/nfsrahead/main.c
> > index 8a11cf1a..b7b889ff 100644
> > --- a/tools/nfsrahead/main.c
> > +++ b/tools/nfsrahead/main.c
> > @@ -117,9 +117,11 @@ out_free_device_info:
> >
> >   static int get_device_info(const char *device_number, struct device_info *device_info)
> >   {
> > -     int ret = ENOENT;
> > -     for (int retry_count = 0; retry_count < 10 && ret != 0; retry_count++)
> > +     int ret = get_mountinfo(device_number, device_info, MOUNTINFO_PATH);
> > +     for (int retry_count = 0; retry_count < 5 && ret != 0; retry_count++) {
> > +             usleep(50000);
> >               ret = get_mountinfo(device_number, device_info, MOUNTINFO_PATH);
> > +     }
> get_mountinfo() will error out with an errno value (ret != 0) so
> how is how are error detected? What am I missing...
>
> Also why was a value of 50000 used to sleep?
>
> steved.
>
> >
> >       return ret;
> >   }
> > @@ -135,7 +137,7 @@ static int conf_get_readahead(const char *kind) {
> >
> >   int main(int argc, char **argv)
> >   {
> > -     int ret = 0, retry, opt;
> > +     int ret = 0, opt;
> >       struct device_info device;
> >       unsigned int readahead = 128, log_level, log_stderr = 0;
> >
> > @@ -163,11 +165,7 @@ int main(int argc, char **argv)
> >       if ((argc - optind) != 1)
> >               xlog_err("expected the device number of a BDI; is udev ok?");
> >
> > -     for (retry = 0; retry <= 10; retry++ )
> > -             if ((ret = get_device_info(argv[optind], &device)) == 0)
> > -                     break;
> > -
> > -     if (ret != 0 || device.fstype == NULL) {
> > +     if ((ret = get_device_info(argv[optind], &device)) != 0 || device.fstype == NULL) {
> >               xlog(D_GENERAL, "unable to find device %s\n", argv[optind]);
> >               goto out;
> >       }
>


-- 
Thiago Rafael Becker
Software Maintenance Engineer
redhat.com






[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux