Re: [PATCH] Documentation: locking: update libc support status of PI futexes

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 2025-05-23 08:00, Sebastian Andrzej Siewior wrote:
On 2025-01-11 10:55:55 [-0800], Alison Chaiken wrote:
> Are you sure? My memory is that glibc avoided using the internal mutex.
> The old problem should be gone and pthread_cond_signal() and
> pthread_cond_wait() should work.

Ignoring support for 64-bit time, the last substantive change to
pthread_cond_wait() and pthread_cond_signal() was Torvald Riegel's commit ed19993b5b0d05d62cc883571519a67dae481a14 "New condvar implementation that provides stronger ordering guarantees," which fixed problems with waking of ineligible futex waiters and with ABA issues concerning the futex word.
What the patch does not do is made clear by the commit message:

     This condvar doesn't yet use a requeue optimization (ie, on a
broadcast,
waking just one thread and requeueing all others on the futex of the
     mutex supplied by the program).

What futex-requeue-pi.rst directs is

In order to support PI-aware pthread_condvar's, the kernel needs to
     be able to requeue tasks to PI futexes.

Riegel and Darren Hart discussed Riegel's patch in at length at the 2016 RT
Summit:

https://wiki.linuxfoundation.org/realtime/events/rt-summit2016/schedule

The related glibc bug report by Darren may be found at

    https://sourceware.org/bugzilla/show_bug.cgi?id=11588

The last comment on the bug from 2017 is by Riegel:

So far, there is no known solution for how to achieve PI support given
the current constraints we have (eg, available futex operations, POSIX
requirements, ...).

I ran the bug reproducer posted by Darren in Qemu and found that it did not fail. I'm not sure if the result is valid given the peculiarities of Qemu,
or whether I made some other mistake.

I've been looking at this again for other reasons and looked at the
code again…

Back then we use futex-requeue API and required both futex-object to
have the PI bit set. This wasn't the case originally, hence the patch by
Darren which did not make it into the official libc.

With the rework by Riegel, the mutex within pthread's condvar
implementation is gone also the usage of the requeue API. The
pthread_cond_wait()/ pthread_cond_signal() API is back to use futex'
wait/ wake.
The glibc comments write something about important ordering constrains.
The futex wait enqueues the waiter according to its priority. So the
task with highest priority gets always a front seat. The futex wake
function wakes always the first waiter in the queue.

With all this I would say that the glib'c condvar implementation does
not have any issues since the rework.
There were a few loops, with a 0 retry counter (basically dead) and they
have been removed.

Sebastian

Thanks, Sebastian, for looking into this question.

Torvald Riegel's last patch to pthread_cond_wait.c:

$ git log -n 1 --author=riegel -- pthread_cond_wait.c
commit ed19993b5b0d05d62cc883571519a67dae481a14
Author: Torvald Riegel <triegel@xxxxxxxxxx>
Date:   Wed May 25 23:43:36 2016 +0200
New condvar implementation that provides stronger ordering guarantees.

Speaking of ordering, the 2016 Linux Realtime Summit happened after, on 11 October. Torvald and Darren co-presented a talk about condition variables:

https://wiki.linuxfoundation.org/realtime/events/rt-summit2016/pthread-condvars

Torvald in his half of the talk discusses the POSIX requirement which necessitated a change to condvars and his redesign. In the video at 30:50,

---

Zijlstra: Even for FIFO, in the previous slides, S2 will only wake W2, because W3 was not yet eligible, but W3 might be the highest-priority waiter. Strictly speaking, W3 was eligible at S2. [p. 9, W3 was in G2, not G1, but "happened before" S2]. At S2, the only possible wakeup was W2,
even thought W3 might be the highest-priority waiter.

Hart: Correct.  Not in this scheme.

Zijstra: Sequence-wise, it's correct,

Hart: it's mathematically correct.

Zijlstra: But it's not the one we want to wake according to PI rules.

Hart: Yep.

Zijlstra: This scheme does not permit us doing so.

Hart: Noted.

---

Darren and Torvald agree that glibc cannot make pthread condvars PI-aware without breaking ABI. Am I missing something?

Thanks,
Alison

---
Alison Chaiken                   alison@xxxxxxxxxxxxx
https://she-devel.com
"What respite from her thrilling toil did Beauty ever take — But Work might be Electric Rest To those that Magic make" -- Emily Dickinson




[Index of Archives]     [RT Stable]     [Kernel Newbies]     [IDE]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Linux ATA RAID]     [Samba]     [Video 4 Linux]     [Device Mapper]

  Powered by Linux