Re: Forcing initial Frame Transmission for dmabuf encoding on SPICE display channel connection

On Thu, Apr 10, 2025 at 3:18 PM Michael Scherle <michael.scherle@xxxxxxxxxxxxxxxxxx> wrote:
Hello,

I’ve encountered an issue with the new DMA-BUF -> video encoding feature
in SPICE. When connecting, the first frame is only sent once the GPU
renders a new frame. However, this can take quite some time if the VM is
idle (e.g., sitting on the desktop), since the GPU only renders a new
frame when something on the screen changes. To address this, I wanted to
force a frame to be sent when the display channel is connected.


Which makes sense.
 
My initial, naive attempt was to grab the latest DMA-BUF on the display
channel's connection in the SPICE server, encode it, and send it.
However, this led to race conditions and crashes—particularly when QEMU
happened to perform a scanout at the same time, closing the DMA-BUF in
the process.

By "closing" do you mean calling the close() function? No, we should have ownership of it.
What exact race did you encounter?
 
As a second approach, I modified the QXLInterface to pass the display
channel's on_connect event back to QEMU. I couldn’t find any existing
mechanism in QEMU to detect the connection of a display channel. Within
QEMU, I then used qemu_spice_gl_monitor_config and spice_gl_refresh to
trigger a spice_gl_draw. This solution works, but the downside is that
it requires changes to SPICE, QEMU, and especially the QXLInterface,
which is obviously not ideal.
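
Roughly, the interface change was along these lines (a sketch only; the
callback name is illustrative, not an existing SPICE API):

    /* Hypothetical addition to QXLInterface in spice-server's
     * spice-qxl.h, letting the server notify QEMU when a display
     * channel connects. */
    struct QXLInterface {
        SpiceBaseInterface base;

        /* ... existing callbacks elided ... */

        /* New, illustrative callback: invoked by the server when a
         * client connects to the display channel, so QEMU can re-issue
         * the current scanout and trigger a draw of the last frame. */
        void (*client_connected)(QXLInstance *qin);
    };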

"Not ideal" is putting it kindly. I would say it's complicated, hard to maintain, and adds too much coupling.

So now I’m wondering: does anyone have a better idea for how to tackle
this problem?

I would define "the problem" first; so far you have mentioned a race condition without describing the details of the race.
 
Best regards,
Michael

I suspect the race lies more in the current implementation of the interface. Indeed, that interface does not fit entirely into the Spice server model.

Externally there are two functions, spice_qxl_gl_scanout and spice_qxl_gl_draw_async; the async_complete callback is used to tell Qemu when we are finished with the scanout. So spice_qxl_gl_scanout sets the scanout (or frame, if you prefer), while spice_qxl_gl_draw_async tells Spice to use the scanout until async_complete is called (which should happen in a timely fashion; I think the Qemu timeout is 1 second). In theory the scanout can be reused for multiple draws (which has never been the case, but that's another story). In theory a partial draw of the scanout can be requested. In theory the scanout should not be used after async_complete is called, as Qemu could reuse it for the next drawings.

That last point is a bit of a problem here and, to be honest, something I consider an issue in the definition of the external interface. In hardware you set the framebuffer and the video card will continue to use it no matter what; the computer can freeze or panic and the video card will keep displaying the same frame over and over. Also, considering that the worst that can happen is a partial draw that will be fixed by a later one, I think it's correct to use the last scanout to solve your initial problem.
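
To make the intended lifecycle concrete, here is a minimal sketch of the
sequence from the Qemu side (assuming a registered QXLInstance, a dmabuf
fd for the current frame, and an ARGB format; error handling omitted):

    #include <stdint.h>
    #include <spice.h>        /* QXLInstance, spice_qxl_gl_* */
    #include <drm_fourcc.h>   /* DRM_FORMAT_ARGB8888, from libdrm */

    /* Illustrative only: push one frame through the GL scanout path. */
    static void send_one_frame(QXLInstance *qxl, int dmabuf_fd,
                               uint32_t width, uint32_t height,
                               uint32_t stride, uint64_t cookie)
    {
        /* 1. Set the scanout; the server keeps it until it is replaced. */
        spice_qxl_gl_scanout(qxl, dmabuf_fd, width, height, stride,
                             DRM_FORMAT_ARGB8888, 0 /* y_0_top */);

        /* 2. Ask the server to draw the whole scanout. Completion is
         * reported through the QXLInterface async_complete callback
         * with this cookie; Qemu times out after about 1 second. */
        spice_qxl_gl_draw_async(qxl, 0, 0, width, height, cookie);
    }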

Internally, the Spice server stores the scanout in the RedQxl thread (the Qemu I/O one) but uses it in the RedWorker thread. This is pretty uncommon; usually data is passed from one thread to the other, ownership included. This probably leads to the race you are facing. If that's the issue, I think the best option really is to fix that race.
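
If so, the usual fix is to hand the scanout from one thread to the other
as a message whose ownership moves with it. A generic sketch of that
pattern with pthreads (not the actual RedQxl/RedWorker code; a real
version would also manage the dmabuf fd lifetime):

    #include <pthread.h>
    #include <stdint.h>
    #include <stdlib.h>
    #include <string.h>

    /* Transfer ownership of the scanout between threads as a
     * heap-allocated message, instead of sharing one mutable struct
     * (the suspected race). Names are illustrative, not SPICE code. */
    typedef struct ScanoutMsg {
        int      dmabuf_fd;
        uint32_t width, height, stride, format;
    } ScanoutMsg;

    static pthread_mutex_t slot_lock = PTHREAD_MUTEX_INITIALIZER;
    static ScanoutMsg *slot;   /* single-slot mailbox */

    /* I/O thread: publish a copy; after this the worker owns it. */
    void post_scanout(const ScanoutMsg *src)
    {
        ScanoutMsg *msg = malloc(sizeof(*msg));
        if (msg == NULL)
            return;
        memcpy(msg, src, sizeof(*msg));
        pthread_mutex_lock(&slot_lock);
        free(slot);            /* drop an unconsumed older frame */
        slot = msg;
        pthread_mutex_unlock(&slot_lock);
    }

    /* Worker thread: take ownership; the caller frees the result. */
    ScanoutMsg *take_scanout(void)
    {
        pthread_mutex_lock(&slot_lock);
        ScanoutMsg *msg = slot;
        slot = NULL;
        pthread_mutex_unlock(&slot_lock);
        return msg;
    }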

Regards,
  Frediano
 
