On Tue, 13 May 2025 at 23:29, Joanne Koong <joannelkoong@xxxxxxxxx> wrote: > The results vary depending on how IO-intensive the server-side > processing logic is (eg ones that are not as intensive would show a > bigger relative performance speedup than ones where a lot of time is > spent on server-side processing). I can include the results from > benchmarks on our internal fuse server, which forwards the data in the > write buffer to a remote server over the network. For that, we saw > roughly a 5% improvement in throughput for 5 GB writes with 16 MB > chunk sizes, and a 2.45% improvement in throughput for 12 parallel > writes of 16 GB files with 64 MB chunk sizes. Okay, those are much saner numbers. Does the server use MSG_ZEROCOPY? Can you please include these numbers and the details on how the server takes advantage of splice in the patch header? Thanks, Miklos