prov/shm: Revert to a lock-unlock inject pool by zachdworkin · Pull Request #12109 · ofiwg/libfabric

zachdworkin · 2026-04-03T21:28:40Z

Remove the parallel command-inject resources and revert to using a lock-unlock inject buffer pool.

Update the inject protocol to use the old method.

There is a performance regression when using the "new shm" command-inject parallel data structure. This is due to the sender not being able to complete its transmission until the receiver returns the sender's command to the sender's return queue. In the old lock-unlock method the sender would allocate receiver side resources, copy its data into the receiver inject buffer and then complete. The old method allows MPIs and applications to assume that their inject message transmissions will complete quickly and since the new method does not complete as quickly it is likely the reason for this regression.

In addition we need to move the inject stack above the cmd stack in the shm region. This is because most of the time the new inject protocol will not need to use the cmd stack. If we swap the ordering in the region then we have to do less jumps through memory to get the resources we need.

We can also use the ofi freestack for the cmd stack because we only need to pop and push on the sender side so we don't need the extra functionality of the smr_freestack.

zachdworkin · 2026-04-04T14:27:13Z

bot:aws:retest

j-xiong · 2026-04-05T04:21:17Z

bot:aws:retest

zachdworkin · 2026-04-14T22:37:48Z

bot:aws:retest

zachdworkin · 2026-04-15T16:09:44Z

@sunkuamzn what is the AWS failure with this PR?

shijin-aws · 2026-04-15T16:45:36Z

@zachdworkin There is single node unexpected message test timeout


server_command: ssh -n -o StrictHostKeyChecking=no -o ConnectTimeout=30 -o BatchMode=yes 172.31.49.36 'timeout 1800 /bin/bash --login -c '"'"' FI_HMEM=system FI_LOG_LEVEL=warn /home/ec2-user/PortaFiducia/build/libraries/libfabric/pr12109-undebug/install/fabtests/bin/fi_unexpected_msg -e rdm -M 2048 -I 5 -f efa -v -S 512 -p efa -E=9234'"'"''

client_command: ssh -n -o StrictHostKeyChecking=no -o ConnectTimeout=30 -o BatchMode=yes 172.31.49.36 'timeout 1800 /bin/bash --login -c '"'"' FI_HMEM=system FI_LOG_LEVEL=warn /home/ec2-user/PortaFiducia/build/libraries/libfabric/pr12109-undebug/install/fabtests/bin/fi_unexpected_msg -e rdm -M 2048 -I 5 -f efa -v -S 512 -p efa -E=9234 172.31.49.36'"'"''
client_stdout:

client returncode: 124
server_stdout:

server returncode: 124

Place lighter protocol fields in the same cache-line grab as the atomic-queue cmd_entry grab. This way we if we are using a cpu without prefetcing algorithms (to grab the adjacent cacheline for us) we are optimizing the access of the fields we need for the lightweight/fast protocols. The heavier/slower protocols which use the second cache line fields will be unaffected by this change on older cpus since they already need to access both cache-lines. Signed-off-by: Zach Dworkin <zachary.dworkin@intel.com> Signed-off-by: Alexia Ingerson <alexia.ingerson@intel.com>

Remove the parallel command-inject resources and revert to using a lock-unlock inject buffer pool. Update the inject protocol to use the old method. There is a performance regression when using the "new shm" command-inject parallel data structure. This is due to the sender not being able to complete its transmission until the receiver returns the sender's command to the sender's return queue. In the old lock-unlock method the sender would allocate receiver side resources, copy its data into the receiver inject buffer and then complete. The old method allows MPIs and applications to assume that their inject message transmissions will complete quickly and since the new method does not complete `as` quickly it is likely the reason for this regression. This will also revert to the "old-shm" method of buffering all unexpected inject messages Signed-off-by: Zach Dworkin <zachary.dworkin@intel.com> Signed-off-by: Alexia Ingerson <alexia.ingerson@intel.com>

Replace hdr.status with hdr.smr_flags to indicate any error. This error will use the flag SMR_OP_ERROR for the sender to process its errors on return cmd. Signed-off-by: Zach Dworkin <zachary.dworkin@intel.com> Signed-off-by: Alexia Ingerson <alexia.ingerson@intel.com>

Command stack is less likely to be used in the inject protocol when resources are on the receiver side. If the inject pool is above it then we have to jump less, and do not have to jump over the command stack, when accessing it. Signed-off-by: Zach Dworkin <zachary.dworkin@intel.com>

SAR should never be handling 0 byte copies anymore since the inject protocol can handle delivery complete. Instead we will assert to make sure we aren't accidentally doing a 0-byte copy in SAR. Signed-off-by: Zach Dworkin <zachary.dworkin@intel.com>

Signed-off-by: Alexia Ingerson <alexia.ingerson@intel.com>

zachdworkin force-pushed the lock branch from 51f0da7 to 06a0834 Compare April 3, 2026 21:45

zachdworkin requested review from aingerson, j-xiong and shijin-aws April 3, 2026 22:06

j-xiong previously approved these changes Apr 5, 2026

View reviewed changes

aingerson added the ⚠️ Do not merge label Apr 6, 2026

zachdworkin dismissed j-xiong’s stale review via a9a2abf April 10, 2026 16:25

zachdworkin force-pushed the lock branch from 06a0834 to a9a2abf Compare April 10, 2026 16:25

zachdworkin marked this pull request as ready for review April 10, 2026 18:05

zachdworkin force-pushed the lock branch from a9a2abf to 145aa42 Compare April 14, 2026 15:11

zachdworkin force-pushed the lock branch 2 times, most recently from 6ab0fc8 to 7d18904 Compare May 1, 2026 19:25

zachdworkin mentioned this pull request May 1, 2026

prov/shm: inject protocol changes DO NOT MERGE for testing purposes only #12174

Closed

zachdworkin force-pushed the lock branch 3 times, most recently from fe058b6 to affab85 Compare May 4, 2026 22:22

yinliaws mentioned this pull request May 5, 2026

prov/shm: two performance fixes for the inject path zachdworkin/libfabric#1

Closed

zachdworkin force-pushed the lock branch 2 times, most recently from 915f9f0 to 30340a0 Compare May 5, 2026 15:24

zachdworkin added the prov/shm label May 5, 2026

zachdworkin force-pushed the lock branch from 30340a0 to 90205ba Compare May 7, 2026 13:53

zachdworkin added 2 commits May 7, 2026 09:04

zachdworkin force-pushed the lock branch from 90205ba to 17fd4ac Compare May 7, 2026 17:55

zachdworkin force-pushed the lock branch 2 times, most recently from bf4cfb0 to edd82eb Compare May 7, 2026 19:25

zachdworkin and others added 4 commits May 7, 2026 13:00

prov/shm: Remove 0-byte copy SAR

cd24355

SAR should never be handling 0 byte copies anymore since the inject protocol can handle delivery complete. Instead we will assert to make sure we aren't accidentally doing a 0-byte copy in SAR. Signed-off-by: Zach Dworkin <zachary.dworkin@intel.com>

prov/shm: preformat hdr before copying over to shm cmd

a4c5c60

Signed-off-by: Alexia Ingerson <alexia.ingerson@intel.com>

prov/shm: prefetch entire command header

f1014f7

Signed-off-by: Alexia Ingerson <alexia.ingerson@intel.com>

zachdworkin force-pushed the lock branch from edd82eb to f1014f7 Compare May 7, 2026 20:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

prov/shm: Revert to a lock-unlock inject pool#12109

prov/shm: Revert to a lock-unlock inject pool#12109
zachdworkin wants to merge 7 commits intoofiwg:mainfrom
zachdworkin:lock

zachdworkin commented Apr 3, 2026 •

edited

Loading

Uh oh!

zachdworkin commented Apr 4, 2026

Uh oh!

j-xiong commented Apr 5, 2026

Uh oh!

zachdworkin commented Apr 14, 2026

Uh oh!

zachdworkin commented Apr 15, 2026

Uh oh!

shijin-aws commented Apr 15, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

zachdworkin commented Apr 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zachdworkin commented Apr 4, 2026

Uh oh!

j-xiong commented Apr 5, 2026

Uh oh!

zachdworkin commented Apr 14, 2026

Uh oh!

zachdworkin commented Apr 15, 2026

Uh oh!

shijin-aws commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

zachdworkin commented Apr 3, 2026 •

edited

Loading

shijin-aws commented Apr 15, 2026 •

edited

Loading