This repository was archived by the owner on Aug 5, 2022. It is now read-only.

message size not multiple of 4 when sending extremely large messages #22

Description

@mattijsjanssens

We're occasionally seeing assert messages of the form

ips_proto.c:1646: (scb->payload_size & 0x3) == 0

which seem to originate from somewhere in the network stack (e.g. https://github.com/intel/psm/blob/master/ptl_ips/ips_proto.c) when the payload size is not a multiple of 4.

  • this only happens occasionally
  • and only for extremely large messages (not sure of the exact size, but it could be 100 MB or even gigabytes)
  • and only on Omni-Path

Is this a known problem? We don't pad our MPI messages to be a multiple of 4 bytes.
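For reference, the condition in the assert and the padding we would have to apply on our side can be sketched as below. This is only an illustration: `is_dword_multiple` and `padded_size` are hypothetical helper names, not part of PSM or MPI, and we have not confirmed that padding actually avoids the assert.

```c
#include <stddef.h>

/* Hypothetical helper mirroring the asserted condition
 * (scb->payload_size & 0x3) == 0: returns 1 when nbytes is a multiple of 4. */
static int is_dword_multiple(size_t nbytes)
{
    return (nbytes & 0x3) == 0;
}

/* Hypothetical helper: round a byte count up to the next multiple of 4.
 * Assumes nbytes is at least 3 below SIZE_MAX, so the addition cannot wrap. */
static size_t padded_size(size_t nbytes)
{
    return (nbytes + 3u) & ~(size_t)3u;
}
```

If padding turned out to help, the send buffer would have to be allocated with `padded_size(n)` bytes and the receiver would need to know the true length `n` separately.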
