Troubleshoot delivery of platform services messages (SMTT alarm)

Contributors netapp-perveilerk netapp-lhalbert

The Total Events (SMTT) alarm is triggered in the Grid Manager if a platform service message is delivered to a destination that cannot accept the data.

About this task

For example, an S3 multipart upload can succeed even though the associated replication or notification message cannot be delivered to the configured endpoint. Similarly, a CloudMirror replication message can fail to be delivered if the metadata is too long.

The SMTT alarm contains a Last Event message that reads "Failed to publish notifications for bucket-name object key", where bucket-name and object key identify the last object whose notification failed.

Event messages are also listed in the /var/local/log/bycast-err.log log file. See the Log files reference.
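If many notifications have failed, it can help to pull the affected buckets and keys out of the log rather than reading it by hand. The following is a minimal sketch, not an official tool. It assumes that each relevant log line contains the message text in the form "Failed to publish notifications for bucket-name object key", with the object key running to the end of the line. The function name find_failed_notifications is hypothetical, and the real log lines might carry additional fields that require adjusting the pattern.

```python
import re

# Assumed message shape, based on the Last Event text quoted above:
#   "Failed to publish notifications for <bucket-name> <object key>"
# Anything before the message (timestamps, severity) is ignored. The key is
# assumed to extend to the end of the line, which might not match real logs.
FAILED_RE = re.compile(r"Failed to publish notifications for (\S+) (.+?)\s*$")


def find_failed_notifications(lines):
    """Return unique (bucket, key) pairs in the order they first appear."""
    seen = []
    for line in lines:
        match = FAILED_RE.search(line)
        if match:
            pair = (match.group(1), match.group(2))
            if pair not in seen:
                seen.append(pair)
    return seen
```

One way to use it is to open /var/local/log/bycast-err.log, pass the file object to find_failed_notifications, and share the resulting list with the tenant.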

For additional information about troubleshooting platform services, see the instructions for administering StorageGRID. You might need to access the tenant from the Tenant Manager to debug a platform service error.

Steps
  1. To view the alarm, select NODES > site > grid node > Events.

  2. View Last Event at the top of the table.

    Event messages are also listed in /var/local/log/bycast-err.log.

  3. Follow the guidance provided in the SMTT alarm contents to correct the issue.

  4. Select Reset event counts.

  5. Notify the tenant of the objects whose platform services messages have not been delivered.

  6. Instruct the tenant to trigger the failed replication or notification by updating the object’s metadata or tags.
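For the final step, a tenant might update an object's tags programmatically. The sketch below is one possible approach and makes several assumptions: the tenant uses an S3 client that exposes get_object_tagging and put_object_tagging in the style of boto3, the client is already configured for the tenant's StorageGRID S3 endpoint, and the tag name retrigger-at is a hypothetical placeholder. Because put_object_tagging replaces the whole tag set, the sketch reads the existing tags first and writes them back along with the refreshed tag.

```python
import time

MAX_TAGS = 10  # S3 permits at most 10 tags per object


def touch_object_tag(client, bucket, key, tag_key="retrigger-at"):
    """Add or refresh one tag while preserving the object's other tags.

    `client` is assumed to behave like a boto3 S3 client; `tag_key` is a
    hypothetical tag name chosen only for illustration.
    """
    current = client.get_object_tagging(Bucket=bucket, Key=key).get("TagSet", [])
    # Drop any earlier copy of our tag so it is refreshed, not duplicated.
    tags = [t for t in current if t["Key"] != tag_key]
    if len(tags) >= MAX_TAGS:
        raise ValueError("object already has the maximum number of tags")
    tags.append({"Key": tag_key, "Value": str(int(time.time()))})
    # put_object_tagging replaces the entire tag set, so send all tags back.
    client.put_object_tagging(Bucket=bucket, Key=key, Tagging={"TagSet": tags})
    return tags
```

With boto3, the client could be created with boto3.client("s3", endpoint_url=...) using the tenant's own endpoint and credentials, and the function called once for each bucket and key that the grid administrator reported.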