Troubleshooting delivery of platform services messages (SMTT alarm)

The Total Events (SMTT) alarm is triggered in the Grid Manager if a platform service message is delivered to an destination that cannot accept the data.

About this task

For example, an S3 multipart upload can succeed even though the associated replication or notification message cannot be delivered to the configured endpoint. Or, a message for CloudMirror replication can fail to be delivered if the metadata is too long.

The SMTT alarm contains a Last Event message that says, Failed to publish notifications for bucket-name object key for the last object whose notification failed.

For additional information about troubleshooting platform services, see the instructions for administering StorageGRID. You might need to access the tenant from the Tenant Manager to debug a platform service error.

Procedure

  1. To view the alarm, select Nodes > site > grid node > Events.
  2. View Last Event at the top of the table.
    Event messages are also listed in /var/local/log/bycast-err.log.
  3. Follow the guidance provided in the SMTT alarm contents to correct the issue.
  4. Click Reset event counts.
  5. Notify the tenant of the objects whose platform services messages have not been delivered.
  6. Instruct the tenant to trigger the failed replication or notification by updating the object's metadata or tags.