Use the Overview dashboard in EDA workloads
As an IT administrator managing EDA workloads across multiple FSx for ONTAP file systems (ONTAP clusters), you can use the Overview dashboard to quickly assess cluster health and usage. Use it to decide where to place new volumes or jobs, identify candidates for moving volumes or SVMs, and determine when to scale capacity or throughput.
Overview
The Overview dashboard provides a centralized view of your FSx for ONTAP clusters, including capacity and throughput. Use it to place new volumes, rebalance workloads, and plan capacity or throughput scaling.
The dashboard includes:
- Cluster health status: A summary at the top of the dashboard that highlights latency events, SSD utilization and capacity recommendations, and ONTAP EMS events across your file systems.
- Clusters table: A detailed, searchable table showing usage and performance metrics for each cluster, with support for filtering, sorting, pagination, and CSV export.
Requirements
Before using the dashboard, ensure you meet the following requirements:
- AWS credentials with view permissions: You must configure AWS credentials in Workload Factory with at least read (view) permissions for General Storage. Basic credentials are not supported. If you haven't configured credentials with view permissions, you are redirected to the AWS credentials setup page when you open the Overview tab. If you haven't configured AWS credentials, see Add AWS credentials.
- Activate the dashboard: After Workload Factory confirms credentials with view permissions, you must activate the dashboard to begin collecting CloudWatch metrics for your FSx for ONTAP file systems.
Note: Metrics collection can take some time after you provide consent. The dashboard notifies you while the initial collection is in progress.
To activate the dashboard:
- Log in using one of the console experiences.
- Select the menu and then select EDA.
- Select the Overview tab.
- If no credentials with view permissions are detected, select Add credentials and follow the prompts to configure AWS credentials with view permissions. Then return to the Overview tab.
- Review the consent prompt describing the CloudWatch metrics that will be collected for your FSx for ONTAP file systems.
- Select Activate to activate the dashboard and begin metrics collection.
Workload Factory begins collecting CloudWatch metrics for all FSx for ONTAP file systems associated with your configured AWS credentials. The dashboard populates as metrics become available. A notification is displayed if collection is still in progress.
Filter the dashboard
Use the filters at the top of the dashboard to focus on specific file systems. These filters apply to the latency, utilization, and ONTAP events sections and to the clusters table.
Available filters:
- Region: Filter by one or more AWS regions.
- AWS account: Filter by one or more AWS accounts associated with your configured credentials.
When you update the filter selections, all information is refreshed to show only the matching file systems.
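The filter behavior can be illustrated with a minimal Python sketch. It assumes that an empty selection means "no restriction", that multiple values within one filter are alternatives, and that the two filters combine with AND; the source does not spell out these semantics. The `FileSystem` record, the `apply_filters` helper, and the sample names and account IDs are hypothetical and are not part of Workload Factory.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FileSystem:
    # Hypothetical record; the fields mirror the dashboard's filter dimensions.
    name: str
    region: str
    account: str


def apply_filters(file_systems, regions=(), accounts=()):
    """Return file systems that match the selected regions AND accounts.

    An empty selection is treated as "no restriction" (an assumption).
    """
    return [
        fs
        for fs in file_systems
        if (not regions or fs.region in regions)
        and (not accounts or fs.account in accounts)
    ]


inventory = [
    FileSystem("eda-prod-1", "us-east-1", "111111111111"),
    FileSystem("eda-prod-2", "us-west-2", "111111111111"),
    FileSystem("eda-dev-1", "us-east-1", "222222222222"),
]

matching = apply_filters(
    inventory, regions={"us-east-1"}, accounts={"111111111111"}
)
print([fs.name for fs in matching])  # ['eda-prod-1']
```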
Cluster health status
The top of the dashboard displays a snapshot of health and activity across your filtered file systems. This information is shown only when at least one FSx for ONTAP link is associated with your file systems. If no links are available, the information is hidden.
- Latency: Displays the number of latency events detected across the file systems in scope.
- Utilization: Displays SSD utilization status and identifies file systems with active capacity recommendations.
- ONTAP events: Displays the number of EMS events detected, categorized by Capacity, Availability & protection, and Security & other.
Latency
Displays the number of latency events detected across the file systems in scope.
- Select Review to open the Latency tab.
- You can view latency information only if you have enabled latency monitoring. If you have not configured latency thresholds, select Configure. For details on latency monitoring, see FSx latency analysis.
Utilization
Displays the number of file systems in scope that have at least one cluster with SSD usage above 80%, and the number of file systems with active capacity recommendations. This helps you quickly identify file systems that might require capacity attention.
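As a rough illustration of the two counts described above, the following Python sketch flags file systems whose SSD usage is above 80% and lists those with an active recommendation. It is a simplification under stated assumptions: usage is evaluated once per file system, and the field names (`ssd_used_gib`, `ssd_total_gib`, `has_recommendation`) and sample values are hypothetical.

```python
SSD_USAGE_THRESHOLD_PCT = 80.0  # threshold stated in the Utilization description


def ssd_usage_pct(used_gib, total_gib):
    """SSD usage as a percentage of provisioned SSD capacity."""
    return 100.0 * used_gib / total_gib


def summarize_utilization(file_systems):
    """List file systems above the threshold and those with recommendations."""
    high = [
        fs["name"]
        for fs in file_systems
        if ssd_usage_pct(fs["ssd_used_gib"], fs["ssd_total_gib"])
        > SSD_USAGE_THRESHOLD_PCT
    ]
    recommended = [fs["name"] for fs in file_systems if fs["has_recommendation"]]
    return {"high_ssd_usage": high, "with_recommendations": recommended}


# Hypothetical sample data.
file_systems = [
    {"name": "fs-a", "ssd_used_gib": 900, "ssd_total_gib": 1024, "has_recommendation": True},
    {"name": "fs-b", "ssd_used_gib": 512, "ssd_total_gib": 1024, "has_recommendation": False},
    {"name": "fs-c", "ssd_used_gib": 1600, "ssd_total_gib": 2000, "has_recommendation": True},
]

summary = summarize_utilization(file_systems)
print(summary)
```

In this sample, fs-c sits at exactly 80% and is not counted as high usage, because the comparison is strictly "above 80%".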
Capacity recommendations
Workload Factory automatically runs a capacity recommendation algorithm for each FSx for ONTAP file system visible in your EDA inventory. The algorithm scans once every 24 hours and identifies when SSD capacity adjustments are recommended.
When a recommendation is identified:
- You receive an immediate notification (email or WAD), based on your Workload Factory notification settings. Notifications are sent as soon as a recommendation is identified, rather than waiting for the weekly summary.
- A lightbulb indicator appears in the Clusters table row for any file system with an active recommendation.
- The total number of file systems with active recommendations is displayed. This ensures recommendations are visible even if the affected file system is not on the first page of the table.
View and apply capacity recommendations
- Log in using one of the console experiences.
- Select the menu and then select EDA.
- Select the Overview tab.
- In the Clusters table, locate a file system with a lightbulb indicator.
  Hover over the lightbulb indicator to see a tooltip with a brief description of the recommendation.
- Select the cluster name in the table to open it and view the recommendations.
- Review the SSD recommendation and the capacity graph.
  The recommendation explains the suggested change and the reasoning behind it. For example: We recommend increasing the SSD size based on your file system SSD usage pattern.
  The graph presents current SSD usage alongside historical trends and shows you how the capacity recommendation algorithm would have adjusted capacity over time.
- On the top right of the graph, select the time range to change the period displayed. The default is one week.
ONTAP events
Displays the number of EMS events detected across the file systems in scope, categorized by Capacity, Availability & protection, and Security & other.
Capacity
Displays the number of EMS messages related to capacity issues, along with the number of file systems affected.
Examples of EMS events monitored include:
- Aggregate nearly full/full
- Volume nearly full/full
- Snapshot reserve nearly full/full
- Directory size full
- FlexGroup full
- Inodes full
For a full list of the EMS events monitored, see Capacity events.
Selecting Capacity navigates to the capacity analysis screen in Storage workloads.
Availability & protection
Displays the number of EMS messages related to availability and data protection, along with the number of file systems affected.
Events monitored include FlexCache and SnapMirror-related EMS events.
Selecting Availability & protection navigates to the availability and protection analysis screen in Storage workloads.
Security & other
Displays the number of EMS messages for events not categorized under Capacity or Availability & protection, along with the number of file systems affected. Events monitored include anti-ransomware protection, NFS authentication failures, and others.
Selecting Security & other navigates to the event analysis screen in Storage workloads.
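Each category reports two numbers: how many EMS messages were seen and how many file systems they affect. The following Python sketch shows one way to derive both from a list of already-categorized events. The `(category, file_system)` tuples and the `summarize_events` helper are hypothetical; how Workload Factory maps individual EMS events to categories is not shown here.

```python
from collections import defaultdict

CATEGORIES = ("Capacity", "Availability & protection", "Security & other")


def summarize_events(events):
    """Return {category: (message_count, affected_file_system_count)}."""
    counts = {category: 0 for category in CATEGORIES}
    affected = defaultdict(set)
    for category, file_system in events:
        counts[category] += 1
        affected[category].add(file_system)  # a set de-duplicates file systems
    return {c: (counts[c], len(affected[c])) for c in CATEGORIES}


# Hypothetical events, each already assigned to a category.
events = [
    ("Capacity", "fs-a"),
    ("Capacity", "fs-a"),
    ("Capacity", "fs-b"),
    ("Availability & protection", "fs-b"),
    ("Security & other", "fs-c"),
]

event_summary = summarize_events(events)
print(event_summary["Capacity"])  # (3, 2): three messages on two file systems
```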
Clusters table
Provides a detailed view of each FSx for ONTAP file system associated with your configured AWS credentials, filtered by the active region and AWS account selections. Data is collected from CloudWatch metrics. The table supports search, column filtering, pagination, column customization, and CSV export.
Key metrics include:
- Name, region, and AWS account
- SSD capacity metrics (used, total, and usage percentage)
- Capacity pool storage
- Throughput metrics (average, P95, P99, and max over the last 30 days)
Use SSD usage to identify file systems approaching capacity limits. Use Throughput usage (P99) to compare throughput demand to the provisioned throughput SKU. Hover over throughput column headers for calculation details.
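To make the throughput columns concrete, the following Python sketch computes average, P95, P99, and max from a series of throughput samples and expresses P99 as a share of provisioned throughput. It uses the nearest-rank percentile definition as an assumption; the dashboard's actual calculation is described in the column header tooltips and may differ. The function names, the sample series, and the 128 MBps provisioned value are illustrative.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: smallest value with at least pct% of samples at or below it."""
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def throughput_summary(samples_mbps, provisioned_mbps):
    """Summarize throughput samples and compare P99 demand to provisioned throughput."""
    p99 = percentile(samples_mbps, 99)
    return {
        "average": sum(samples_mbps) / len(samples_mbps),
        "p95": percentile(samples_mbps, 95),
        "p99": p99,
        "max": max(samples_mbps),
        "p99_usage_pct": 100.0 * p99 / provisioned_mbps,
    }


# Hypothetical samples: 100 readings from 1 to 100 MBps.
samples = list(range(1, 101))
stats = throughput_summary(samples, provisioned_mbps=128)
print(stats)
```

For this sample series, P99 is 99 MBps, which is about 77% of the assumed 128 MBps provisioned throughput. A P99 value close to or above 100% would suggest that demand regularly approaches the provisioned limit.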
Search and filter
- Use the search to look up specific clusters by name, file system ID, region, or other attributes.
- Select any column header to sort the table by that column.
- Use per-column filter controls to narrow results within the table.
- The table is paginated. Use the pagination controls at the bottom of the table to navigate between pages.
Note: File systems with active capacity recommendations display a lightbulb indicator in the table. Select any file system name to view SSD capacity metrics, historical usage trends, and capacity recommendations for that file system.
Customize columns
To add or remove columns from the clusters table:
- Select the column selector icon above the table on the right.
- Select or deselect the columns you want to show or hide.
- Select Apply.
Export table to CSV
You can export the currently displayed table data to a CSV file for further analysis or reporting.
- Apply any filters or column customizations you want reflected in the export.
- Select the export button above the clusters table.
The CSV file is downloaded. It contains all rows currently visible in the table and only the columns currently shown.
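Once exported, the CSV can be analyzed with standard tooling. The following Python sketch reads an export and lists rows whose SSD usage exceeds a threshold. The column headers (`Name`, `Region`, `SSD usage (%)`) and the sample rows are assumptions for illustration; check the header row of your own export and adjust the column name accordingly.

```python
import csv
import io

# Hypothetical export content; real column headers may differ.
SAMPLE_EXPORT = """Name,Region,SSD usage (%)
eda-prod-1,us-east-1,91.5
eda-prod-2,us-west-2,42.0
eda-dev-1,us-east-1,80.0
"""


def rows_above_threshold(csv_file, column="SSD usage (%)", threshold=80.0):
    """Return the rows whose value in `column` is strictly above `threshold`."""
    reader = csv.DictReader(csv_file)
    return [row for row in reader if float(row[column]) > threshold]


flagged = rows_above_threshold(io.StringIO(SAMPLE_EXPORT))
print([row["Name"] for row in flagged])  # ['eda-prod-1']
```

To run the same check against a downloaded file, pass an open file handle instead of the in-memory sample, for example `open("clusters.csv", newline="")`, where the file name is a placeholder.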