Monitoring OnCommand Insight system health
You should periodically check the current status of your Insight system components by viewing the health page, which shows the status of each component and alerts you when there is an issue.
Steps
-
Log in to the Insightweb UI.
-
Click Admin and select Health.
The Health page is displayed.
-
View the summary of the current status of the components paying particular attention to any attention status in the Details column that is preceded by a red circle, which indicates an issue that requires your immediate attention.
The Health page displays information about any or all of the following Insight components based on your system configuration:
Component
Test
Details
Displays
Acquisition
Inventory data processing
Status of local acquisition unit
“OK” if number of concurrently-polling data sources is less than 75% of execution pool maximum (default maximum is 30). “Acquisition is busy” if usage is greater than 75%, and recommends increasing polling interval or adding more remote acquisition units.
DWH
Backup
Status of Data Warehouse scheduled backup
“OK” and the last successful DWH backup time if DWH scheduled backup is enabled. Otherwise, displays information about any error found.
DWH
ETL
Status of Data Warehouse ETL
“OK” and the last successful DWH build time if no errors. Otherwise, displays information about any error found.
Server
ASUP
Status of ASUP
“ASUP Enabled” and the last successful phonehome time if available. “ASUP Failed” if phonehome is enabled but encountered a problem.
+ "Invalid backup location" if backup directory is not valid.
+ Displays the last successful phonehome time as well as time of the last failed attempt if available.
+ “ASUP Disabled” if phonehome is disabled.
Server
Auto resolution
Status of automatic device resolution
“OK” if no errors. “Auto resolution is blocked” if identification errors prevent resolution progress.
+ “Low success rate” if less than 75% of generic devices could be identified.
Server
Elasticsearch
Status of elastic search data store
“OK” if no errors. “Service unavailable” if unable to connect to elastic search service.
+ "Cluster mode detected" if more than one node is detected.
+ "High memory utilization" if heap space used is more than 85%.
+ "Status: RED" indicates an error reported by elastic search. Displays information about the error and recommends contacting customer support.
Server
CPU
Insight CPU usage
“OK” if CPU load is less than 65%. “System CPU load is high. Reduce your CPU load.” if CPU load is greater than 65%.
Server
Disk space
Status of disk space
Free disk space, disk space in use by Insight, and recommended disk space reserved for Insight. “Low Disk Space” if disk utilization is more than 80%.
Server
EventBus
Status of EventBus
“EventBus is empty” if EventBus queue is empty, otherwise displays status of EventBus queue.
Server
Inventory data processing
Status of inventory data processing capability of Insight server
“OK” if Insight server is not busy. “Server is busy” if the server is busy at least 75% of the time for the last hour. Recommends not adding more data sources and recommends splitting the environment to several servers.
Server
MySQL
Status of MySQL database
“OK” if no problems are detected. “The database is having performance issues. Some queries are taking too long to run” if the number of slow queries is more than 5%.
+ “The database log file grew more than <size> in the past hour. Check MySQL log file” if the error log grows to more than 20 KB.
Server
Performance archive
Status of performance archive
“Performance archive is enabled” or “Performance archive is not enabled”.
Server
Physical memory
Status of physical memory
“OK” if memory usage is less than 85%. “Memory usage is high. Reduce your overall memory footprint for system stability” if memory usage is greater than 85%.
Server
Service pack
Service pack availability
Displays whether a service pack is available for Insight. If a service pack is available, displays instructions.
Server
Usage information
Status of sending of usage information
Displays whether sending of usage information to NetApp is enabled or disabled. Recommends enabling if disabled. Displays last attempted or last successful send time.
+ Displays information on any problems encountered.
Server
Violation
Status of open violations
“OK” if the number of open violations is less than 75% of the violations limit. “Maximum number of open violations allowed is <number>” if the number of open violations is greater than 75% of the violations limit. Recommends reviewing performance policy configuration.
+ “Violation manager is blocked” if the number of open violations is at the violations limit.
+ Note that the violation manager cannot create new violations and recommends reviewing performance policy configuration.
Server
Weekly backup
Status of weekly backup
“OK” if weekly backup is enabled, otherwise displays “Weekly backup is not enabled”.