Skip to main content
OnCommand Insight

Monitoring OnCommand Insight system health

Contributors netapp-alavoie

You should periodically check the current status of your Insight system components by viewing the health page, which shows the status of each component and alerts you when there is an issue.

Steps

  1. Log in to the Insightweb UI.

  2. Click Admin and select Health.

    The Health page is displayed.

  3. View the summary of the current status of the components paying particular attention to any attention status in the Details column that is preceded by a red circle, which indicates an issue that requires your immediate attention.

    The Health page displays information about any or all of the following Insight components based on your system configuration:

    Component

    Test

    Details

    Displays

    Acquisition

    Inventory data processing

    Status of local acquisition unit

    “OK” if number of concurrently-polling data sources is less than 75% of execution pool maximum (default maximum is 30). “Acquisition is busy” if usage is greater than 75%, and recommends increasing polling interval or adding more remote acquisition units.

    DWH

    Backup

    Status of Data Warehouse scheduled backup

    “OK” and the last successful DWH backup time if DWH scheduled backup is enabled. Otherwise, displays information about any error found.

    DWH

    ETL

    Status of Data Warehouse ETL

    “OK” and the last successful DWH build time if no errors. Otherwise, displays information about any error found.

    Server

    ASUP

    Status of ASUP

    “ASUP Enabled” and the last successful phonehome time if available. “ASUP Failed” if phonehome is enabled but encountered a problem.

    + "Invalid backup location" if backup directory is not valid.

    + Displays the last successful phonehome time as well as time of the last failed attempt if available.

    + “ASUP Disabled” if phonehome is disabled.

    Server

    Auto resolution

    Status of automatic device resolution

    “OK” if no errors. “Auto resolution is blocked” if identification errors prevent resolution progress.

    + “Low success rate” if less than 75% of generic devices could be identified.

    Server

    Elasticsearch

    Status of elastic search data store

    “OK” if no errors. “Service unavailable” if unable to connect to elastic search service.

    + "Cluster mode detected" if more than one node is detected.

    + "High memory utilization" if heap space used is more than 85%.

    + "Status: RED" indicates an error reported by elastic search. Displays information about the error and recommends contacting customer support.

    Server

    CPU

    Insight CPU usage

    “OK” if CPU load is less than 65%. “System CPU load is high. Reduce your CPU load.” if CPU load is greater than 65%.

    Server

    Disk space

    Status of disk space

    Free disk space, disk space in use by Insight, and recommended disk space reserved for Insight. “Low Disk Space” if disk utilization is more than 80%.

    Server

    EventBus

    Status of EventBus

    “EventBus is empty” if EventBus queue is empty, otherwise displays status of EventBus queue.

    Server

    Inventory data processing

    Status of inventory data processing capability of Insight server

    “OK” if Insight server is not busy. “Server is busy” if the server is busy at least 75% of the time for the last hour. Recommends not adding more data sources and recommends splitting the environment to several servers.

    Server

    MySQL

    Status of MySQL database

    “OK” if no problems are detected. “The database is having performance issues. Some queries are taking too long to run” if the number of slow queries is more than 5%.

    + “The database log file grew more than <size> in the past hour. Check MySQL log file” if the error log grows to more than 20 KB.

    Server

    Performance archive

    Status of performance archive

    “Performance archive is enabled” or “Performance archive is not enabled”.

    Server

    Physical memory

    Status of physical memory

    “OK” if memory usage is less than 85%. “Memory usage is high. Reduce your overall memory footprint for system stability” if memory usage is greater than 85%.

    Server

    Service pack

    Service pack availability

    Displays whether a service pack is available for Insight. If a service pack is available, displays instructions.

    Server

    Usage information

    Status of sending of usage information

    Displays whether sending of usage information to NetApp is enabled or disabled. Recommends enabling if disabled. Displays last attempted or last successful send time.

    + Displays information on any problems encountered.

    Server

    Violation

    Status of open violations

    “OK” if the number of open violations is less than 75% of the violations limit. “Maximum number of open violations allowed is <number>” if the number of open violations is greater than 75% of the violations limit. Recommends reviewing performance policy configuration.

    + “Violation manager is blocked” if the number of open violations is at the violations limit.

    + Note that the violation manager cannot create new violations and recommends reviewing performance policy configuration.

    Server

    Weekly backup

    Status of weekly backup

    “OK” if weekly backup is enabled, otherwise displays “Weekly backup is not enabled”.