This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Foglight Alerts and associated devices

Can Foglight create a reports  to show the monitored devices and there associated alerts. I'm not an administrator to verify all true, but being told this isn't possible? An 200 page report can be generated of all alerts but nothing can be created simply to list the devices monitored and there associated alert.

Parents
  • Hi Eric,

    The first step to see what rules apply to a "device" is to check the Agents Status page to see what type of device it is. The namespace and type columns will help. Most agents are fairly easy to figure out.

    Then we go to the Rules dashboard, and pick the cartridge type from the pulldown. Normally it's easy to see a 1:1 relationship from the prior dashboard, but I picked one where it's not. HostAgents generally fall under the Infrastructure cartridge. From there, we can see what rules are enabled for that type.

     

    A rule fires when a condition is met. The result is an alarm (or some will say an alert). There is a 1:n relation between a rule and alarms. For example, if we have 100 hosts being monitored, and they all have a condition where the "cpu utilization" rule evaluates to true, then we would have 100 cpu utilization alarms.

    The quick way that I take to see "what has fired" is to use the Alarms Analysis dashboard. Navigate to the Alarms dashboard, then Alarms Analysis tab. Pick a time range and then sort by Alarm Count.

    Clicking on an alarm in the Alarm Source column pops up additional detail. Select the "Error Instances" tab to see the objects (ie. devices, hosts, etc.) that the rule fired an alarm against.

    Hope that helps a bit.

     

     

     

     

     

     

     

Reply
  • Hi Eric,

    The first step to see what rules apply to a "device" is to check the Agents Status page to see what type of device it is. The namespace and type columns will help. Most agents are fairly easy to figure out.

    Then we go to the Rules dashboard, and pick the cartridge type from the pulldown. Normally it's easy to see a 1:1 relationship from the prior dashboard, but I picked one where it's not. HostAgents generally fall under the Infrastructure cartridge. From there, we can see what rules are enabled for that type.

     

    A rule fires when a condition is met. The result is an alarm (or some will say an alert). There is a 1:n relation between a rule and alarms. For example, if we have 100 hosts being monitored, and they all have a condition where the "cpu utilization" rule evaluates to true, then we would have 100 cpu utilization alarms.

    The quick way that I take to see "what has fired" is to use the Alarms Analysis dashboard. Navigate to the Alarms dashboard, then Alarms Analysis tab. Pick a time range and then sort by Alarm Count.

    Clicking on an alarm in the Alarm Source column pops up additional detail. Select the "Error Instances" tab to see the objects (ie. devices, hosts, etc.) that the rule fired an alarm against.

    Hope that helps a bit.

     

     

     

     

     

     

     

Children
No Data