Page 1 of 2 12 LastLast
Results 1 to 10 of 14

Thread: Unexpected notifications about dependent hosts

  1. #1
    Junior Member
    Join Date
    Sep 2012
    Posts
    12

    Unexpected notifications about dependent hosts

    Hello,

    I'm getting unexpected notifications in case of a network failure: the dependent hosts are being considered in a failure state, when they should be considered just unreachable. Here are the definitions related to one of these dependent hosts:

    Code:
    define host{
        use           http,debian
        contact_groups     admins,suporte
        host_name        erp
        alias          ERP
        address         erp.xxxxx.com.br
        parents         thomsongateway
        icon_set        server
        }
    
    define host{
        use           router
        contact_groups     admins
        host_name        thomsongateway
        alias          Thomson Gateway
        address         192.168.1.1
        parents         dlinkrouter
        icon_set        network_service
        check_interval     1
        }
    
    define host{
        use           http
        contact_groups     admins
        host_name        dlinkrouter
        alias          Dlink Router
        address         192.168.0.1
        parents         localhost
        icon_set        network_service
        }
    When host "dlinkrouter" goes down, I get mail notification about services in host "erp" going down too (state Critical, "No route to host&quot. I would expect notification only from host "dlinkrouter", the root problem, according to http://www.shinken-monitoring.org/wi...ies_in_shinken. Shinken is running in host "localhost".

    Any ideas?

    Thanks!

  2. #2
    Shinken project leader
    Join Date
    May 2011
    Location
    Bordeaux (France)
    Posts
    2,131

    Re: Unexpected notifications about dependent hosts

    Can you post the schedulerd.log lines relatives to theses hosts? Because yes, should not be sent if a parent is down.
    No direct support by personal message. Please open a thread so everyone can see the solution

  3. #3
    Junior Member
    Join Date
    Sep 2012
    Posts
    12

    Re: Unexpected notifications about dependent hosts

    Sure, here is schedulerd.log:
    Code:
    2012-11-27 12:05:38,645 [1354025138] SERVICE ALERT: erp;Mem;UNKNOWN;SOFT;1;CHECK_NRPE: Socket timeout after 9 seconds.
    2012-11-27 12:06:49,465 [1354025209] SERVICE ALERT: erp;Mem;UNKNOWN;HARD;2;CHECK_NRPE: Socket timeout after 9 seconds.
    2012-11-27 12:06:57,528 [1354025217] SERVICE ALERT: thomsongateway;Port n Link Status;UNKNOWN;SOFT;1;(Service Check Timed Out)
    2012-11-27 12:07:10,618 [1354025230] SERVICE ALERT: erp;Disks;UNKNOWN;SOFT;1;CHECK_NRPE: Socket timeout after 9 seconds.
    2012-11-27 12:08:04,204 [1354025284] SERVICE ALERT: dlinkrouter;Http;CRITICAL;SOFT;1;No route to host
    2012-11-27 12:08:10,238 [1354025290] SERVICE ALERT: erp;Http_AL;CRITICAL;SOFT;1;No route to host
    2012-11-27 12:08:12,253 [1354025292] SERVICE ALERT: thomsongateway;Port n Link Status;UNKNOWN;HARD;2;(Service Check Timed Out)
    2012-11-27 12:08:14,272 [1354025294] SERVICE ALERT: erp;Disks;CRITICAL;HARD;2;Connection refused or timed out
    2012-11-27 12:08:14,282 [1354025294] SERVICE NOTIFICATION: admin;erp;Disks;CRITICAL;notify-service-by-email;Connection refused or timed out
    2012-11-27 12:08:14,284 [1354025294] SERVICE NOTIFICATION: suporte;erp;Disks;CRITICAL;notify-service-by-email;Connection refused or timed out
    2012-11-27 12:08:14,285 [1354025294] SERVICE NOTIFICATION: susin;erp;Disks;CRITICAL;notify-service-by-email;Connection refused or timed out
    2012-11-27 12:09:10,666 [1354025350] SERVICE ALERT: dlinkrouter;Http;CRITICAL;HARD;2;No route to host
    2012-11-27 12:09:11,143 [1354025351] SERVICE NOTIFICATION: admin;dlinkrouter;Http;CRITICAL;notify-service-by-email;No route to host
    2012-11-27 12:09:11,145 [1354025351] SERVICE NOTIFICATION: susin;dlinkrouter;Http;CRITICAL;notify-service-by-email;No route to host
    2012-11-27 12:09:13,735 [1354025353] SERVICE ALERT: erp;Http_AL;CRITICAL;HARD;2;No route to host
    2012-11-27 12:09:14,219 [1354025354] SERVICE NOTIFICATION: admin;erp;Http_AL;CRITICAL;notify-service-by-email;No route to host
    2012-11-27 12:09:14,220 [1354025354] SERVICE NOTIFICATION: suporte;erp;Http_AL;CRITICAL;notify-service-by-email;No route to host
    2012-11-27 12:09:14,221 [1354025354] SERVICE NOTIFICATION: susin;erp;Http_AL;CRITICAL;notify-service-by-email;No route to host
    2012-11-27 12:09:26,824 [1354025366] SERVICE ALERT: erp;Load;CRITICAL;SOFT;1;Connection refused or timed out
    2012-11-27 12:09:37,173 [1354025377] SERVICE NOTIFICATION: admin;acate;Http;CRITICAL;notify-service-by-email;No route to host
    2012-11-27 12:09:37,175 [1354025377] SERVICE NOTIFICATION: susin;acate;Http;CRITICAL;notify-service-by-email;No route to host
    2012-11-27 12:10:32,648 [1354025432] SERVICE ALERT: erp;Load;CRITICAL;HARD;2;Connection refused or timed out
    2012-11-27 12:10:33,023 [1354025433] SERVICE NOTIFICATION: admin;erp;Load;CRITICAL;notify-service-by-email;Connection refused or timed out
    2012-11-27 12:10:33,025 [1354025433] SERVICE NOTIFICATION: suporte;erp;Load;CRITICAL;notify-service-by-email;Connection refused or timed out
    2012-11-27 12:10:33,026 [1354025433] SERVICE NOTIFICATION: susin;erp;Load;CRITICAL;notify-service-by-email;Connection refused or timed out
    2012-11-27 12:11:53,555 [1354025513] SERVICE ALERT: erp;Mem;OK;HARD;2;OK - 58.6% (599564 kB) used.
    2012-11-27 12:13:17,337 [1354025597] SERVICE ALERT: thomsongateway;Port n Link Status;OK;HARD;2;SNMP OK - up(1)
    2012-11-27 12:13:17,341 [1354025597] SERVICE ALERT: erp;Disks;OK;HARD;2;DISK OK - free space: / 33521 MB (71% inode=64%);
    2012-11-27 12:13:17,764 [1354025597] SERVICE NOTIFICATION: admin;erp;Disks;OK;notify-service-by-email;DISK OK - free space: / 33521 MB (71% inode=64%);
    2012-11-27 12:13:17,766 [1354025597] SERVICE NOTIFICATION: suporte;erp;Disks;OK;notify-service-by-email;DISK OK - free space: / 33521 MB (71% inode=64%);
    2012-11-27 12:13:17,768 [1354025597] SERVICE NOTIFICATION: susin;erp;Disks;OK;notify-service-by-email;DISK OK - free space: / 33521 MB (71% inode=64%);
    2012-11-27 12:14:12,859 [1354025652] SERVICE ALERT: dlinkrouter;Http;OK;HARD;2;HTTP OK: HTTP/1.1 200 OK - 9351 bytes in 0.016 second response time
    2012-11-27 12:14:13,684 [1354025653] SERVICE NOTIFICATION: admin;dlinkrouter;Http;OK;notify-service-by-email;HTTP OK: HTTP/1.1 200 OK - 9351 bytes in 0.016 second response time
    2012-11-27 12:14:13,685 [1354025653] SERVICE NOTIFICATION: susin;dlinkrouter;Http;OK;notify-service-by-email;HTTP OK: HTTP/1.1 200 OK - 9351 bytes in 0.016 second response time
    2012-11-27 12:14:18,895 [1354025658] SERVICE ALERT: erp;Http_AL;OK;HARD;2;HTTP OK: HTTP/1.1 200 OK - 8952 bytes in 0.534 second response time
    2012-11-27 12:14:19,765 [1354025659] SERVICE NOTIFICATION: admin;erp;Http_AL;OK;notify-service-by-email;HTTP OK: HTTP/1.1 200 OK - 8952 bytes in 0.534 second response time
    2012-11-27 12:14:19,767 [1354025659] SERVICE NOTIFICATION: suporte;erp;Http_AL;OK;notify-service-by-email;HTTP OK: HTTP/1.1 200 OK - 8952 bytes in 0.534 second response time
    2012-11-27 12:14:19,768 [1354025659] SERVICE NOTIFICATION: susin;erp;Http_AL;OK;notify-service-by-email;HTTP OK: HTTP/1.1 200 OK - 8952 bytes in 0.534 second response time

  4. #4
    Shinken project leader
    Join Date
    May 2011
    Location
    Bordeaux (France)
    Posts
    2,131

    Re: Unexpected notifications about dependent hosts

    I only see Service alerts, and no hosts one. What were the host states?
    No direct support by personal message. Please open a thread so everyone can see the solution

  5. #5
    Junior Member
    Join Date
    Sep 2012
    Posts
    12

    Re: Unexpected notifications about dependent hosts

    There are no host alerts in schedulerd.log. Only "SERVICE ALERT" and "SERVICE FLAPPING ALERT". Maybe a misconfiguration?



  6. #6
    Shinken project leader
    Join Date
    May 2011
    Location
    Bordeaux (France)
    Posts
    2,131

    Re: Unexpected notifications about dependent hosts

    Yes maybe. Please look in the WebUI at your host state, and if the check is really effective. A (bad) service check should raise an host check that will disable notifications, but if the host one is never raised, that's can be a problem here.
    No direct support by personal message. Please open a thread so everyone can see the solution

  7. #7
    Junior Member
    Join Date
    Sep 2012
    Posts
    12

    Re: Unexpected notifications about dependent hosts

    From WebUI:
    * erp - Host assumed to be UP, since 2d23h
    * thomsongateway - Host assumed to be UP, since 2d23h
    * dlinkrouter - Host assumed to be UP, since 2d23h

    Is there something more I could check?

  8. #8
    Shinken project leader
    Join Date
    May 2011
    Location
    Bordeaux (France)
    Posts
    2,131

    Re: Unexpected notifications about dependent hosts

    Can you look in your web interface for theses hosts if the check is enabled (like a next check or something like this).
    No direct support by personal message. Please open a thread so everyone can see the solution

  9. #9
    Junior Member
    Join Date
    Sep 2012
    Posts
    12

    Re: Unexpected notifications about dependent hosts

    From WebUI:

    * dlinkrouter
    Last Check: was 1m 25s ago
    Last State Change Fri Nov 30 15:20:01 2012
    Current Attempt 1/2 (HARD state)
    Next Active Check: in 3m 37s

    * thomsongateway
    Last Check: was 49s ago
    Last State Change Fri Nov 30 15:18:06 2012
    Current Attempt 1/2 (HARD state)
    Next Active Check: in 13s

    * erp
    Last Check: was 4m 13s ago
    Last State Change Fri Nov 30 15:18:23 2012
    Current Attempt 1/2 (HARD state)
    Next Active Check: in 49s

  10. #10
    Shinken project leader
    Join Date
    May 2011
    Location
    Bordeaux (France)
    Posts
    2,131

    Re: Unexpected notifications about dependent hosts

    The generic-host from who your hosts are inherited do not have a default check_command, so if you don't put on in router for example, a default "UP" check will be raised. You can add a check_command to this template (like a ping command, look in commands.cfg, should be one) and you will be ok
    No direct support by personal message. Please open a thread so everyone can see the solution

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •