OE Management and e-mail notifications

Posted by Dmitry Lishafaev on 17-Jun-2014 10:26

I have AIX-box with OEM installed and Windows server with some integration applications . Windows server is enabled for remote monitoring. I have configured  'FileStale' rule for log file (generated with custom program)

My settings are:

Resource:File

Schedule:    Default Shedule
Polling Interval:    1 minute
Alerts Enabled:    true
Trend Performance Data:    false
Trend every:    1 poll(s)

Rules selected:

Rule definition
Alert if file older than:    1 minutes
Alert Severity:    Severe
Throw alert after:    1 failed poll(s) 
Throw additional alerts:    true
On alert perform action:    IntegraAlert
Clear alert after:    1 successful poll(s) 
On clear perform action:    None 

I have stopped my process. After 1 minute I have received email with alert. But further polls doesn't generate any alerts.

How I can  receive alerts with every failed poll with this rule?

(ps: Resources such as  Appserver or FileSystem generates alerts successfully )

All Replies

Posted by Satya Prasad on 18-Jun-2014 01:22

While adding a rule to the log file resource, Try setting "Throw additional alerts" to every 1 failures  in case of Error/Severe. Please try with this option.

Hope this helps.

Regards,

Satya

Posted by Dmitry Lishafaev on 24-Jun-2014 01:03

Thank you, but this condition was already checked?

Throw alert after:     1 failed poll(s)

Throw additional alerts:     true

Posted by Libor Laubacher on 24-Jun-2014 08:06

Dmitrij,

what versions are involved here ?

"On alert perform action: IntegraAlert" - I am assuming this is correctly configured.

Also - I'd like to check one thing - is the alert thown and email does not arrive, or the alert is not thrown as well. Did the alert count within the thrown alert increased at least ?

Posted by Dmitry Lishafaev on 24-Jun-2014 18:59

Libor,

10.2B08/AIX64

IntegraAlert was correctly configured. I have only one  letter when this error occurs.

Error count was increased.

Maybe I don't understand this rule correctly?

My goal is:

When my process stops - he doesn't update log-file. "File Age" rule polls each 60 seconds and if log file wasn't updated at least in 100 seconds I should receive message.

I have problem only with this "File rule". Everything but this rule works fine - I recieve many messages daily. Should I check my mailserver in this case?

And no errors in adminserver log. (at least at standard log level).

Posted by Libor Laubacher on 25-Jun-2014 03:49

Dmitrij,

your understanding of the rule is correct, that's what it is supposed to do. Others are using it in a similar manner.

> Should I check my mailserver in this case?

I don't think it's a mailserver issue since your other alerts are working fine. And also you get one email from this one. The alert should be also posted in the admserv.log file. Since the error count gets increased, the poll/rule is processed. I am going to try and see if I can observe it here in a similar envrionment.

You said this file is on the remote Windows container? What Windows and also 10.2B08 ?

Also can you do "fathom -dump ll.xml" from proenv on your OEM and send me the ll.xml file ? (offline, email might work best)

Thanks

/Libor

Posted by Dmitry Lishafaev on 28-Jun-2014 06:28

Libor,

Yes, all servers running 10.2B08.

I should move production this weekend to new IBM server (fresh 10.2B08 OE/OEM install, not upgrading), after this event I will test File Age rule in new environment again.

This thread is closed