We're currently testing Hyperic HQ as monitoring-tool for our oracle databases. We want it to scan the alter.log files for ORA- errors periodically and, if a new one is found, report the error through mail.
The current method we're using is this:
We set up a New Platform Service for the Server hosting the DB. Resource Type is Process. We set it up to querry the oracle smon process (State.Name.eq=oracle,Args.*.eq=ora_smon_INSTANCE) and enabled the service.log tracking.
service.log_track.include ORA-
service.log_track.files /ora/app/oracle/admin/INSTANCE/bdump/alert_INSTANCE.log
Then we set up an alarm.
If Condition: Event/Log Level(ANY) and matching substring "ORA-"
Enable Action(s): Each time conditions are met.
When we enable this monitor and insert a ORA- error into the .log file, Hyperic HQ sends us a Mail reporting an ORA- error happened and lists the -first- ORA- error that appears in the log. If we insert more errors Hyperic only reports the -total number- of errors that appear in the log, but no specifics, just a number.
- Triggering Condition(s):
If Event/Log Level(ANY) and matching substring "ORA-"
Log: /ora/app/oracle/admin/INSTANCE/bdump/alert_INSTANCE.log: Message 'ORA-' repeated 21 times
So, our question basicly boils down to: Is there a better/right way to do what we want to do?
The current method we're using is this:
We set up a New Platform Service for the Server hosting the DB. Resource Type is Process. We set it up to querry the oracle smon process (State.Name.eq=oracle,Args.*.eq=ora_smon_INSTANCE) and enabled the service.log tracking.
service.log_track.include ORA-
service.log_track.files /ora/app/oracle/admin/INSTANCE/bdump/alert_INSTANCE.log
Then we set up an alarm.
If Condition: Event/Log Level(ANY) and matching substring "ORA-"
Enable Action(s): Each time conditions are met.
When we enable this monitor and insert a ORA- error into the .log file, Hyperic HQ sends us a Mail reporting an ORA- error happened and lists the -first- ORA- error that appears in the log. If we insert more errors Hyperic only reports the -total number- of errors that appear in the log, but no specifics, just a number.
- Triggering Condition(s):
If Event/Log Level(ANY) and matching substring "ORA-"
Log: /ora/app/oracle/admin/INSTANCE/bdump/alert_INSTANCE.log: Message 'ORA-' repeated 21 times
So, our question basicly boils down to: Is there a better/right way to do what we want to do?