Trigger

When the PSC (Purdue System Cloud) systems detect a host failure, an email is sent to the IOC.

These alerts may come from either psc-hosting-alert-bounces@lists.purdue.edu or prtg_admin@pnw.edu.

The alert email will resemble the following example:

From: psc-hosting-alert-bounces@lists.purdue.edu

To: ioc@groups.purdue.edu

Subject: [psc-hosting-alert] [VMware vCenter - Alarm alarm.HostConnectivityAlarm] Host ntnx-pfw-esx-22.psc.purdue.edu in PSC-PFW is not responding

Target: ntnx-pfw-esx-22.psc.purdue.edu

Stateless event alarm

Alarm Definition:

([Event alarm expression: Cannot connect host - network error] OR [Event alarm expression: Cannot connect host - time-out] OR [Event alarm expression: Host connection lost])

Event details:

Host ntnx-pfw-esx-22.psc.purdue.edu in PSC-PFW is not responding
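For anyone who wants to pre-filter these alerts in a mailbox script, the following is a minimal Python sketch of how the host name could be pulled out of the subject line. It is based only on the single sample email above; the function name `parse_host_alert`, the `location` label for the `PSC-PFW` part, and the assumption that every alert subject follows the "Host ... in ... is not responding" wording are all hypothetical. Alerts from prtg_admin@pnw.edu may use a different subject format that this pattern would not match.

```python
import re

# Senders listed in the Trigger section of this page.
KNOWN_SENDERS = {
    "psc-hosting-alert-bounces@lists.purdue.edu",
    "prtg_admin@pnw.edu",
}

# Assumption: pattern inferred from the one sample subject shown above.
ALERT_SUBJECT = re.compile(
    r"Host (?P<host>\S+) in (?P<location>\S+) is not responding"
)


def parse_host_alert(sender, subject):
    """Return the host and location named in an alert, or None if no match."""
    if sender.strip().lower() not in KNOWN_SENDERS:
        return None
    match = ALERT_SUBJECT.search(subject)
    if match is None:
        return None
    return {"host": match.group("host"), "location": match.group("location")}
```

A subject that does not match returns `None`, so an unrecognized alert should still be read and handled manually rather than dropped.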


Additional Email to Watch For 



Action

Immediately contact the PSC Hosting Infrastructure on-call Administrator (765-496-3613) and report the alert. (Nights and weekends only. Ignore during business hours, 8am - 5pm.)

  • If there is no answer from the on-call Administrator, retry in 10 minutes.
  • If there is no answer from the on-call Administrator after the second attempt, contact the Manager (Sheila Williams; 407-474-1034).
  • If there is no answer from the Manager, call the Director (Keith Duvall; 219-448-8281).
  • If there is still no contact, reach out to the IOC on-call for the night or weekend.
  • Follow all other normal IOC procedures: send a follow-up email to psc-hosting@purdue.edu, log the incident, etc.
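The call order above can be sketched as a small Python helper, for example as part of a checklist tool. This is an illustrative sketch, not an existing IOC script: the names `is_after_hours`, `next_contact`, and `ESCALATION_STEPS` are hypothetical, business hours are assumed to be Monday through Friday in local time with 5pm treated as after hours, and holidays are not handled. Only the 10-minute wait before the second call to the on-call Administrator comes from this page; the zero waits on the other steps are an assumption.

```python
from datetime import datetime

BUSINESS_START_HOUR = 8   # 8am
BUSINESS_END_HOUR = 17    # 5pm

# (who to call, minutes to wait before calling), in the order given above.
ESCALATION_STEPS = [
    ("PSC Hosting Infrastructure on-call Administrator", 0),
    ("PSC Hosting Infrastructure on-call Administrator (second attempt)", 10),
    ("Manager", 0),
    ("Director", 0),
    ("IOC on-call", 0),
]


def is_after_hours(now):
    """True on weekends, or on weekdays outside 8am - 5pm (assumed local time)."""
    if now.weekday() >= 5:  # 5 = Saturday, 6 = Sunday
        return True
    return not (BUSINESS_START_HOUR <= now.hour < BUSINESS_END_HOUR)


def next_contact(failed_attempts):
    """Return the next (contact, wait_minutes) step, or None when the list is exhausted."""
    if not 0 <= failed_attempts < len(ESCALATION_STEPS):
        return None
    return ESCALATION_STEPS[failed_attempts]
```

For example, `next_contact(1)` returns the second attempt to reach the on-call Administrator with a 10-minute wait. Phone numbers are deliberately left out of the sketch so that the list above remains the single place to update them.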