-
Type:
Bug
-
Status: Closed
-
Priority:
Critical
-
Resolution: Cancelled
-
Affects Version/s: None
-
Fix Version/s: None
-
Component/s: Platform
-
Labels:None
-
Environment:Production
-
Bug Severity:Medium
-
Module:Platform
-
Reported by:Client
-
Company:All Clients/Multiple Clients
Concern: Production down; Can log in but getting a blank screen or a server error
Date: 24th Oct:
Time: 11:35 AM PST to 04:22 PM PST
Cause: Database 1 and 3 disk drive were not accessible, due to which cluster were not able to move
Correction: Verizon restarted the database and cluster
Root cause ticket open with Verizon, Below is the reply from Verizon Engineer
“Created Microsoft case for this as well, as I see no obvious reason for this cluster behaviour. Also we will go through the logs to get the bottom of this”
Error message – We observed that below error was coming from 09/06/2017 till cluster failover.
Cluster network name resource 'Cluster Name' cannot be brought online. The computer object associated with the resource could not be
updated in domain 'managed.cln' for the following reason:
Unable to update password for computer account.
The text for the associated error code is: Access is denied.
The cluster identity 'DAC30415VIR001$' may lack permissions required to update the object. Please work with your domain administrator to
ensure that the cluster identity can update computer objects in the domain.
Please find below image in which time is matching with exact production fail and up time, this service is the cause of production down.
Cluster resource 'Cluster Disk 1 - Q:\Quorum' in clustered service or application 'Cluster Group' failed.
Our analysis : Both error logs stopped after cluster restart. Which means first is a root cause and second is an impact. When restarted, the server first and second both error stopped logging.