Mirrored disks going offline due to controller issues, which can force a disk group to dismount to prevent data loss. Unbalanced I/O:
Automatic Storage Management (ASM) is a vital component of Oracle databases, providing a robust and efficient way to manage storage resources. The ASM health checker is a built-in tool that monitors the health of the ASM infrastructure, identifying potential issues before they become critical problems. When the ASM health checker reports "1 new failures updated," it indicates that a problem has been detected and needs attention. In this article, we'll explore what this message means, its implications, and step-by-step guidance on how to resolve the issue.
Troubleshooting "ASM Health Checker Found 1 New Failures" in Oracle Grid Infrastructure
when ASM's internal monitoring identifies a change in the status or integrity of a disk group. Incident Summary ASM Health Checker 1 New Failure Found asm health checker found 1 new failures updated
-- Query the location of the alert log SELECT value FROM v$parameter WHERE name = 'background_dump_dest'; Use code with caution.
ASM Health Check Report – New Failures Detected
Do not ignore this alert. Follow this standard triage procedure: Mirrored disks going offline due to controller issues,
Ensure your disk groups use appropriate redundancy (External, Normal, or High). Under normal redundancy, ensure no single failure group contains all copies of your critical data.
If a disk is offline, check the operating system messages (e.g., /var/log/messages on Linux or dmesg ). Look for SCSI errors or timeout messages. If the OS cannot see the LUN, the issue is at the hardware or SAN level, not the Oracle level.
health checker has detected a potential issue, such as disk corruption or a hardware-related failure group problem . In Oracle environments, this message often appears in the When the ASM health checker reports "1 new
More importantly, this message serves as a crucial reminder that proactive maintenance is the cornerstone of a resilient database environment. Regular monitoring, automated health checks, and well-tested backups will ensure that when the health checker does find a failure, it's merely a minor bump in the road rather than a catastrophic event.
The ASM health checker runs periodically, checking for various conditions that could indicate a problem, such as:
Sometimes the health checker flags a failure if a rebalance is stuck. SELECT * FROM v$asm_operation; Use code with caution.