200
Oracle® Clusterware Administration and Deployment Guide 11g Release 1 (11.1) B28255-07 August 2010

Administration and Deployment Guide 11g Release 1 (11.1)mirrors.iyunwei.com/oracle/docs/11.1-E28359-01/rac.111/b28255.pdf · Administration and Deployment Guide 11g Release 1

Embed Size (px)

Citation preview

Oracle® ClusterwareAdministration and Deployment Guide

11g Release 1 (11.1)

B28255-07

August 2010

Oracle Clusterware Administration and Deployment Guide, 11g Release 1 (11.1)

B28255-07

Copyright © 2007, 2010, Oracle and/or its affiliates. All rights reserved.

Primary Author: Richard Strohm

Contributing Authors: Ahmed Abbas, Mark Bauer, Eric Belden, Jonathan Creighton, Andrey Gusev, Winston Huang, Rajiv Jayaraman, Sameer Joshi, Roland Knapp, Erich Kriesler, Raj Kumar, Ivan Lam, Karen Li, Barb Lundhild, Bill Manry, Saar Maoz, Markus Michalewicz, Anil Nair, Philip Newlan, Kevin Reardon, Viv Schupmann, K.P. Singh, Duane Smith, Alok Srivastava, Janet Stern, Su Tang, Juan Tellez, Ramkumar Venkatesan, Douglas Williams

This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited.

The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.

If this software or related documentation is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable:

U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are "commercial computer software" or "commercial technical data" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, the use, duplication, disclosure, modification, and adaptation shall be subject to the restrictions and license terms set forth in the applicable Government contract, and, to the extent applicable by the terms of the Government contract, the additional rights set forth in FAR 52.227-19, Commercial Computer Software License (December 2007). Oracle USA, Inc., 500 Oracle Parkway, Redwood City, CA 94065.

This software is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous applications, including applications which may create a risk of personal injury. If you use this software in dangerous applications, then you shall be responsible to take all appropriate fail-safe, backup, redundancy, and other measures to ensure the safe use of this software. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this software in dangerous applications.

Oracle is a registered trademark of Oracle Corporation and/or its affiliates. Other names may be trademarks of their respective owners.

This software and documentation may provide access to or information on content, products, and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services.

iii

Contents

Preface ................................................................................................................................................................. xi

Audience....................................................................................................................................................... xiDocumentation Accessibility ..................................................................................................................... xiRelated Documents .................................................................................................................................... xiiConventions ................................................................................................................................................ xii

What's New in Oracle Clusterware Administration and Deployment? ...................... xv

Oracle Database 11g Release 1 (11.1) New Features in Oracle Clusterware..................................... xv

1 Introduction to Oracle Clusterware

What is Oracle Clusterware? .................................................................................................................. 1-1Oracle Clusterware Hardware Concepts and Requirements ...................................................... 1-3Oracle Clusterware Software Concepts and Requirements......................................................... 1-4

Overview of Oracle Clusterware Platform-Specific Software Components ................................ 1-5Oracle Clusterware Processes on Linux and UNIX Systems....................................................... 1-5Oracle Clusterware Services on Windows Systems ..................................................................... 1-5Oracle Clusterware Subcomponent Processes and Background Processes............................... 1-6

Overview of Installing Oracle Clusterware ........................................................................................ 1-7Oracle Clusterware Version Compatibility .................................................................................... 1-8About the Oracle Clusterware Installation .................................................................................... 1-9

Overview of Managing Oracle Clusterware Environments ......................................................... 1-10Overview of Extending or Removing Oracle Clusterware in Grid Environments .................. 1-11Overview of the Oracle Clusterware High Availability Framework and Application Programming Interface......................................................................................................................... 1-12

2 Administering Oracle Clusterware

Administering Voting Disks .................................................................................................................. 2-1Backing Up Voting Disks .................................................................................................................. 2-1Recovering Voting Disks................................................................................................................... 2-2Adding, Moving, or Deleting Voting Disks ................................................................................... 2-2

Administering the Oracle Cluster Registry......................................................................................... 2-3Adding, Replacing, Repairing, and Removing the Oracle Cluster Registry ............................. 2-3

Adding an Oracle Cluster Registry ......................................................................................... 2-4Replacing an Oracle Cluster Registry ..................................................................................... 2-5Repairing an Oracle Cluster Registry Configuration on a Local Node .............................. 2-5

iv

Removing an Oracle Cluster Registry...................................................................................... 2-6Overriding the Oracle Cluster Registry Data Loss Protection Mechanism........................ 2-6

Managing Backups and Recovering the Oracle Cluster Registry ............................................... 2-7Overview of Restoring the Oracle Cluster Registry from OCR Backups ........................... 2-8Restoring the Oracle Cluster Registry on Linux or UNIX Systems ..................................... 2-8Restoring the Oracle Cluster Registry on Windows Systems .............................................. 2-9

Diagnosing Oracle Cluster Registry Problems .............................................................................. 2-9Using the OCRDUMP Utility.................................................................................................... 2-9Using the OCRCHECK Utility ............................................................................................... 2-10

Administering the Oracle Cluster Registry with OCR Export and Import Commands ...... 2-10Importing Oracle Cluster Registry Content on Linux or UNIX Systems ....................... 2-10Importing Oracle Cluster Registry Content on Windows Systems ................................. 2-11

Implementing the Oracle HARD Initiative for the Oracle Cluster Registry .......................... 2-11Upgrading and Downgrading the Oracle Cluster Registry Configuration............................ 2-11

Changing Network Addresses............................................................................................................ 2-11Changing Public Network Addresses.......................................................................................... 2-12Changing Private Network Addresses ........................................................................................ 2-14

3 Cloning Oracle Clusterware

Introduction to Cloning Oracle Clusterware ...................................................................................... 3-1Preparing the Oracle Clusterware Home for Cloning....................................................................... 3-3Cloning Oracle Clusterware to Create a New Cluster ...................................................................... 3-5Cloning to Extend Oracle Clusterware to More Nodes in the Same Cluster............................. 3-10Cloning Script Variables Reference .................................................................................................. 3-12Locating and Viewing Log Files Generated During Cloning....................................................... 3-15

4 Adding and Deleting Oracle Clusterware Homes

Prerequisite Steps for Adding Oracle Clusterware ........................................................................... 4-1Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems ........................ 4-3

Adding an Oracle Clusterware Home to a New Node On Linux or UNIX Systems............... 4-3Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installerin Interactive Mode..................................................................................................................... 4-4Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installerin Silent Mode.............................................................................................................................. 4-6

Deleting an Oracle Clusterware Home from a Linux or UNIX System..................................... 4-7Deleting an Oracle Clusterware Home Using Oracle Universal Installerin Interactive Mode..................................................................................................................... 4-7Deleting an Oracle Clusterware Home from a Node that Cannot Be Accessed................ 4-9

Adding and Deleting Oracle Clusterware Homes on Windows Systems.................................. 4-10Adding an Oracle Clusterware Home to a Windows System.................................................. 4-10Deleting an Oracle Clusterware Home from a Windows System ........................................... 4-13

Deleting an Oracle Clusterware Home Using Oracle Universal Installerin Interactive Mode.................................................................................................................. 4-13Deleting an Oracle Clusterware Home Using Oracle Universal Installerin Silent Mode........................................................................................................................... 4-14

v

5 Making Applications Highly Available Using Oracle Clusterware

Overview of Using Oracle Clusterware Commands to Enable High Availability ...................... 5-1Overview of Managing Custom Applications with Oracle Clusterware Commands ............... 5-2Creating Application Profiles ................................................................................................................ 5-3

Application Resource Profiles .......................................................................................................... 5-4Required and Optional Profile Attributes ............................................................................... 5-4Application Profile Attributes................................................................................................... 5-4Default Profile Locations ........................................................................................................... 5-7

Using Entry Points to Manage Resources Using Oracle Clusterware........................................ 5-7Example of Using Oracle Clusterware Commands to Create Application Resources................ 5-8

Using the crs_profile Command to Create An Application Resource Profile........................... 5-8Oracle Clusterware Required Resources List.............................................................................. 5-10

Creating VIPs for Applications .............................................................................................. 5-10Application Placement Policies..................................................................................................... 5-11Optional Resources in Placement Decisions ............................................................................... 5-12

Oracle Clusterware Action Program Guidelines ............................................................................ 5-12How Oracle Clusterware Runs Action Programs ..................................................................... 5-13User Defined Attributes ................................................................................................................. 5-13Windows crsuser Program ............................................................................................................ 5-14

Using Oracle Clusterware Commands .............................................................................................. 5-14Registering Application Resources............................................................................................... 5-14Starting Application Resources..................................................................................................... 5-14Relocating Applications and Application Resources................................................................. 5-15Stopping Applications and Application Resources ................................................................... 5-16Managing Automatic Oracle Clusterware Resource Operations for Action Scripts............. 5-16

Preventing Automatic Restarts .............................................................................................. 5-16Automatically Manage Restart Attempts Counter for Resources .................................... 5-17Implications of Restart and Timeout Features for Previous Releases ............................. 5-18

Unregistering Applications and Application Resources........................................................... 5-18Displaying Clusterware Application and Application Resource Status Information .......... 5-18

A Cluster Verification Utility Reference

Using the Cluster Verification Utility ................................................................................................. A-1Cluster Verification Utility Requirements ......................................................................................... A-2Understanding CVU Commands, Help, Output, and Nodelist Shortcuts .................................. A-2

Using CVU Help ............................................................................................................................... A-3Verbose Mode and UNKNOWN Output ...................................................................................... A-3Cluster Verification Utility Nodelist Shortcuts............................................................................. A-3Cluster Verification Utility Configuration File ............................................................................ A-4

Performing Various CVU Tests............................................................................................................. A-5Cluster Verification Utility System Requirements Verifications................................................ A-5Cluster Verification Utility Storage Verifications......................................................................... A-5Cluster Verification Utility Connectivity Verifications ............................................................... A-6Cluster Verification Utility User and Permissions Verifications................................................ A-7Cluster Verification Utility Node Comparisons and Verifications............................................ A-7Cluster Verification Utility Installation Verifications .................................................................. A-8

vi

Cluster Verification Utility Cluster Integrity Verifications......................................................... A-8Cluster Verification Utility Oracle Clusterware Component Verifications.............................. A-8Cluster Verification Utility Argument and Option Definitions ................................................. A-9

Known Issues for the Cluster Verification Utility ............................................................................ A-9Database Versions Supported by Cluster Verification Utility.................................................... A-9Linux Shared Storage Accessibility (ssa) Check Reports Limitations ....................................... A-9Shared Disk Discovery on Red Hat Linux................................................................................... A-10

B OLSNODES Command Reference

C Oracle Interface Configuration (OIFCFG) Command Reference

Starting the OIFCFG Command-Line Interface ................................................................................ C-1Summary of the OIFCFG Usage ........................................................................................................... C-1

OIFCFG Command Format ............................................................................................................. C-2OIFCFG Commands ......................................................................................................................... C-2OIFCFG Command Parameters ...................................................................................................... C-2OIFCFG Usage Notes ....................................................................................................................... C-3OIFCFG Examples............................................................................................................................. C-4

D High Availability Oracle Clusterware Command-Line Reference and C API

Using Oracle Clusterware Commands ................................................................................................ D-1Application Profile Syntax............................................................................................................... D-1Security and Permissions ................................................................................................................. D-1

Oracle Clusterware Commands............................................................................................................ D-2crs_getperm........................................................................................................................................ D-3

Syntax and Options for crs_getperm ...................................................................................... D-3Example of crs_getperm............................................................................................................ D-3

crs_profile ........................................................................................................................................... D-4Syntax and Options for the crs_profile Command ............................................................... D-4Examples of the crs_profile Command................................................................................... D-7

crs_register ......................................................................................................................................... D-7Syntax and Options for the crs_register Command.............................................................. D-8Example of the crs_register Command................................................................................... D-9Related Commands for the crs_register Command.............................................................. D-9

crs_relocate......................................................................................................................................... D-9Syntax and Options for the crs_relocate Command ........................................................... D-10Example of the crs_relocate Command ................................................................................ D-11

crs_setperm ...................................................................................................................................... D-12Syntax and Options for the crs_setperm Command........................................................... D-12Example of the crs_setperm Command................................................................................ D-12

crs_stat .............................................................................................................................................. D-12Syntax and Options for the crs_stat Command................................................................... D-13Examples of the crs_stat Command...................................................................................... D-14

crs_start............................................................................................................................................. D-15Syntax and Options for the crs_start Command ................................................................. D-15Examples of the crs_start Command .................................................................................... D-16

vii

crs_stop ............................................................................................................................................. D-16Syntax and Options for the crs_stop Command ................................................................. D-16Examples of the crs_stop Command..................................................................................... D-17Status Messages for the crs_stop Command........................................................................ D-17

crs_unregister .................................................................................................................................. D-17Syntax and Options for the crs_unregister Command....................................................... D-17Example of the crs_unregister Command............................................................................ D-17

C Application Programming Interface to Oracle Clusterware ..................................................... D-18clscrs_init_crs ................................................................................................................................... D-18

Parameters................................................................................................................................. D-18Returns....................................................................................................................................... D-18Syntax ........................................................................................................................................ D-18

clscrs_term_crs................................................................................................................................. D-18Parameter .................................................................................................................................. D-18Returns....................................................................................................................................... D-18Syntax ........................................................................................................................................ D-18

clscrs_getnodename........................................................................................................................ D-19Parameters ................................................................................................................................ D-19Syntax ........................................................................................................................................ D-19

clscrs_env_create ............................................................................................................................. D-19Parameters................................................................................................................................. D-19Returns....................................................................................................................................... D-19Syntax ........................................................................................................................................ D-19

clscrs_env_set................................................................................................................................... D-19Parameters................................................................................................................................. D-19Returns....................................................................................................................................... D-19Syntax ........................................................................................................................................ D-19

clscrs_env_delete............................................................................................................................. D-20Parameters................................................................................................................................. D-20Returns....................................................................................................................................... D-20Syntax ........................................................................................................................................ D-20

clscrs_env_format............................................................................................................................ D-20Parameters................................................................................................................................. D-20Returns....................................................................................................................................... D-20Syntax ........................................................................................................................................ D-20

clscrs_start_resource ....................................................................................................................... D-20Parameters................................................................................................................................. D-20Returns....................................................................................................................................... D-21Syntax ........................................................................................................................................ D-21

clscrs_stop_resource ....................................................................................................................... D-21Parameters................................................................................................................................. D-21Returns....................................................................................................................................... D-21Syntax ........................................................................................................................................ D-21

clscrs_check_resource..................................................................................................................... D-21Parameters................................................................................................................................. D-21Returns....................................................................................................................................... D-22Syntax ........................................................................................................................................ D-22

viii

clscrs_register_resource.................................................................................................................. D-22Parameters ................................................................................................................................ D-22Returns....................................................................................................................................... D-22Syntax ........................................................................................................................................ D-22

clscrs_unregister_resource............................................................................................................. D-22Parameters................................................................................................................................. D-22Returns....................................................................................................................................... D-23Syntax ........................................................................................................................................ D-23

clscrs_stat.......................................................................................................................................... D-23Parameters................................................................................................................................. D-23Returns....................................................................................................................................... D-23Syntax ........................................................................................................................................ D-23

Functions for Managing Resource Structures.................................................................................. D-24Export Operations ........................................................................................................................... D-24

Stringpair Operations.............................................................................................................. D-24Splist Operations...................................................................................................................... D-25Resource Operations................................................................................................................ D-34Resource List Operations ........................................................................................................ D-35

E Managing the Oracle Cluster Registry

F Troubleshooting Oracle Clusterware

Monitoring Oracle Clusterware............................................................................................................ F-1Dynamic Debugging............................................................................................................................... F-3Component Level Debugging............................................................................................................... F-3

Enabling Debugging for CRS, OCR, CSS, and EVM Modules ................................................... F-3Creating an Initialization File to Contain the Debugging Level ................................................ F-5

Oracle Clusterware Shutdown and Startup ....................................................................................... F-5Enabling and Disabling Oracle Clusterware Daemons................................................................... F-6Determining the Active Versions and Software Versions ............................................................... F-6Diagnostics Collection Script................................................................................................................ F-7Oracle Clusterware Alerts...................................................................................................................... F-7Resource Debugging............................................................................................................................... F-7Checking the Health of the Clusterware ............................................................................................ F-7Clusterware Log Files and the Unified Log Directory Structure ................................................... F-8Troubleshooting the Oracle Cluster Registry .................................................................................... F-9

Using the OCRDUMP Utility to View Oracle Cluster Registry Content.................................. F-9OCRDUMP Utility Syntax and Options ................................................................................ F-9OCRDUMP Utility Examples ................................................................................................ F-10Sample OCRDUMP Utility Output....................................................................................... F-10

Using the OCRCHECK Utility ...................................................................................................... F-11Oracle Cluster Registry Troubleshooting .................................................................................... F-11

Enabling Additional Tracing for Oracle Clusterware High Availability ................................... F-11Generating Additional Trace Information for a Running Resource........................................ F-12Verifying Event Manager Daemon Communications ............................................................... F-12

ix

G Oracle Clusterware API Messages

H Oracle Clusterware Alert Messages

Index

x

xi

Preface

The Oracle Clusterware Administration and Deployment Guide describes the Oracle Clusterware architecture and provides an overview of this product. This book also describes administrative and deployment topics for Oracle Clusterware.

Information in this manual applies to Oracle Clusterware as it runs on all platforms unless otherwise noted. In addition, the content of this manual supplements administrative and deployment topics for Oracle single-instance databases that appear in other Oracle documentation. Where necessary, this manual refers to platform-specific documentation. This Preface contains these topics:

■ Audience

■ Documentation Accessibility

■ Related Documents

■ Conventions

AudienceThe Oracle Clusterware Administration and Deployment Guide is intended for database administrators, network administrators, and system administrators who perform the following tasks:

■ Install and configure Oracle RAC databases

■ Administer and manage Oracle RAC databases

■ Manage and troubleshoot clusters and networks that use Oracle RAC

Documentation AccessibilityOur goal is to make Oracle products, services, and supporting documentation accessible, with good usability, to the disabled community. To that end, our documentation includes features that make information available to users of assistive technology. This documentation is available in HTML format, and contains markup to facilitate access by the disabled community. Accessibility standards will continue to evolve over time, and Oracle is actively engaged with other market-leading technology vendors to address technical obstacles so that our documentation can be accessible to all of our customers. For more information, visit the Oracle Accessibility Program Web site at

http://www.oracle.com/accessibility/

xii

Accessibility of Code Examples in DocumentationScreen readers may not always correctly read the code examples in this document. The conventions for writing code require that closing braces should appear on an otherwise empty line; however, some screen readers may not always read a line of text that consists solely of a bracket or brace.

Accessibility of Links to External Web Sites in DocumentationThis documentation may contain links to Web sites of other companies or organizations that Oracle does not own or control. Oracle neither evaluates nor makes any representations regarding the accessibility of these Web sites.

TTY Access to Oracle Support ServicesOracle provides dedicated Text Telephone (TTY) access to Oracle Support Services within the United States of America 24 hours a day, 7 days a week. For TTY support, call 800.446.2398. Outside the United States, call +1.407.458.2479.

Related DocumentsFor more information, see the Oracle resources listed in this section.

■ Platform-specific Oracle Clusterware and Oracle RAC installation guides

Each platform-specific Oracle Database 11g installation media contains a copy of an Oracle Clusterware and Oracle RAC platform-specific installation and configuration guide in HTML and PDF formats. These installation books contain the preinstallation, installation, and post-installation information for the various UNIX, Linux, and Windows platforms on which Oracle Clusterware and Oracle RAC operate.

■ Oracle Database 2 Day + Real Application Clusters Guide

This task-oriented guide helps you understand the basic steps required to install, configure, and administer Oracle Clusterware and Oracle Real Application Clusters on a two-node system using Red Hat Linux system.

■ Oracle Real Application Clusters Administration and Deployment Guide

This is an essential companion book that describes Oracle Clusterware components such as the voting disks and the Oracle Cluster Registry (OCR).

■ Oracle Database 2 Day DBA

■ Oracle Database Administrator's Guide

■ Oracle Database Net Services Administrator's Guide

■ Oracle Database Platform Guide for Microsoft Windows

■ Oracle Database Administrator's Reference 11g Release 1 (11.1) for UNIX Operating Systems: AIX Systems, HP-UX, Linux, and the Solaris Operating System (SPARC)

Database error messages descriptions are available online or by way of a Tahiti documentation search.

ConventionsThe following text conventions are used in this document:

xiii

Convention Meaning

boldface Boldface type indicates graphical user interface elements associated with an action, or terms defined in text or the glossary.

italic Italic type indicates book titles, emphasis, or placeholder variables for which you supply particular values.

monospace Monospace type indicates commands within a paragraph, URLs, code in examples, text that appears on the screen, or text that you enter.

xiv

xv

What's New in Oracle ClusterwareAdministration and Deployment?

This section describes the new administration and deployment features for Oracle Clusterware in Oracle Database 11g Release 1 (11.1).

Oracle Database 11g Release 1 (11.1) New Features in Oracle Clusterware

This section describes the Oracle Database 11g Release 1 (11.1) features for Oracle Clusterware administration and deployment.

■ Oracle Clusterware Documented Separately

Oracle Clusterware is now documented in its own book. This book, Oracle Clusterware Administration and Deployment Guide, describes how to administer and deploy Oracle Clusterware. This book also contains information about the Oracle Clusterware API and API commands.

■ Oracle cloning procedures for Oracle Clusterware

This book includes new step-by-step procedures for cloning Oracle Clusterware homes to quickly create Oracle Clusterware environments on other nodes.

■ Optimal Flexible Architecture (OFA) ensures reliable installations and improves software manageability

This feature improves manageability by making default Oracle Database installations more compliant with Optimal Flexible Architecture (OFA) specifications. As a part of this feature, the Oracle Universal Installer has been updated so that the default installation follows Oracle's Optimal Flexible Architecture. This ensures reliable installations and improves software manageability.

In addition, Oracle Clusterware can be installed only once on the system, so a counter to track the number of times that Oracle Clusterware is installed is not required as it is for Oracle database homes and Oracle Database installations with Oracle RAC.

See Also: Your platform-specific Oracle Clusterware installation guide for more information about installing Oracle Clusterware

See Also: Chapter 3, "Cloning Oracle Clusterware"

xvi

■ New ocrconfig command options

The ocrconfig command now includes a new -manualbackup option that enables you to force a manual backup at any time, rather than wait for the automatic backup that occurs at 4 (or more) hour intervals. The new -manualbackup option provides a method for obtaining a binary backup on demand, such as after you make changes to the Oracle Cluster Registry (OCR).

In addition, the existing -showbackup option includes new auto and manual flags that you can optionally specify to display only the automatic backup information or manual backup information, respectively.

■ Add voting disks dynamically, with no downtime

You can now add a voting disk while the cluster is active. Cluster Synchronization Services (CSS) start to use the voting disk without the need to restart the cluster.

■ Improved Oracle Clusterware performance monitoring and diagnostics in Oracle Enterprise Manager

Both Oracle Enterprise Manager Database Control and Oracle Enterprise Manager Grid Control are cluster aware and provide a central console to manage your cluster database. From any location where you can access a web browser, you can manage Oracle Clusterware, application servers, host computers, and Web applications, as well as related hardware and software.

You now have the ability to see any given metric across database instances or hosts in the cluster as a tile chart. This high-level view capability means that you can review issues that are affecting the entire cluster as well as those that are affecting individual instances. This is a major enhancement in terms of how metrics are monitored for Oracle Clusterware. With Oracle Database 11g, you can see the roll-up or summary-based views as well as tile based views if you want to monitor how a metric performs across different hosts over a period of time.

See Also: "About the Oracle Clusterware Installation" and your platform-specific Oracle Clusterware and Oracle Real Application Clusters installation and configuration guide

See Also: "Managing Backups and Recovering the Oracle Cluster Registry" on page 2-7 and Appendix E, "Managing the Oracle Cluster Registry"

See Also: Monitoring Oracle Clusterware on page F-1 and Oracle Database 2 Day + Real Application Clusters Guide

Introduction to Oracle Clusterware 1-1

1Introduction to Oracle Clusterware

This chapter introduces Oracle Clusterware and describes how to install, administer, and deploy it. This chapter includes the following topics:

■ What is Oracle Clusterware?

■ Overview of Oracle Clusterware Platform-Specific Software Components

■ Overview of Installing Oracle Clusterware

■ Overview of Extending or Removing Oracle Clusterware in Grid Environments

■ Overview of Managing Oracle Clusterware Environments

■ Overview of the Oracle Clusterware High Availability Framework and Application Programming Interface

What is Oracle Clusterware?Oracle Clusterware is software that enables servers to operate together as if they are one server. Each server looks like any standalone server. However, each server has additional processes that communicate with each other so the separate servers appear as if they are one server to applications and end users.

Figure 1–1 shows a configuration that uses Oracle Clusterware to extend the basic single-instance Oracle Database architecture. In the figure, both Cluster 1 and Cluster 2 are connected to Oracle Database and are actively servicing applications and users. Using Oracle Clusterware, you can use the same high availability mechanisms to make your Oracle database and your custom applications highly available.

What is Oracle Clusterware?

1-2 Oracle Clusterware Administration and Deployment Guide

Figure 1–1 Oracle Clusterware Configuration

The benefits of using a cluster include:

■ Scalability for applications

■ Using lower-cost hardware

■ Ability to fail over

■ Ability to grow the capacity over time by adding servers, when needed

You can program Oracle Clusterware to manage the availability of user applications and Oracle databases.

In an Oracle Real Application Clusters (Oracle RAC) environment, Oracle Clusterware manages all of the Oracle Database processes automatically. Anything that Oracle Clusterware manages is known as a cluster resource, which could be a database, an instance, a service, a listener, a virtual IP (VIP) address, an application process, and so on.

Creating a cluster with Oracle Clusterware provides the ability to:

■ Eliminate unplanned downtime due to a hardware or software malfunction

■ Reduce or eliminate planned downtime for software maintenance

■ Increase throughput for applications by allowing the applications to run on all of the nodes in the cluster

Heartbeat hb

Oracle Net Services Client Access

Application / Web Servers

Pool of Storage

hbhb

Node 3Node 1

Cluster 2Cluster 1

Node 2

What is Oracle Clusterware?

Introduction to Oracle Clusterware 1-3

■ Increase throughput for the application, as required, by adding servers to the cluster, when necessary

■ Reduce the total cost of ownership for infrastructure by providing a scalable system with low cost commodity hardware

Oracle Clusterware is a requirement for using Oracle RAC and it is the only clusterware that you need for most platforms on which Oracle RAC operates. Although Oracle RAC continues to support select third-party clusterware products on specific platforms, you must also install and use Oracle Clusterware. Note that the servers on which you want to install and run Oracle Clusterware must be running the same operating system.

Using Oracle Clusterware eliminates the need for proprietary vendor clusterware and provides the benefit of using only Oracle software. Oracle provides an entire software solution, including everything from disk management with Oracle Automatic Storage Management (ASM) to data management with Oracle Database and Oracle RAC. In addition, Oracle Database features, such as Oracle Services, provide advanced features when used with the underlying Oracle Clusterware high availability framework.

Oracle Clusterware requires two components: a voting disk to record node membership information and the Oracle Cluster Registry (OCR) to record cluster configuration information. The voting disk and the OCR must reside on shared storage.

To use and install Oracle Clusterware, you need to understand the hardware and software concepts and requirements, as described in the following sections:

■ Oracle Clusterware Hardware Concepts and Requirements

■ Oracle Clusterware Software Concepts and Requirements

Oracle Clusterware Hardware Concepts and RequirementsMany hardware providers have validated cluster configurations that provide a single part number for a cluster. If you are new to clustering, the information in this section will make the hardware procurement easier when you work with hardware vendors to purchase the appropriate hardware to create a cluster.

A cluster is made up of one or more servers. A server in a cluster is similar to any standalone server, but a cluster requires a second network called the interconnect network. Therefore, the server minimally requires two network interface cards: one for the public network and one for the private network. The interconnect network is a private network using a switch (or multiple switches) that only the nodes in the cluster can access.1 Crossover cables are not supported for use with Oracle Clusterware interconnects.

The size of the server is dictated by the requirements of the workload you want to run on the cluster and the number of nodes you have chosen to configure in the cluster. If you are implementing the cluster for high availability, then configure redundancy for all components of the infrastructure. Therefore, you need to configure:

■ A network interface for the public network (generally this is an internal LAN)

■ A redundant network interface for the public network

■ A network interface for the private interconnect network

■ A redundant network interface for the private interconnect network

1 Oracle Clusterware supports up to 100 nodes in a cluster on configurations running Oracle Database 10g Release 2 (10.2) and later releases.

What is Oracle Clusterware?

1-4 Oracle Clusterware Administration and Deployment Guide

The cluster requires cluster-aware storage2 that is connected to each server in the cluster. This may also be referred to as a multihost device. Oracle Database supports both Storage Area Network (SAN) storage or Network Attached (NAS) storage.

Similar to the network, there are generally at least two connections from each server to the cluster-aware storage to provide redundancy. There may be more connections depending on your I/O requirements. It is important to consider the I/O requirements of the entire cluster when choosing the storage subsystem.

Most servers have at least one local disk that is internal to the server. Often, this disk is used for the operating system binaries and you can also use this disk for the Oracle binaries. The benefit of each server having its own copy of the binaries is that it simplifies rolling upgrades.

Oracle Clusterware Software Concepts and RequirementsOracle Clusterware uses a shared common disk for its configuration files.

Oracle Clusterware requires two configuration files: a voting disk to record node membership information and the OCR to record cluster configuration information. During the Oracle Clusterware installation, Oracle recommends that you configure multiple voting disks and the OCR:

■ Voting Disk

Oracle Clusterware uses the voting disk to determine which instances are members of a cluster. The voting disk must reside on a shared disk. For high availability, Oracle recommends that you have a minimum of three voting disks. If you configure a single voting disk, then you should use external mirroring to provide redundancy. You can have up to 32 voting disks in your cluster.

■ Oracle Cluster Registry (OCR)

Oracle Clusterware uses the OCR to store and manage information about the components that Oracle Clusterware controls, such as Oracle RAC databases, listeners, VIPs, and services and any applications. The OCR repository stores configuration information in a series of key-value pairs in a directory tree structure.

Oracle recommends that you use a multiplexed OCR to ensure cluster high availability. Consider the following points regarding the OCR:

– The OCR must reside on a shared disk that is accessible by all of the nodes in the cluster.

– Oracle Clusterware can multiplex the OCR.

– You can replace a failed OCR online.

– You must update the OCR through supported APIs such as Oracle Enterprise Manager, the Server Control Utility (SRVCTL), or the Database Configuration Assistant (DBCA).

– Oracle Clusterware requires that each node be connected to a private network by way of a private interconnect. For redundancy, you can have up to 32 voting disks and a mirror of the OCR.

2 Cluster-aware storage may also be referred to as a multihost device.

See Also: Chapter 2, "Administering Oracle Clusterware" for more information about voting disks and the OCR

Overview of Oracle Clusterware Platform-Specific Software Components

Introduction to Oracle Clusterware 1-5

Oracle Clusterware requires a virtual IP address for each node in the cluster. This IP address must be on the same subnet as the public IP address for the node and should be an address that is assigned a name in the Domain Name Service, but is unused and cannot be pinged in the network before installation of Oracle Clusterware. The VIP is a node application (nodeapp) defined in the OCR that is managed by Oracle Clusterware. The VIP is configured with the VIPCA utility. The root script calls the VIPCA utility in silent mode.

Each server must first have an operating system that is certified with the Oracle Clusterware version you are installing. See the certification matrices available on OracleMetaLink (http://certify.oraclecorp.com/certifyv3/certify/cert_views.group_selection?p_html_source=0) for details. Once the operating system is installed and working, you can then install Oracle Clusterware to create the cluster.

Oracle Clusterware is installed independently of Oracle Database. Once Oracle Clusterware is installed, you can install ASM, Oracle Database, or Oracle RAC on any of the nodes in the cluster.

Overview of Oracle Clusterware Platform-Specific Software ComponentsWhen Oracle Clusterware operates, several platform-specific processes or services also run on each node in the cluster. The UNIX, Linux, and Windows processes are described in the following sections:

■ Oracle Clusterware Processes on Linux and UNIX Systems

■ Oracle Clusterware Services on Windows Systems

Oracle Clusterware Processes on Linux and UNIX SystemsOracle Clusterware processes on Linux and UNIX systems include the following:

■ crsd—Performs high availability recovery and management operations such as maintaining the OCR and managing application resources. This process runs as LocalSystem. This process restarts automatically upon failure.

■ evmd—Event manager daemon. This process also starts the racgevt process to manage FAN server callouts.

■ ocssd—Manages cluster node membership and runs as the oracle user; failure of this process results in a node restart.

■ oprocd—Process monitor for the cluster. Note that this process only appears on platforms that do not use third-party vendor clusterware with Oracle Clusterware.

Oracle Clusterware Services on Windows Systems Oracle Clusterware services on Windows systems include the following:

See Also: Your platform-specific Oracle database installation documentation

Note: Oracle Clusterware on Linux platforms can have multiple threads that appear as separate processes with separate process identifiers.

Overview of Oracle Clusterware Platform-Specific Software Components

1-6 Oracle Clusterware Administration and Deployment Guide

■ OracleCRService—Performs high availability recovery and management operations such as maintaining the OCR and managing application resources. This process as the LocalSystem user on Windows. This process restarts automatically upon failure.

■ OracleCSService—Manages cluster node membership and runs as the oracle user who installed Oracle Clusterware; failure of this process results in node restart.

■ OracleEVMService—Event manager service. This process also starts the racgevt process to manage FAN server callouts.

■ OraFenceService—Process monitor for the cluster.

■ Oracle Process Manager Daemon (OPMD)—OPMD is registered with the Windows Service Control Manager (WSCM) and the startup of all Oracle Clusterware services is dependent on OPMD. On system startup, and after the default time period of 60 seconds has elapsed, OPMD automatically starts all of the registered Oracle Clusterware services. This startup delay enables other services to start that are outside of the scope of Oracle control, such as storage access, anti-virus, or firewall services. You can set OPMD to start manually. However, this will delay the startup of the rest of the affected Oracle Clusterware components.

Oracle Clusterware Subcomponent Processes and Background ProcessesOracle Clusterware comprises several processes that facilitate cluster operations. The Cluster Ready Services (CRS), Cluster Synchronization Service (CSS), Event Management (EVM), and Oracle Clusterware components communicate with other cluster component layers in the other instances in the same cluster database environment. These components are also the main communication links between Oracle Database, applications, and the Oracle Clusterware high availability components. In addition, these background processes monitor and manage database operations.

The following list describes some of the major Oracle Clusterware background processes. The list includes components that are processes on Linux and UNIX operating systems, or services on Windows.

■ Cluster Ready Services (CRS)—The primary program for managing high availability operations in a cluster. Anything that the CRS process manages is known as a cluster resource, which could be a database, an instance, a service, a listener, a virtual IP (VIP) address, an application process, and so on. The CRS process manages cluster resources based on the resource’s configuration information that is stored in the OCR. This includes start, stop, monitor and failover operations. The CRS process generates events when a resource status changes. When you have installed Oracle RAC, the CRS process monitors the Oracle database instance, listener, and so on, and automatically restarts these components when a failure occurs. By default, the CRS process makes three attempts to start the Oracle Notification Service (ONS), one attempt to start an Oracle database, and five attempts to restart other database components. The CRS process does not attempt to restart the VIP. After these initial attempts, the CRS process does not make further restart attempts if the resource does not restart.

See Also: Chapter 5, "Making Applications Highly Available Using Oracle Clusterware" for more detailed information about the Oracle Clusterware API

Overview of Installing Oracle Clusterware

Introduction to Oracle Clusterware 1-7

■ Cluster Synchronization Services (CSS)—Manages the cluster configuration by controlling which nodes are members of the cluster and by notifying members when a node joins or leaves the cluster. If you are using certified third-party clusterware, then the css process interfaces with your clusterware to manage node membership information.

■ Event Management (EVM)—A background process that publishes events that Oracle Clusterware creates.

■ Oracle Notification Service (ONS)—A publish and subscribe service for communicating Fast Application Notification (FAN) events.

■ Oracle Process Monitor Daemon (OPROCD)—This process (the OraFenceService service in Windows) is locked in memory to monitor the cluster and provide I/O fencing. The OPROCD periodically wakes up and checks that the interval since it last awoke is within the expected time. If not, then OPROCD resets the processor and restarts the node. An OPROCD failure results in Oracle Clusterware restarting the node.

■ RACG—Extends clusterware to support Oracle-specific requirements and complex resources. Runs server callout scripts when FAN events occur.

In Table 1–1, if a UNIX or a Linux system process has an (r) beside it, then the process runs as the root user. If a Windows system service has an (A) beside it, then the service runs as the Administrative user. Otherwise the process or service runs as the oracle user.

Overview of Installing Oracle ClusterwareInstall Oracle Clusterware with the Oracle Universal Installer.

The following sections introduce the installation processes for Oracle Clusterware:

Table 1–1 List of Processes and Services Associated with Oracle Clusterware

Oracle Clusterware Component

Linux/UNIX Process Windows Services

Windows Processes

Oracle Process Monitor Daemon

oprocd (r) OraFenceService

RACG racgmain, racgimon

racgmain.exe racgimon.exe

Oracle Notification Service (ONS)

ons ons.exe

Event Manager evmd (r), evmd.bin, evmlogger

OracleEVMService evmlogger.exe, evmd.exe

Cluster Ready Services (CRS)

crsd.bin (r) OracleCRService (A)

OracleCRSToken_user

crsd.exe

Cluster Synchronization Services (CSS)

init.cssd (r), ocssd (r), ocssd.bin

OracleCSService ocssd.exe

See Also: "Clusterware Log Files and the Unified Log Directory Structure" on page F-8 for information about the location of log files created for processes

Overview of Installing Oracle Clusterware

1-8 Oracle Clusterware Administration and Deployment Guide

■ Oracle Clusterware Version Compatibility

■ About the Oracle Clusterware Installation

Oracle Clusterware Version CompatibilityYou can install different releases of Oracle Clusterware, ASM, and Oracle Database software on your cluster. Follow these guidelines when installing different releases of software on your cluster:

■ There can be only be one installation of Oracle Clusterware running in the cluster, and it must be installed into its own home (CRS_home). The release of Oracle Clusterware you use must be equal to, or higher than the ASM and Oracle RAC versions running in the cluster; you cannot install a version of Oracle RAC that was released after the version of Oracle Clusterware that you are running on the cluster. That is:

– Oracle Clusterware Release 11.1 supports ASM Release 11.1, 10.2, and 10.1.

– Oracle Clusterware Release 11.1 supports Oracle Database 11g Release 1 (11.1), Oracle Database 10g Release 2 (10.2), and Release 1 (10.1).

– ASM Release 11.1 requires Oracle Clusterware Release 11.1 and supports Oracle Database 11g Release 1 (11.1), Oracle Database 10g Release 2 (10.2), and Release 1 (10.1).

– Oracle Database 11g Release 1 (11.1) requires Oracle Clusterware Release 11.1 and (if you are using ASM storage) you can run different releases of Oracle Database and ASM.

For example:

* If you have Oracle Clusterware Release 11.1 installed as your clusterware, then you can have an Oracle Database 10g Release 1 (10.1) single-instance database running on one node, and separate Oracle Real Application Clusters 10g Release 1, Release 2, and Oracle Real Application Clusters 11g Release 1 databases also running on the cluster. However, you cannot have Oracle Clusterware 10g Release 2 installed on your cluster, and install Oracle Real Application Clusters 11g. You can install Oracle Database 11g (single-instance) on a node in an Oracle Clusterware 10g Release 2 cluster.

* When using different release ASM and Oracle Database releases, the functionality of each is dependent on the functionality of the earlier software release. Thus, if you install Oracle Clusterware 11g and you later install ASM, and you use it to support an existing Oracle Database 10g Release 10.2.0.3 installation, then ASM functionality is equivalent only to that available in the 10.2 release version.

■ There can be multiple Oracle homes for the Oracle database (both single instance and Oracle RAC) in the cluster. Note that the Oracle RAC databases must be running Oracle Database 10g Release 1 (10.1) or higher.

■ You can use different users for the Oracle Clusterware and Oracle database homes as long as they belong to the same primary group.

■ There can only be one installation of ASM running in the cluster. It is recommended that ASM is running the same (or higher) release than that of the Oracle database.

■ For Oracle RAC running Oracle9i you must run an Oracle9i cluster. For UNIX systems, that is HACMP, Serviceguard, Sun Cluster, or Veritas SF. For Windows

Overview of Installing Oracle Clusterware

Introduction to Oracle Clusterware 1-9

and Linux systems, that is the Oracle Cluster Manager. If you want to install Oracle RAC 10g, then you must also install Oracle Clusterware.

■ You cannot install Oracle9i RAC on an Oracle Database 10g cluster. If you have an Oracle9i RAC cluster, you can add Oracle RAC 10g and they will work together. However, once you have installed Oracle Clusterware 10g, you can no longer install any new Oracle9i RAC.

■ Oracle recommends that you do not run different cluster software on the same servers unless they are certified to work together. However, if you are adding Oracle RAC to servers that are part of a cluster, either migrate to Oracle Clusterware or ensure that:

– The clusterware you are running is supported to run with Oracle RAC Release 10g.

– You have installed the correct options for Oracle Clusterware and the other-vendor clusterware to work together.

About the Oracle Clusterware Installation This section discusses Oracle Clusterware installations at a high level. For detailed installation instructions, see your platform-specific Oracle Clusterware installation guide.

Oracle Clusterware is distributed on the Oracle database installation media. The Oracle Universal Installer installs Oracle Clusterware into a directory structure referred to as CRS home. This home is separate from the home directories for other Oracle products installed on the same server. Oracle Universal Installer creates the Oracle Clusterware home directory for you. Before you start the installation, you must have sufficient disk space on a file system for the Oracle Clusterware directory. As a part of the installation and configuration, the CRS home and all of its parent directories are changed to be owned by the root user.

Because Oracle Clusterware works closely with the operating system, system administrator access is required for some of the installation tasks. In addition, some of the Oracle Clusterware processes must run as the system administrator, which is generally the root user on Linux and UNIX systems and the LocalSystem account on Windows systems.

Before you install Oracle Clusterware, Oracle recommends that you run the Cluster Verification Utility (CVU) to ensure that your environment meets the Oracle Clusterware installation requirements. Oracle Universal Installer also automatically runs CVU at the end of the clusterware installation to verify various clusterware components. The CVU simplifies the installation, configuration, and overall management of the Oracle Clusterware installation process by identifying problems in cluster environments.

During the Oracle Clusterware installation, you must identify three IP addresses for each node that is going to be part of your installation. One IP address is for the private interconnect, one is for the public interconnect, and the third IP address is the virtual IP address that clients will use to connect to each instance.

The Oracle Clusterware installation process creates the voting disk and OCR on shared storage. When you use normal redundancy, Oracle Clusterware maintains two copies of the OCR file and three copies of the voting disk file. This prevents the files from

See Also: Your platform-specific Oracle Clusterware installation guide for more version compatibility information

Overview of Managing Oracle Clusterware Environments

1-10 Oracle Clusterware Administration and Deployment Guide

becoming single points of failure. Normal redundancy also eliminates the need for third-party storage redundancy solutions.

Overview of Managing Oracle Clusterware EnvironmentsThe following list describes the tools and utilities available to manage your Oracle Clusterware environment:

■ Oracle Enterprise Manager—Oracle Enterprise Manager has both the Database Control and Grid Control GUI interfaces for managing both single instance and Oracle RAC database environments. Oracle recommends using Oracle Enterprise Manager to perform administrative tasks.

■ Cluster Verification Utility (CVU)—CVU is a command-line tool that you can use to verify a range of cluster and Oracle RAC specific components such as shared storage devices, networking configurations, system requirements, and Oracle Clusterware, as well as operating system groups and users.

Install and use CVU before you install Oracle Clusterware to ensure your configuration meets the minimum installation requirements. Also, use CVU to verify your configuration after completing administrative tasks, such as node additions and node deletions. You can use CVU for preinstallation checks as well as for post-installation checks of your cluster environment. CVU is especially useful during preinstallation and during installation of Oracle Clusterware and Oracle RAC components.

■ Server Control (SRVCTL)—SRVCTL is a command-line interface that you can use to manage Oracle Clusterware, such as changing the VIP interface or nodeapps, from a single system.

■ Cluster Ready Services Control (CRSCTL)—CRSCTL is a command-line tool that you can use to manually control Oracle Clusterware. You use crsctl commands to start and stop Oracle Clusterware. The crsctl command has many options that help you perform a number tasks, such as enabling online debugging and dynamically adding and removing voting disks.

Note: If you choose external redundancy for the OCR and voting disk, then to enable redundancy, the disk subsystem must be configurable for RAID mirroring. Otherwise, your system may be vulnerable because the OCR and voting disk are single points of failure.

See Also: Oracle Database 2 Day + Real Application Clusters Guide and Oracle Real Application Clusters Administration and Deployment Guide for more information about administering Oracle Clusterware with Oracle Enterprise Manager

See Also: Your platform-specific Oracle Clusterware and Oracle RAC installation guide for information about how to manually install CVU, and Appendix A, "Cluster Verification Utility Reference" for more information about using CVU

See Also: Server Control Utility reference appendix in the Oracle Real Application Clusters Administration and Deployment Guide

Overview of Extending or Removing Oracle Clusterware in Grid Environments

Introduction to Oracle Clusterware 1-11

■ Oracle Interface Configuration Tool (OIFCFG)—OIFCFG is a command-line tool for both single-instance Oracle databases and Oracle RAC environments that you can use to allocate and deallocate network interfaces to components. You can also use OIFCFG to direct components to use specific network interfaces and to retrieve component configuration information.

■ OCR Configuration Tool (OCRCONFIG)—OCRCONFIG is a command-line tool for OCR administration. You can also use the OCRCHECK and OCRDUMP utilities to troubleshoot configuration problems that affect the OCR.

Overview of Extending or Removing Oracle Clusterware in Grid Environments

You can extend Oracle Clusterware in grid environments that have large numbers of nodes using cloned images of Oracle Clusterware homes. Oracle cloning is the preferred method of creating many new clusters by copying images of Oracle Clusterware software to other nodes that have similar hardware and software. Cloning is best suited to scenarios where you need to quickly create several clusters of the same configuration.

Oracle provides the following methods of extending Oracle Clusterware environments:

■ Oracle cloning procedure

■ Oracle Enterprise Manager cloning

■ The addNode.sh script

For new installations or if you have to install only one cluster, then you should use the traditional automated and interactive installation methods, such as Oracle Universal Installer or the Provisioning Pack feature of Oracle Enterprise Manager. If your goal is to add or delete Oracle Clusterware from nodes in the cluster, you can use the addNode.sh and rootdelete.sh scripts.

The cloning process assumes you successfully installed an Oracle Clusterware home on at least one node using the instructions in your platform-specific Oracle Clusterware installation guide. In addition, ensure that all root scripts run successfully on the node from which you are extending your cluster.

See Also: Chapter 2, "Administering Oracle Clusterware" for more information about the crsctl commands

See Also: Appendix C, "Oracle Interface Configuration (OIFCFG) Command Reference"

See Also: Chapter 2, "Administering Oracle Clusterware" for more information about managing the OCR

See Also:

■ Chapter 3, "Cloning Oracle Clusterware" for step-by-step cloning procedures

■ Oracle Enterprise Manager online Help system for more information about the Provisioning Pack

■ Chapter 4, "Adding and Deleting Oracle Clusterware Homes"

Overview of the Oracle Clusterware High Availability Framework and Application Programming Interface

1-12 Oracle Clusterware Administration and Deployment Guide

Overview of the Oracle Clusterware High Availability Framework and Application Programming Interface

Oracle Clusterware provides a high availability application programming interface (API) that you use to enable Oracle Clusterware to manage applications or processes that run a cluster. The API enables you to provide high availability for all of your applications. Oracle Clusterware with ASM enables you to create a consolidated pool of storage to support both single-instance Oracle databases and the Oracle RAC databases that are running.

You can define a virtual IP address for an application so users can access the application independently of the node in the cluster where the application is running. This is referred to as the application VIP. You can define multiple application VIPs, with generally one application VIP defined for each application running. The application VIP is tied to the application by making it dependent on the application resource defined by Cluster Ready Services (CRS).

To maintain high availability, Oracle Clusterware components can respond to status changes to restart applications and processes according to defined high availability rules. You can use the Oracle Clusterware high availability framework by registering your applications with Oracle Clusterware and configuring the clusterware to start, stop, or relocate your application processes. That is, you can make custom applications highly available by using Oracle Clusterware to create profiles that monitor, relocate, and restart your applications.

For Oracle RAC to respond consistently and quickly to a failure, the virtual IP address removes network timeouts from the recovery process. When a node fails, its virtual IP relocates to another node in the cluster.

See Also: Chapter 5, "Making Applications Highly Available Using Oracle Clusterware" for more detailed information about the Oracle Clusterware API

Administering Oracle Clusterware 2-1

2Administering Oracle Clusterware

This chapter describes how to administer the voting disks and the Oracle Cluster Registry (OCR) under the following topics:

■ Administering Voting Disks

■ Administering the Oracle Cluster Registry

■ Changing Network Addresses

Administering Voting DisksOracle Clusterware includes two important components: the voting disk and OCR.

■ The voting disk is a file that manages information about node membership.

■ OCR is a file that manages cluster and Oracle Real Application Clusters (Oracle RAC) database configuration information.

Oracle recommends that you select the option to configure multiple voting disks during Oracle Clusterware installation to improve availability. If necessary, you can dynamically add voting disks after you complete the Oracle Clusterware installation process.

After installation, use the following procedures to regularly backup your voting disks and to recover them as needed:

■ Backing Up Voting Disks

■ Recovering Voting Disks

■ Adding, Moving, or Deleting Voting Disks

Backing Up Voting DisksOracle recommends that you back up your voting disk after the initial cluster creation and after you complete any node addition or deletion procedures.

Determine the current voting disk by issuing the following command:

crsctl query css votedisk

Issue the dd or ocopy command to back up a voting disk, as appropriate. In the following examples, voting_disk_name is the name of the active voting disk and backup_file_name is the name of the file to which you want to back up the voting disk contents:

■ On Linux or UNIX systems:

dd if=voting_disk_name of=backup_file_name

Administering Voting Disks

2-2 Oracle Clusterware Administration and Deployment Guide

Oracle recommends you use the dd command to backup the voting disk with a minimum block size of 4KB.

■ On Windows systems, use the ocopy command:

ocopy voting_disk_name backup_file_name

To display online documentation for OCOPY, enter OCOPY by itself at the Windows prompt.

Recovering Voting DisksTo restore the backup of your voting disk, issue the dd or ocopy command for Linux and UNIX systems or for Windows systems respectively. In the following examples, backup_file_name is the name of the voting disk backup file and voting_disk_name is the name of the active voting disk:

■ On Linux or UNIX systems:

dd if=backup_file_name of=voting_disk_name

Oracle recommends that you use the dd command to backup the voting disk with a minimum block size of 4KB.

■ On Windows systems, use the ocopy command:

ocopy backup_file_name voting_disk_name

If you have multiple voting disks, then you can remove the voting disks and add them back into your environment using the following commands, where path is the complete path of the location where the voting disk resides:

crsctl delete css votedisk pathcrsctl add css votedisk path

Adding, Moving, or Deleting Voting DisksYou can add, move, and remove voting disks after Oracle Clusterware has been installed using the following commands:

■ To retrieve the current voting disk, issue the following command:

crsctl query css votedisk

This command lists the voting disks used by Cluster Synchronization Services (CSS).

■ To add a voting disk, issue the following command as the root user, replacing the path variable with the fully qualified path name for the voting disk you want to add:

crsctl add css votedisk path -force

■ To move a voting disk, issue the following commands as the root user, replacing the path variable with the fully qualified path name for the voting disk you want to move:

crsctl delete css votedisk path -forcecrsctl add css votedisk path -force

Administering the Oracle Cluster Registry

Administering Oracle Clusterware 2-3

■ To remove a voting disk, issue the following command as the root user, replacing the path variable with the fully qualified path name for the voting disk you want to remove:

crsctl delete css votedisk path -force

After modifying the voting disk, restart Oracle Clusterware using the crsctl start crs command on all nodes, and verify the voting disk location using the following command:

crsctl query css votedisk

Administering the Oracle Cluster RegistryThis section describes how to administer the OCR using the OCR tools: OCRCONFIG, OCRDUMP, and OCRCHECK.

The OCR contains information about the cluster node list, instance-to-node mapping information, and information about Oracle Clusterware resource profiles for applications that you have customized as described in Chapter 5, "Making Applications Highly Available Using Oracle Clusterware".

This section describes how to administer the OCR in the following topics:

■ Adding, Replacing, Repairing, and Removing the Oracle Cluster Registry

■ Managing Backups and Recovering the Oracle Cluster Registry

■ Diagnosing Oracle Cluster Registry Problems

■ Overriding the Oracle Cluster Registry Data Loss Protection Mechanism

■ Administering the Oracle Cluster Registry with OCR Export and Import Commands

■ Implementing the Oracle HARD Initiative for the Oracle Cluster Registry

■ Upgrading and Downgrading the Oracle Cluster Registry Configuration

Adding, Replacing, Repairing, and Removing the Oracle Cluster RegistryThe Oracle installation process for Oracle Clusterware gives you the option of automatically mirroring the OCR. You can put the mirrored OCR on an Oracle cluster file system disk, on a shared raw device, or on a shared raw logical volume. In addition, Oracle supports block devices for the OCR on Linux.

You can manually mirror the OCR, as described in the "Adding an Oracle Cluster Registry" section, if you:

■ Upgraded to release 11.1 but did not choose to mirror the OCR during the upgrade

Caution: If your cluster is down, then you can include the -force option to modify the voting disk configuration, without interacting with active Oracle Clusterware daemons. However, using the -force option while any cluster node is active may corrupt your configuration.

See Also: Appendix E, "Managing the Oracle Cluster Registry" for information about the OCRCONFIG tool, and Appendix F, "Troubleshooting Oracle Clusterware" for information about the OCRDUMP and OCRCHECK utilities

Administering the Oracle Cluster Registry

2-4 Oracle Clusterware Administration and Deployment Guide

■ Created only one OCR during the Oracle Clusterware installation

In addition to mirroring the OCR, you can also:

■ Replace the OCR if Oracle Database displays an OCR failure alert in Oracle Enterprise Manager or in the Oracle Clusterware alert log file, as described in the "Replacing an Oracle Cluster Registry" section.

■ Repair an OCR location if there is a misconfiguration or other type of OCR error, as described in the "Repairing an Oracle Cluster Registry Configuration on a Local Node" section.

■ Remove an OCR location if, for example, your system experiences a performance degradation due to OCR processing or if you transfer your OCR to RAID storage devices and chose to no longer use multiple OCRs. This is described in the "Removing an Oracle Cluster Registry" section.

Adding an Oracle Cluster Registry You can add an OCR location after an upgrade or after completing the Oracle Clusterware installation. If you already mirror the OCR, then you do not need to add an OCR location; Oracle Database automatically manages two OCRs when it mirrors the OCR. Oracle Clusterware environments do not support more than two OCRs: a primary OCR and a secondary OCR.

As the root user, issue the following command to add an OCR location using either destination_file or disk to designate the target location of the additional OCR:

ocrconfig -replace ocr destination_file or disk

Run the following command to add an OCR mirror location using either destination_file or disk to designate the target location of the additional OCR:

ocrconfig -replace ocrmirror destination_file or disk

Note: Oracle strongly recommends that you use mirrored OCRs if the underlying storage is not RAID. Mirroring can help to prevent the OCR from becoming a single point of failure.

Note: The operations in this section affect the OCR clusterwide: they change the OCR configuration information in the ocr.loc file on Linux and UNIX systems and the Registry keys on Windows systems. However, the ocrconfig command cannot modify OCR configuration information for nodes that are shut down or for nodes on which Oracle Clusterware is not running.

Note: If the OCR resides on a cluster file system file or if the OCR is on a network file system, then create a dummy file before performing the procedures in this section.

Note: On Linux and UNIX systems, you must be root user to run ocrconfig commands. On Windows systems, the user must be a member of the Administrator's group.

Administering the Oracle Cluster Registry

Administering Oracle Clusterware 2-5

Replacing an Oracle Cluster Registry If you need to change the location of an existing OCR, or change the location of a failed OCR to the location of a working one, you can use the following procedure as long as one OCR file remains online.

To change the location of an OCR:1. Use the OCRCHECK utility to verify that a copy of the OCR other than the one

you are going to replace is online, using the following command:

ocrcheck

OCRCHECK displays all OCR files that are registered and whether or not they are available (online). If an OCR file suddenly becomes unavailable, it might take a short period of time for Oracle Clusterware to show the change in status.

2. Use the following command to verify that Oracle Clusterware is running on the node on which the you are going to perform the replace operation:

crsctl check crs

3. Run the following command as root user to replace the primary OCR using either destination_file or disk to indicate the target OCR location:

ocrconfig -replace ocr destination_fileocrconfig -replace ocr disk

4. Run the following command as root user to replace a secondary OCR using either destination_file or disk to indicate the target OCR location:

ocrconfig -replace ocrmirror destination_fileocrconfig -replace ocrmirror disk

5. If any node that is part of your current Oracle RAC cluster is shut down, then run the following command on the stopped node to let that node rejoin the cluster after the node is restarted:

ocrconfig -repair

Repairing an Oracle Cluster Registry Configuration on a Local NodeYou may need to repair an OCR configuration on a particular node if your OCR configuration changes while that node is stopped. For example, you may need to repair the OCR on a node that was not up while you were adding, replacing, or removing an OCR. To repair an OCR configuration, run the following command on the node on which you have stopped the Oracle Clusterware daemon:

ocrconfig –repair ocrmirror device_name

This operation only changes the OCR configuration on the node from which you run this command. For example, if the OCR mirror device name is /dev/raw1, then use the command syntax ocrconfig -repair ocrmirror /dev/raw1 on this node to repair its OCR configuration.

Note: The OCR that you are replacing can be either online or offline.

Note: You cannot perform this operation on a node on which the Oracle Clusterware daemon is running.

Administering the Oracle Cluster Registry

2-6 Oracle Clusterware Administration and Deployment Guide

Removing an Oracle Cluster RegistryTo remove an OCR location, at least one other OCR must be online. You can remove an OCR location to reduce OCR-related overhead or to stop mirroring your OCR because you moved the OCR to redundant storage such as RAID.

Perform the following procedure as the root user to remove an OCR location from your Oracle Clusterware environment:

1. Ensure that at least one OCR other than the OCR that you are removing is online.

2. Run the following command on any node in the cluster to remove the OCR device:

ocrconfig -replace ocr

Run the following command on any node in the cluster to remove the mirrored OCR device:

ocrconfig -replace ocrmirror

These commands update the OCR configuration on all of the nodes on which Oracle Clusterware is running.

Overriding the Oracle Cluster Registry Data Loss Protection MechanismThe OCR has a mechanism that prevents data loss due to accidental overwrites. If you configure a mirrored OCR and if Oracle Clusterware cannot access the two mirrored OCR locations and also cannot verify that the available OCR location contains the most recent configuration, then Oracle Clusterware prevents further modification to the available OCR location. In addition, the process prevents overwriting by prohibiting Oracle Clusterware from starting on the node on which only one OCR is available. In such cases, Oracle Database displays an alert message in either Oracle Enterprise Manager, the Oracle Clusterware alert log files, or both. If this problem is local to only one node, you can use other nodes to start your cluster database.

However, if you are unable to start any cluster node in your environment and if you can neither repair the OCR nor restore access to all OCR locations, then you can override the protection mechanism. The procedure described in the following list enables you to start the cluster using the available OCR location. However, overriding the protection mechanism can result in the loss of data that was not available at the time that the previous known good state was created.

Caution: Do not perform this OCR removal procedure unless there is at least one other active OCR online.

Note: When removing an OCR location, the remaining OCR must be online. If you remove a primary OCR, then the mirrored OCR becomes the primary OCR.

Administering the Oracle Cluster Registry

Administering Oracle Clusterware 2-7

Perform the following procedure to overwrite the OCR if a node cannot start up and if the alert log contains CLSD-1009 and CLSD-1011 messages.

1. Attempt to resolve the cause of the CLSD-1009 and CLSD-1011 messages.

Compare the node's OCR configuration (ocr.loc on Linux and UNIX systems and the Registry on Windows systems) with other nodes on which Oracle Clusterware is running.

■ If the configurations do not match, then run ocrconfig -repair.

■ If the configurations match, then ensure that the node can access all of the configured OCRs by running an ls command on Linux and UNIX systems. On windows, use a dir command if the OCR location is a file and run GuiOracleObjectManager.exe to verify that the partition with the name exists.

2. Ensure that the most recent OCR contains the latest OCR updates.

Look at output from the ocrdump command and determine whether it has your latest updates.

3. If you cannot resolve the problem that caused the CLSD message, then run the command ocrconfig -overwrite to bring up the node.

Managing Backups and Recovering the Oracle Cluster RegistryThis section describes how to backup OCR content and use it for recovery. The first method uses automatically generated OCR file copies and the second method enables you to issue a backup command on demand:

■ Automatic backups—Oracle Clusterware automatically creates OCR backups every four hours. At any one time, Oracle Database always retains the last three backup copies of the OCR. The CRSD process that creates the backups also creates and retains an OCR backup for each full day and at the end of each week. You cannot customize the backup frequencies or the number of files that Oracle Database retains.

■ Manual backups—Use the ocrconfig -manualbackup command to force Oracle Clusterware to perform a backup of the OCR at any time, rather than wait for the automatic backup that occurs at 4-hour intervals. The -manualbackup option is especially useful when you to need to obtain a binary backup on demand, such as before you make changes to the OCR.

You can use any backup software to copy the automatically generated backup files at least once daily to a different device from where the primary OCR resides.

The default location for generating backups on Linux or UNIX systems is CRS_home/cdata/cluster_name where cluster_name is the name of your cluster. The Windows default location for generating backups uses the same path structure.

Note: Overriding the OCR using the following procedure can result in the loss of OCR updates that were made between the time of the last known good OCR update made to the currently accessible OCR and the time at which you performed the overwrite. In other words, running the ocrconfig -overwrite command can result in data loss if the OCR location that you are using to perform the overwrite does not contain the latest configuration updates for your cluster environment.

Administering the Oracle Cluster Registry

2-8 Oracle Clusterware Administration and Deployment Guide

Because the default backup is on a local file system, Oracle recommends that you include the backup file created with the ocrconfig command as part of your operating system backup using standard operating system or third-party tools.

You can issue the ocrconfig -showbackup command to display the backup location, timestamp, and the originating node name of the backup files that Oracle Database created. By default, the -showbackup option displays information for both automatic and manual backups but you can include the auto or manual flag to display only the automatic backup information or only the manual backup information, respectively.

Overview of Restoring the Oracle Cluster Registry from OCR BackupsIf an application fails, then before attempting to restore the OCR, restart the application. As a definitive verification that the OCR failed, run an ocrcheck and if the command returns a failure message, then both the primary OCR and the OCR mirror have failed. Attempt to correct the problem using one of the following platform-specific OCR restoration procedures.

■ Restoring the Oracle Cluster Registry on Linux or UNIX Systems

■ Restoring the Oracle Cluster Registry on Windows Systems

Restoring the Oracle Cluster Registry on Linux or UNIX SystemsUse the following procedure to restore the OCR on Linux or UNIX systems:

1. Identify the OCR backups using the ocrconfig -showbackup command. Review the contents of the backup using ocrdump -backupfile file_name where file_name is the name of the backup file.

2. Stop Oracle Clusterware on all of the nodes by executing the crsctl stop crs command on all of the nodes.

3. Perform the restore by applying an OCR backup file that you identified in Step 1 using the following command where file_name is the name of the OCR that you want to restore. Make sure that the OCR devices that you specify in the OCR

See Also: You can use the ocrconfig -backuploc option to move the location where OCR creates backups. Appendix E, "Managing the Oracle Cluster Registry" describes the OCR configuration tool options.

Note: On Linux and UNIX systems, you must be root user to run ocrconfig commands. On Windows systems, the user must be a member of the Administrator's group.

See Also: "Administering the Oracle Cluster Registry with OCR Export and Import Commands" on page 2-10 to use manually created OCR export files to copy OCR content and use it for recovery

Note: You cannot restore your configuration from an OCR backup file using the -import option, which is explained in "Administering the Oracle Cluster Registry with OCR Export and Import Commands" on page 2-10. You must instead use the -restore option, as described in the following sections.

Administering the Oracle Cluster Registry

Administering Oracle Clusterware 2-9

configuration exist and that these OCR devices are valid before running this command.

ocrconfig -restore file_name

4. Restart Oracle Clusterware on all of the nodes in your cluster by restarting each node or by running the crsctl start crs command.

5. Run the following command to verify the OCR integrity where the -n node_list all argument retrieves a listing of all of the cluster nodes that are configured as part of your cluster:

cluvfy comp ocr [-n node_list] [-verbose]

Restoring the Oracle Cluster Registry on Windows SystemsUse the following procedure to restore the OCR on Windows systems:

1. Identify the OCR backups using the ocrconfig -showbackup command. Review the contents of the backup using ocrdump -backupfile file_name where file_name is the name of the backup file.

2. On all of the remaining nodes, disable the following OCR clients and stop them using the Service Control Panel: OracleClusterVolumeService, OracleCSService, OracleCRService, and the OracleEVMService.

3. Execute the restore by applying an OCR backup file that you identified in Step 1 with the ocrconfig -restore file name command. Make sure that the OCR devices that you specify in the OCR configuration exist and that these OCR devices are valid.

4. Start all of the services that were stopped in step 2. Restart all of the nodes and resume operations in cluster mode.

5. Run the following command to verify the OCR integrity where the -n node_list | all argument retrieves a listing of all of the cluster nodes that are configured as part of your cluster:

cluvfy comp ocr [-n node_list] | all [-verbose]

Diagnosing Oracle Cluster Registry ProblemsYou can use the OCRDUMP and OCRCHECK utilities to diagnose OCR problems as described under the following topics:

■ Using the OCRDUMP Utility

■ Using the OCRCHECK Utility

Using the OCRDUMP UtilityUse the OCRDUMP utility to write the OCR contents to a file so that you can examine the OCR content.

See Also: Appendix F, "Troubleshooting Oracle Clusterware" for more information about enabling and using CVU

See Also: Appendix A for more information about enabling and using CVU

See Also: "OCRDUMP Utility Syntax and Options" on page F-9 for more information about the OCRDUMP utility

Administering the Oracle Cluster Registry

2-10 Oracle Clusterware Administration and Deployment Guide

Using the OCRCHECK UtilityUse the OCRCHECK utility to verify the OCR integrity.

Administering the Oracle Cluster Registry with OCR Export and Import CommandsIn addition to using the automatically created OCR backup files, you should also export the OCR contents before and after making significant configuration changes, such as adding or deleting nodes from your environment, modifying Oracle Clusterware resources, or creating a database. Do this by using the ocrconfig -export command. This exports the OCR content to a file format.

Using the ocrconfig -export command enables you to restore the OCR using the -import option if your configuration changes cause errors. For example, if you have unresolvable configuration problems, or if you are unable to restart your clusterware after such changes, then restore your configuration using one of the following platform-specific procedures:

■ Importing Oracle Cluster Registry Content on Linux or UNIX Systems

■ Importing Oracle Cluster Registry Content on Windows Systems

Importing Oracle Cluster Registry Content on Linux or UNIX Systems Use the following procedure to import the OCR on Linux or UNIX systems:

1. Stop Oracle Clusterware by issuing the following command (as root user) on all of the nodes:

crsctl stop crs

2. Import the OCR by applying an OCR export file by issuing the following command (as root user), where file_name is the name of the OCR file from which you want to import OCR information:

ocrconfig -import file_name

3. Restart Oracle Clusterware on all of the nodes in your cluster:

crsctl start crs

4. Verify the OCR integrity by issuing the following Cluster Verification Utility (CVU) command, where the -n node_list all argument retrieves a listing of all of the cluster nodes that are configured as part of your cluster:

cluvfy comp ocr [-n node_list] [-verbose]

See Also: "Using the OCRCHECK Utility" on page F-11 for more information about the OCRCHECK utility

Note: Most configuration changes that you make not only change the OCR contents, configuration changes also cause file and database object creation. Some of these changes are often not restored when you restore the OCR. Do not perform an OCR restore as a correction to revert to previous configurations if some of these configuration changes should fail. This may result in an OCR that has contents that do not match the state of the rest of your system.

Changing Network Addresses

Administering Oracle Clusterware 2-11

Importing Oracle Cluster Registry Content on Windows Systems Use the following procedure to import the OCR on Windows systems:

1. Identify the OCR export file that you want to import by running the ocrconfig -showbackup command.

2. Stop the following OCR clients on each node in your Oracle Clusterware environment using the Service Control Panel: OracleClusterVolumeService, OracleCMService, OracleEVMService, OracleCSService, and OracleCRService.

3. Import an OCR export file using the ocrconfig -import command from one node.

4. Restart all of the affected services on all of the nodes.

5. Run the following Cluster Verification Utility (CVU) command to verify the OCR integrity where node_list is a list of all of the nodes in your cluster database:

cluvfy comp ocr [-n node_list] [-verbose]

Implementing the Oracle HARD Initiative for the Oracle Cluster RegistryThe Oracle Hardware Assisted Resilient Data (HARD) initiative prevents data corruptions from being written to permanent storage. If you enable HARD, then the OCR writes HARD-compatible blocks. To determine whether the device used by the OCR supports HARD and then enable it, review the Oracle HARD white paper at:

http://www.oracle.com/technology/deploy/availability/htdocs/HARD.html

Upgrading and Downgrading the Oracle Cluster Registry ConfigurationWhen you install Oracle Clusterware, Oracle Database automatically runs the ocrconfig -upgrade command. To downgrade, follow the downgrade instructions for each component and also downgrade the OCR using the ocrconfig -downgrade command. If you are upgrading the OCR, then you can use the ocrcheck command to verify the integrity of the OCR.

Changing Network AddressesIn an Oracle Clusterware configuration, there must be a minimum of two network connections. One connection is for the public network interface on which users and application servers connect to access data on the database server, and the other connection is for the private network interface between the nodes in the cluster.

This section contains the following topics:

Note: You cannot import an exported OCR backup file. (Managing backups is described in "Managing Backups and Recovering the Oracle Cluster Registry" on page 2-7.) You must instead use the -import option as described in the following sections.

See Also: Appendix F, "Troubleshooting Oracle Clusterware" for more information about enabling and using CVU

See Also: Appendix F, "Troubleshooting Oracle Clusterware" for more information about enabling and using CVU

Changing Network Addresses

2-12 Oracle Clusterware Administration and Deployment Guide

■ Changing Public Network Addresses

■ Changing Private Network Addresses

Changing Public Network AddressesClients use the Virtual Internet Protocol (VIP) address to connect to the Oracle RAC database instances. The VIP address resolves all timeouts entering the TCP/IP stack, resulting in higher application availability. The VIP address is a static IP address that defines and resolves to a hostname through the Domain Name System (DNS).

During the installation of Oracle Clusterware, you are prompted to enter a VIP address and a virtual hostname for each node in the cluster. These items are stored in the OCR and different components in the high availability framework depend on the VIP addresses.

Changing the VIP address requires modification of the node applications, including the VIP address, Global Services Daemon (GSD), the listener, and Oracle Notification Services (ONS). You can modify the VIP address while the node applications are running. However, the changes do not take effect until you restart the VIP address (hence, restarting the node applications). Other resources on a node, such as database instances and ASM instances, are dependent on the VIP address. Therefore, stopping the node applications takes the VIP address offline only if you stop the other resources.

Perform the following steps to change a VIP address:

1. Stop all database and ASM instances. Stop all resources that are dependent on the VIP address on a given node. This includes all Oracle RAC database instances on that node, and the ASM instance on that node, if it exists:

a. Stop database instances:

srvctl stop instance -d grid -i grid1

This example specifies the database name (grid) using the -d option and specifies the instance (grid1) on the appropriate node using the -i option.

Note: The following instructions describe how to change only a VIP address and assume that the hostname associated with the VIP address will not change.

■ If you are changing only the VIP address, then it is not necessary to make changes to the listener.ora and tnsnames.ora files, provided they are using the virtual hostnames for the HOST= entries.

■ If you are changing only the VIP address, then update the Domain Name System (DNS) or the hosts of the clients. Also, update the hosts entries of the server, too, if those are used for VIPs.

■ If you are changing both the virtual hostname and the VIP address for a node, then you must modify the listener.ora file and change the HOST= entries to the new virtual hostname.

You can use an editing tool to modify the listener.ora file manually or use the Net Configuration Assistant (NETCA) to remove the old listener and create a new listener. In addition, you do not need to make changes to the tnsnames.ora file of any clients connecting to the old hostname.

Changing Network Addresses

Administering Oracle Clusterware 2-13

Alternatively, to stop the entire database, on all nodes, you can issue the stop database command, as follows:

srvctl stop database -d grid

b. To stop the ASM instance on the node, issue the following command:

srvctl stop asm -n mynode

2. Confirm the current IP address for the VIP by issuing the ifconfig -a command. This command displays the current VIP bound to one of the network interfaces. The following example displays the current VIP address that is bound to one of the network interfaces (eth0:1):

eth0:1 Link encap:Ethernet HWaddr 00:01:03:2C:69:BB inet addr:192.168.1.125 Bcast:192.168.1.255 Mask:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 ... ...

3. Stop node applications using the srvctl stop nodeapps command. For example:

srvctl stop nodeapps -n mynode

This command stops the VIP address, the GSD, the listener, and the ONS daemon currently running on the specified node.

4. Verify that the VIP address is no longer running by issuing the ifconfig -a command on Linux and UNIX systems (or issue the ipconfig /all command on Windows systems), and confirm that the interface (in the example it was eth0:1) is no longer listed in the output.

If the interface is still online, it indicates that a resource, which is dependent on the VIP address, is still running. Use the crs_stat command to show resources that are still online.

5. Make any changes necessary to the /etc/hosts files on all nodes on Linux and Unix systems, or the %windir%\system32\drivers\etc\hosts file on Windows systems, and make any necessary DNS changes to associate the new IP address with the old host name.

6. Modify the node applications and provide the new VIP address using the following srvctl modify nodeapps command syntax:

srvctl modify nodeapps -n node_name; [-o oracle_home] [-A new_vip_address]

In the syntax statement include the following options:

Note: Alternatively, you can stop these resources using SQL*Plus statements, or on Windows systems you can stop the associated services.

Note: On Windows systems, issue the ipconfig /all command, to list all IP addresses bound to a given adapter, including the VIP address.

Changing Network Addresses

2-14 Oracle Clusterware Administration and Deployment Guide

■ -n node_name is the node name

■ -o Oracle_home is the Oracle home for the cluster database.

■ -A new_vip_address is the node-level VIP address: name|ip/netmask[/if1[|if2|...]])

For example, issue the following command as the root user:

srvctl modify nodeapps -n mynode -A 192.168.2.125/255.255.255.0/eth0

Attempting to issue this command as the oracle user results in the PRKO-2117: This command should be executed as the system privilege user error

7. Start node-level applications by issuing the srvctl start nodeapps command:

srvctl start nodeapps -n node_name

The following command example starts applications on the node named mynode:

srvctl start nodeapps -n mynode

8. Repeat the steps for each node in the cluster.

Because the SRVCTL utility is a cluster-wide management tool, you can accomplish these tasks for any specific node from any node in the cluster, without the need to log in to each of the cluster nodes.

9. Run the following command to verify node connectivity between all of the nodes for which your cluster is configured. This command discovers all of the network interfaces available on the cluster nodes and verifies the connectivity between all of the nodes by way of the discovered interfaces. This command also lists all of the interfaces available on the nodes which are suitable for use as VIP addresses.

cluvfy comp nodecon -n node_list all [-verbose]

Changing Private Network AddressesOracle Clusterware requires that each node is connected through a private network (in addition to the public network). The private network connection is referred to as the cluster interconnect. During the installation of Oracle Clusterware, you are prompted to enter a private network IP address for each node in the cluster. This address—unlike the VIP addresses—must be assigned already to a certain network interface on each node.

In addition, you must provide a private node name, which resolves to the IP address given, and is usually stored in the hosts file of the operating system. This private node name is stored automatically in the OCR. Oracle does not recommend using Domain Name Services (DNS) to resolve private network IP address names.

Oracle currently only supports clusters in which all nodes use the same network interface connected to the same subnet (defined as a global interface with the oifcfg command). You cannot use different network interfaces for each node (node-specific interfaces). See Appendix C, "Oracle Interface Configuration (OIFCFG) Command Reference" for more information about global and node-specific interfaces.

Table 2–1 describes how the Network Interface Card (NIC), the private IP address, and the private node name are stored.

Changing Network Addresses

Administering Oracle Clusterware 2-15

The following topics describe how to change a private network address, the network interface, or both.

To change a private network IP address:If you want to change a private network IP address on one node and the new IP address is on the same subnet as the former IP address, perform the following steps:

1. Stop Oracle Clusterware by issuing the following command as root user on the node on which you want to change the IP address:

crsctl stop crs

2. Assign a new network address to the Network Interface Card (NIC).

As root user, assign a new network address to the NIC using the ifconfig operating system command. Ensure the new private IP address is in the same subnet as the private IP addresses of the remaining nodes in the cluster.

3. Changes made by ifconfig are not persistent. In order to make ifconfig changes persistent, update the network configuration settings on the operating system, accordingly.

4. Restart Oracle Clusterware by issuing the following command as root user

crsctl start crs

The changes take effect when Oracle Clusterware restarts. To change more than one private IP address on more than one node in the cluster, repeat steps 1 through 3 in a rolling manner for each of those nodes.

Table 2–1 Storage for the NIC, Private IP Address, and Private Host Name

Entity Stored in ... Comments

Network Interface Card (NIC) name

Operating system

For example: eth1

Must be the same on all nodes. It can be changed globally.

Private network IP address

Operating system; assigned to the network interface. For example: eth1 10.10.1.1

Configure this as root using the ifconfig command. You can change the IP address locally if it is in the same subnet. Otherwise, change this globally.

Private node name The /etc/hosts file, which resolves the private IP address by name

You cannot use the steps described in this section to change the private node name, which is also stored in the OCR, after Oracle Clusterware is installed.

Note: You cannot use the steps in this section to change the private node name after you have installed Oracle Clusterware.

Note: Oracle does not recommend that you use the Domain Name System (DNS) for the private node name resolution. However, if you use DNS, you must change the DNS entries accordingly.

See Also: Your operating system documentation for more information about how to make ifconfig commands persistent

Changing Network Addresses

2-16 Oracle Clusterware Administration and Deployment Guide

To change a private network IP address so that a new subnet is used:If you want to change a private network IP address so that a new subnet is used, you must change all IP addresses on all cluster nodes to reflect the change of the IP subnet. Perform the following steps to change the subnet:

1. On all nodes, stop Oracle Clusterware by issuing the following command as root user:

crsctl stop crs

2. Assign a new network address from the new subnet to the NIC.

As root user, assign a new network address to the NIC using the ifconfig operating system command. Ensure all private IP addresses are taken from the same subnet.

3. Changes made by ifconfig are not persistent. In order to make ifconfig changes persistent, update the network configuration settings on the operating system, accordingly.

4. Restart Oracle Clusterware by issuing the following command as root user

crsctl start crs

The changes take effect when Oracle Clusterware restarts.

5. Issue the oifcfg command to change and store the new private IP address subnet in the OCR. See Appendix C for more information about using the OIFCFG command. The IP address changes take effect when Oracle Clusterware restarts.

If you use the CLUSTER_INTERCONNECTS initialization parameter, you must update it to reflect the changes. If you do not use the CLUSTER_INTERCONNECT parameter, then skip this step.

Note: Oracle does not recommend that you use the Domain Name System (DNS) for the private node name resolution. However, if you use DNS, you must change the DNS entries accordingly.

See Also: Your operating system documentation for more information about how to make ifconfig commands persistent

Note: To minimize downtime, you can perform these steps, as follows:

1. Perform step 1 (to stop Oracle Clusterware) on all nodes in the cluster except one.

2. Perform steps 2 and 3 to change the IP address on each stopped node, but complete the steps on one node before performing the steps on the next node.

3. Perform step 1 to stop Oracle Clusterware on the running (unchanged) node.

4. Restart the nodes on which you have already changed the IP address.

5. Perform steps 2 and 3 to change the IP address on the last unchanged node.

6. Restart the last node.

Changing Network Addresses

Administering Oracle Clusterware 2-17

The private network address is a static IP address. Changes made by ifconfig are not persistent. In order to make ifconfig changes persistent, update the network configuration settings on the operating system, accordingly.

To change the Network Interface Card (NIC) for the interconnect:If you want to change the network interface for the private interconnect (for example, eth1), you must perform the change on all nodes (globally). This is because Oracle currently does not support the use of different network interface cards in the same subnet for the cluster interconnect.

Perform the following steps:

1. On all nodes, stop Oracle Clusterware by issuing the following command as root user:

crsctl stop crs

2. Assign the current network address to the new NIC using the ifconfig command.

As root user, issue the ifconfig operating system command to assign the currently used private network address to the NIC intended to be used for the interconnect. This usually requires some downtime for the current interface and the new interface. See your platform-specific operating system documentation for more information about issuing the ifconfig command.

3. Changes made by ifconfig are not persistent. In order to make ifconfig changes persistent, update the network configuration settings on the operating system, accordingly.

4. Restart Oracle Clusterware by issuing the following command as root user on all nodes:

crsctl start crs

5. Issue the oifcfg command to change and store the new private network/private IP address subnet assignment in the OCR. See Appendix C for more information about using the OIFCFG command. The changes take effect when Oracle Clusterware restarts.

Note: Oracle does not recommend setting the CLUSTER_INTERCONNECTS parameter.

Note: Oracle does not recommend that you use the Domain Name System (DNS) for the private node name resolution. However, if you use DNS, you must change the DNS entries accordingly.

Note: Oracle does not recommend that you use the Domain Name System (DNS) for the private node name resolution. However, if you use DNS, you must change the DNS entries accordingly.

See Also: Your operating system documentation for more information about how to make ifconfig commands persistent

Changing Network Addresses

2-18 Oracle Clusterware Administration and Deployment Guide

If you want to change both the private IP address and the network interface card in one step, assign the new IP address to the new NIC and then perform the remaining steps accordingly.

Note: To minimize downtime, you can perform these steps as follows:

1. Perform step 1 (to stop Oracle Clusterware) on all nodes in the cluster except one.

2. Perform steps 2 and 3 to change the IP address on each stopped node, but complete the steps on one node before performing the steps on the next node.

3. Perform step 1 to stop Oracle Clusterware on the running (unchanged) node.

4. Restart the nodes on which you have already changed the IP address.

5. Perform steps 2 and 3 to change the IP address on the last unchanged node.

6. Restart the last node.

Cloning Oracle Clusterware 3-1

3Cloning Oracle Clusterware

This chapter describes how to clone an existing Oracle Clusterware home and use it to create a new cluster or to extend Oracle Clusterware to new nodes on the same cluster. You implement cloning through the use of scripts in silent mode.

The cloning procedures described in this chapter are applicable to Linux, UNIX, and Windows systems. Although the examples in this chapter use Linux and UNIX commands, the cloning concepts and procedures apply to all platforms. For the Windows platform, you need to adjust the examples or commands to be Windows specific.

This chapter contains the following topics:

■ Introduction to Cloning Oracle Clusterware

■ Preparing the Oracle Clusterware Home for Cloning

■ Cloning Oracle Clusterware to Create a New Cluster

■ Cloning to Extend Oracle Clusterware to More Nodes in the Same Cluster

■ Cloning Script Variables Reference

■ Locating and Viewing Log Files Generated During Cloning

Introduction to Cloning Oracle ClusterwareCloning is the process of copying an existing Oracle installation to a different location and then updating the copied installation to work in the new environment. The changes made by one-off patches applied on the source Oracle home are also present after the clone operation. During cloning, you run a script that replays the actions that installed the Oracle Clusterware home.

Cloning requires that you start with a successfully installed Oracle Clusterware home that you use as the basis for implementing a script that extends the Oracle Clusterware home to either create a new cluster or to extend the Oracle Clusterware environment to more nodes in the same cluster. Manually creating the cloning script can be prone to errors, because you must prepare the script without the benefit of any interactive checks to validate your input. Despite this, the initial effort is worthwhile for scenarios where you run a single script to install tens or even hundreds of clusters. If you have only one cluster to install, then you should use the traditional automated and interactive installation methods, such as Oracle Universal Installer or the Provisioning Pack feature of Oracle Enterprise Manager.

Introduction to Cloning Oracle Clusterware

3-2 Oracle Clusterware Administration and Deployment Guide

The following list describes some situations in which cloning is useful:

■ Cloning provides a way to prepare a Oracle Clusterware home once and deploy it to many hosts simultaneously. You can complete the installation in silent mode, as a noninteractive process. You do not need to use a graphical user interface (GUI) console, and you can perform cloning from a Secure Shell (SSH) terminal session, if required.

■ Cloning enables you to create a new installation (copy of a production, test, or development installation) with all patches applied to it in a single step. Once you have performed the base installation and applied all patch sets and patches on the source system, the clone performs all of these individual steps as a single procedure. This is in contrast to going through the installation process to perform the separate steps to install, configure, and patch the installation on each node in the cluster.

■ Installing Oracle Clusterware by cloning is a quick process. For example, cloning an Oracle Clusterware home to a new cluster with more than two nodes requires a few minutes to install the Oracle software, plus a few minutes more for each node (approximately the amount of time it takes to run the root.sh script).

■ Cloning provides a guaranteed method of repeating the same Oracle Clusterware installation on multiple clusters.

The cloned installation acts the same as the source installation. For example, you can remove the cloned Oracle Clusterware home using Oracle Universal Installer or patch it using OPatch. You can also use the cloned Oracle Clusterware home as the source for another cloning operation. You can create a cloned copy of a test, development, or production installation by using the command-line cloning scripts. The default cloning procedure is adequate for most cases. However, you can also customize some aspects of the cloning process, for example, to specify custom port assignments or to preserve custom settings.

The cloning process works by copying all of the files from the source Oracle Clusterware home to the destination Oracle Clusterware home. Thus, any files used by the source instance that are located outside the source Oracle Clusterware home's directory structure are not copied to the destination location.

The size of the binary files at the source and the destination may differ because these are relinked as part of the cloning operation, and the operating system patch levels may also differ between these two locations. Additionally, the number of files in the cloned home would increase because several files copied from the source, specifically those being instantiated, are backed up as part of the clone operation.

Note: Cloning is not a replacement for Oracle Enterprise Manager cloning that is a part of the Provisioning Pack. During Oracle Enterprise Manager cloning, the provisioning process simplifies the process by interactively asking you the details about the Oracle home (such as the location to which you want to deploy the clone, the name of the Oracle database home, a list of the nodes in the cluster, and so on).

The Provisioning Pack feature of Oracle Enterprise Manager Grid Control provides a framework that automates the provisioning of new nodes and clusters. For data centers with many clusters, the investment in creating a cloning procedure to provision new clusters and new nodes to existing clusters is worth the effort.

Preparing the Oracle Clusterware Home for Cloning

Cloning Oracle Clusterware 3-3

Preparing the Oracle Clusterware Home for CloningTo prepare the source Oracle Clusterware home to be cloned, you create a copy of an installed Oracle Clusterware home that you then use to perform the cloning procedure on one or more nodes.

Use the following step-by-step procedure to prepare a copy of the Oracle Clusterware home.

Step 1 Install Oracle Clusterware.Use the detailed instructions in your platform-specific Oracle Clusterware installation guide to perform the following steps on the source node:

1. Install the Oracle Clusterware 11g Release 1 (11.1).

2. Install any patches that are required (for example, 11.1.0.n), if necessary.

3. Apply one-off patches, if necessary.

Step 2 Shutdown Oracle Clusterware.Before copying the source Oracle Clusterware home, shut down Oracle Clusterware using the crsctl stop crs command. The following example shows the command and the messages that display during the shutdown:

[root@node1 root]# crsctl stop crsStopping resources.This could take several minutes.Successfully stopped Oracle Clusterware resourcesStopping Cluster Synchronization Services.Shutting down the Cluster Synchronization Services daemon.Shutdown request successfully issued.

Note that you copy the Oracle Clusterware home from only one of the nodes.

Step 3 Create a copy of the Oracle Clusterware home.To keep the installed Oracle Clusterware home as a working home, make a full copy of the source Oracle Clusterware home that you will use for cloning. Because the Oracle Clusterware home contains files that are relevant only to the source node, you can optionally remove the unnecessary files from the copy.

Use one of the following methods to create a compressed copy of the Oracle Clusterware home.

Method 1: Create a copy of the Oracle Clusterware home and remove the unnecessary files from the copy:

1. On the source node, create a copy of the Oracle Clusterware home. To keep the installed Oracle Clusterware home as a working home, make a full copy of the source Oracle Clusterware home and remove the unnecessary files from the copy. For example, as the root user on Linux systems, run the cp command:

cp -prf CRS_HOME location_of_the_copy_of_CRS_HOME

2. Delete unnecessary files from the copy.

The Oracle Clusterware home contains files that are relevant only to the source node, so you can remove the unnecessary files from the copy in the log,

Note: When creating the copy, the best practice is to include the release number in the name of the file.

Preparing the Oracle Clusterware Home for Cloning

3-4 Oracle Clusterware Administration and Deployment Guide

crs/init, racg/dump, srvm/log, and cdata directories. The following example for Linux and UNIX systems shows the commands you can run to remove unnecessary files from the copy of the Oracle Clusterware home:

[root@node1 root]# cd /opt/oracle/product/11g/crs[root@node1 crs]# rm –rf ./opt/oracle/product/11g/crs/log/hostname[root@node1 crs]# find . -name '*.ouibak' -exec rm {} \;[root@node1 crs]# find . -name '*.ouibak.1' -exec rm {} \;[root@node1 crs]# rm –rf root.sh*[root@node1 crs]# cd cfgtoollogs[root@node1 cfgtoollogs]# find . -type f -exec rm -f {} \;

3. Create a compressed copy of the above copied Oracle Clusterware home using tar or gzip on Linux and UNIX systems and WinZip on Windows systems. Ensure that the tool you use preserves the permissions and file timestamps. For example:

■ On Linux and UNIX systems:

[root@node1 root]# cd /opt/oracle/product/11g/crs/[root@node1 crs]# tar –zcvf /pathname/crs11101.tgz .

In the example, the cd command changes the location to the Oracle Clusterware home, and the tar command creates the copy named crs11101.tgz. In the tar command, the pathname variable represents the location of the file.

■ On AIX or HPUX systems:

tar cpf - . | compress -fv > temp_dir/crs11101.tar.Z

■ On Windows systems, use WinZip to create a zip file

Method 2: Create a compressed copy of the Oracle Clusterware home using the –X option:

1. Create a file that lists the unnecessary files in the Oracle Clusterware home. For example, list the following file names (using the (*) wild card) in a file called excludeFileList:

./opt/oracle/product/11g/crs/log/hostname

./opt/oracle/product/11g/crs/root.sh*

2. Use the tar command or Winzip to create a compressed copy of the Oracle Clusterware home. For example, on Linux and UNIX systems, run the following command to archive and compress the source Oracle Clusterware home:

tar cpfX - excludeFileList . | compress -fv > temp_dir/crs11101.tar.Z

Step 4 Remove unnecessary files from the copy of the Oracle Clusterware home.The Oracle Clusterware home contains files that are relevant only to the source node so you should remove the unnecessary files from the copy. You should exclude files in the log, crs/init, racg/dump, srvm/log, and cdata directories.

Use one of the following methods to exclude files from your backup file:

■ Make a copy of the source Oracle Clusterware home and delete the unnecessary files from the copy.

Note: Do not use the jar utility to copy and compress the Oracle Clusterware home.

Cloning Oracle Clusterware to Create a New Cluster

Cloning Oracle Clusterware 3-5

The following example shows the commands you can run to remove unnecessary files from the copy of the Oracle Clusterware home:

[root@node1 root]# cd /opt/oracle/product/11g/crs[root@node1 crs]# rm –rf ./opt/oracle/product/11g/crs/log/hostname[root@node1 crs]# find . -name '*.ouibak' -exec rm {} \;[root@node1 crs]# find . -name '*.ouibak.1' -exec rm {} \;[root@node1 crs]# rm –rf ./cdata/*[root@node1 crs]# rm –rf root.sh*[root@node1 crs]# cd cfgtoollogs[root@node1 cfgtoollogs]# find . -type f -exec rm -f {} \;

■ Create an excludeFileList file and then use the tar command or Winzip to create a copy of the Oracle Clusterware home. For example, on Linux, run the tar cpfX - excludFileList.txt command to create a tar file that does not excludes the unnecessary files.

Step 5 Create a copy of the source Oracle Clusterware home.On the source node, create a copy of the Oracle Clusterware home using tar or gzip on Linux and UNIX systems and WinZip on Windows systems. Make sure that the tool that you use preserves the permissions and file timestamps.

When creating the copy, the best practice is to include the release number in the name of the file. For example, the following Linux example uses the cd command to change to the Oracle Clusterware home location, and then uses the tar command to create the copy named crs11101.tgz.

The following examples describe how to archive and compress the source Oracle Clusterware home on various platforms:

■ On Linux and UNIX systems, run the following command if you are using an excludeFileList file:

tar cpfX - excludeFileList . | compress -fv > temp_dir/crs11101.tar.Z

The following example shows the Linux and UNIX commands to create a copy when you are not using an excludeFileList file. In the tar command, the pathname variable represents the location of the file:

[root@node1 root]# cd /opt/oracle/product/11g/crs/[root@node1 crs]# tar –zcvf /pathname/crs11101.tgz .

■ On AIX or HPUX systems:

tar cpf - . | compress -fv > temp_dir/crs11101.tar.Z

■ On Windows systems, use WinZip to create a zip file

Cloning Oracle Clusterware to Create a New ClusterThis section explains how to create a new cluster by cloning a successfully installed Oracle Clusterware environment and copying it to the nodes on the destination cluster. The procedures in this section describe how to use cloning for Linux, UNIX, and Windows systems.

Note: Do not use the jar utility to copy and compress the Oracle Clusterware home.

Cloning Oracle Clusterware to Create a New Cluster

3-6 Oracle Clusterware Administration and Deployment Guide

For example, you can use cloning to quickly duplicate a successfully installed Oracle Clusterware environment to create a new cluster. Figure 3–1 shows the end result of a cloning procedure in which the Oracle Clusterware home on Node 1 has been cloned to Node 2 and Node 3 on Cluster 2, making Cluster 2 a new two-node cluster.

Figure 3–1 Cloning to Create a New Oracle Clusterware Environment

At a high level, the steps to create a new cluster through cloning are as follows:

1. Prepare the new cluster nodes

2. Deploy Oracle Clusterware on the destination nodes

3. Run the clone.pl script on each destination node

4. Run the orainstRoot.sh script on each node

5. Run the CRS_home/root.sh script

6. Run the configuration assistants and the Oracle Cluster Verify utility

Step 1 Prepare the new cluster nodesOn each destination node, perform the following preinstallation steps:

■ Specify the kernel parameters.

■ Configure block devices for Oracle Clusterware devices.

■ Ensure you have set the block device permissions correctly.

■ Use short, nondomain-qualified names for all names in the Hosts file.

■ Test whether or not the interconnect interfaces are reachable using the ping command.

■ Verify that the VIP addresses are not active at the start of the cloning process by using the ping command (the ping command of the VIP address must fail).

■ Run the Cluster Verification Utility (CVU) to verify your hardware and operating system environment.

See your platform-specific Oracle Clusterware installation guide for the complete preinstallation checklist.

Note: Unlike traditional methods of installation, the cloning process does not validate your input during the preparation phase. (By comparison, during the traditional method of installation using Oracle Universal Installer, various checks take place during the interview phase.) Thus, if you make any mistakes during the hardware setup or in the preparation phase, then the cloned installation will fail.

hb

hbNode 1

Cluster 1

Node 2

Cluster 2

Node 3

Heartbeat hb

Cloning Oracle Clusterware to Create a New Cluster

Cloning Oracle Clusterware 3-7

Step 2 Deploy Oracle Clusterware on the destination nodesBefore you begin the cloning procedure described in this section, ensure that you have completed the prerequisite tasks to create a copy of the Oracle Clusterware home, as described in the "Preparing the Oracle Clusterware Home for Cloning" section on page 3-3.

On each destination node, deploy the copy of the Oracle Clusterware home by performing the following steps:

1. If you do not have a shared Oracle Clusterware home, then restore the copy of the Oracle Clusterware home on each node in the destination cluster in the equivalent directory structure as the directory structure in which the Oracle Clusterware home resided on the source node. Skip this step if you have a shared Oracle Clusterware home.

For example:

■ On Linux or UNIX systems, run commands similar to the following:

[root@node1 root]# mkdir -p /opt/oracle/product/11g/crs[root@node1 root]# cd /opt/oracle/product/11g/crs[root@node1 crs]# tar –zxvf /pathname/crs11101.tgz

In the example, the pathname variable represents the directory structure in which you want to install the Oracle Clusterware home.

■ On Windows systems, unzip the Oracle Clusterware home on the destination node in the equivalent directory structure as the directory structure in which the Oracle Clusterware home resided on the source node.

2. Change the ownership of all files to oracle:oinstall group, and create a directory for the Oracle Inventory. For example, the following commands are for a Linux system:

[root@node1 crs]# chown –R oracle:oinstall /opt/oracle/product/11g/crs[root@node1 crs]# mkdir -p /opt/oracle/oraInventory[root@node1 crs]# chown oracle:oinstall /opt/oracle/oraInventory

3. Run the preupdate.sh script from the CRS_Home/install directory on each target node as follows:

@ preupdate.sh -crshome target_crs_oh -crsuser user_who_runs_cloning -noshutdown

Step 3 Run the clone.pl script on each destination nodeTo set up the new Oracle Clusterware environment, the clone.pl script requires you to provide a number of setup values to the script. You can supply the variables by either supplying input when you run the clone.pl script, or by creating a file in which you can assign values to the cloning variables. The following discussions describe these options.

Note: You can perform this step at the same time you perform steps 3 and 4 that run the clone.pl and orainstRoot.sh scripts on each cluster node.

See Also: "Locating and Viewing Log Files Generated During Cloning" on page 3-15

Cloning Oracle Clusterware to Create a New Cluster

3-8 Oracle Clusterware Administration and Deployment Guide

■ Supplying Input to the clone.pl Script On the Command Line

If you do not have a shared Oracle Clusterware home, navigate to the $ORACLE_HOME/clone/bin directory on each destination node and run the clone.pl script, which performs the main Oracle Clusterware cloning tasks. To run the script, you must supply input to a number of variables.

For example:

On Linux and UNIX systems:

perl clone.pl ORACLE_BASE=/opt/oracle ORACLE_HOME=CRS_home ORACLE_HOME_NAME=CRS_HOME_NAME '-On_storageTypeVDSK=2' '-On_storageTypeOCR=2' '-O"sl_tableList={node2:node2-priv:node2-vip, node3:node3-priv:node3-vip}"' '-O"ret_PrivIntrList=private_interconnect_list"'-O's_votingdisklocation=voting_disk1' -O's_OcrVdskMirror1RetVal=voting_disk2' -O's_VdskMirror2RetVal=voting_disk3' "-O's_ocrpartitionlocation=OCR_loc' -O's_ocrMirrorLocation=OCRMirror_loc'" -O'INVENTORY_LOCATION=oraInventory_loc' -O'-noConfig'

On Windows systems:

perl clone.pl ORACLE_BASE=D:\oracle ORACLE_HOME=CRS_home ORACLE_HOME_NAME=CRS_HOME_NAME '-On_storageTypeVDSK=2' '-On_storageTypeOCR=2' '-O"sl_tableList={node2:node2-priv:node2-vip, node3:node3-priv:node3-vip}"' '-O"ret_PrivIntrList=private_interconnect_list"' '-O"sl_OHPartitionsAndSpace_valueFromDlg={partition and space information}"' -O'-noConfig'

Refer to Table 3–1 and Table 3–2 for descriptions of the various variables in the preceding examples.

If you have a shared Oracle Clusterware home, then append the -cfs option to the command example in this step and provide a complete path location for the cluster file system. Ensure that the variables n_storageTypeOCR and n_storageTypeVDSK are set to 2 for redundant storage or 1 for nonredundant storage. In this case, you must also specify the mirror locations.

For Windows platforms, on all other nodes, run the same command, by adding an additional argument: PERFORM_PARTITION_TASKS=FALSE.

For example:

perl clone.pl ORACLE_BASE=/opt/oracle ORACLE_HOME=CRS_home ORACLE_HOME_NAME=CRS_home_name '-On_storageTypeVDSK=2' '-On_storageTypeOCR=2' '-O"sl_tableList={node2:node2-priv:node2-vip, node3:node3-priv:node3-vip}"' '-O"ret_PrivIntrList=private interconnect list"' '-O"sl_OHPartitionsAndSpace_valueFromDlg={partition and space information}"' -O'-noConfig' PERFORM_PARTITION_TASKS=FALSE

Note: When cloning Oracle Clusterware using the clone.pl script, you must set a value for the ORACLE_BASE variable even though specifying Oracle base is not a requirement of the Oracle Clusterware installation. You can set the ORACLE_BASE variable to any directory location (for example, you could set it to the Oracle Clusterware Home location), because the value is ignored.

See Also: The "Cloning Script Variables Reference" section on page 3-12 for more information about setting these variables.

Cloning Oracle Clusterware to Create a New Cluster

Cloning Oracle Clusterware 3-9

■ Supplying Input to the clone.pl Script in a File

Because the clone.pl script is sensitive to the parameters being passed to it, you must be accurate in your use of brackets, single quotation marks, and double quotation marks. To make running the clone.pl script less prone to errors, you can create a file that is similar to the start.sh script shown in Example 3–1 in which you can specify environment variables and cloning parameters to the clone.pl script.

Example 3–1 shows an excerpt from the example file example called start.sh script that calls the clone.pl script and has been set up for a cluster named crscluster. Invoke the script as the operating system user that installed Oracle Clusterware.

Example 3–1 Excerpt From the start.sh Script to Clone Oracle Clusterware

#!/bin/shORACLE_BASE=/opt/oracleCRS_home=/opt/oracle/product/11g/crsE01=CRS_home=/opt/oracle/product/11g/crsE02=ORACLE_HOME=${CRS_home}E03=ORACLE_HOME_NAME=OraCrs11gE04=ORACLE_BASE=/opt/oracle#C00="-O'-debug'"C01="-O's_clustername=crscluster'"C02="-O'INVENTORY_LOCATION=/opt/oracle/oraInventory'"C03="-O'sl_tableList={node1:node1int:node1vip:N:Y,node2:node2int:node2vip:N:Y}'"C04="-O'ret_PrivIntrList={eth0:144.25.212.0:1,eth1:10.10.10.0:2}'"C05="-O'n_storageTypeVDSK=1'"C06="-O's_votingdisklocation=/dev/sdc1' -O's_OcrVdskMirror1RetVal=/dev/sdd1' -O's_VdskMirror2RetVal=/dev/sde1'"C07="-O'n_storageTypeOCR=1'"C08="-O's_ocrpartitionlocation=/dev/sdc2' -O's_ocrMirrorLocation=/dev/sdd2'"

perl CRS_home/clone/bin/clone.pl $E01 $E02 $E03 $E04 $C01 $C02 $C03 $C04 $C05 $C06 $C07 $C08

The start.sh script sets several environment variables and cloning parameters, as described in Table 3–1 and Table 3–2, respectively.

Table 3–1 describes the environment variables E01, E02, E03, and E04 that are shown in bold typeface in Example 3–1.

Table 3–1 Environment Variables Passed to the clone.pl Script

Symbol Variable Description

E01 CRS_home The location of the Oracle Clusterware home. This directory location must exist and must be owned by the Oracle operating system group: oinstall.

E02 ORACLE_HOME The location of the Oracle Clusterware home. This directory location must exist and must be owned by the Oracle operating system group: oinstall.

E03 ORACLE_HOME_NAME The name of the Oracle Clusterware home. This is stored in the Oracle Inventory.

E04 ORACLE_BASE The location of the Oracle base directory.

Cloning to Extend Oracle Clusterware to More Nodes in the Same Cluster

3-10 Oracle Clusterware Administration and Deployment Guide

Also, see "Cloning Script Variables Reference" on page 3-12 for a description of the cloning parameters C01 through C08, that are shown in bold typeface in Example 3–1.

Step 4 Run the orainstRoot.sh script on each nodeIn the Central Inventory directory on each destination node, run the orainstRoot.sh script as the operating system user that installed Oracle Clusterware. This script populates the /etc/oraInst.loc directory with the location of the central inventory.

Note that you can run the script on each node simultaneously. For example:

[root@node1 root]# /opt/oracle/oraInventory/orainstRoot.sh

Ensure the orainstRoot.sh script has completed on each destination node before proceeding to the next step.

Step 5 Run the CRS_home/root.sh scriptOn each destination node, run the CRS_home/root.sh script. You can run the script on only one node at a time. The following example is for a Linux or UNIX system:

1. On the first node, run the following command:

[root@node1 root]# /opt/oracle/product/11g/crs/root.sh

Ensure the CRS_home/root.sh script has completed on the first node before running it on the second node.

2. On each subsequent node, run the following command:

[root@node2 root]# /opt/oracle/product/11g/crs/root.sh

The root.sh script automatically sets up the node applications: Global Services Daemon (GSD), Oracle Notification Services (ONS), and Virtual IP (VIP) resources in the OCR.

Step 6 Run the configuration assistants and the Oracle Cluster Verify utilityAt the end of the Oracle Clusterware installation on each new node, run the configuration assistants and CVU using the commands in the CRS_home/cfgtoollogs/configToolAllCommands file.

Cloning to Extend Oracle Clusterware to More Nodes in the Same ClusterFor example, you can use cloning to quickly extend a successfully installed Oracle Clusterware environment to more nodes in the same cluster. Figure 3–2 shows the end result of a cloning procedure in which the Oracle Clusterware home on Node 1 has been cloned to Node 2 in the same cluster, making it a two-node cluster.

Cloning to Extend Oracle Clusterware to More Nodes in the Same Cluster

Cloning Oracle Clusterware 3-11

Figure 3–2 Cloning to Extend the Oracle Clusterware Environment to Another Node

At a high level, the steps to extend Oracle Clusterware to more nodes are nearly identical to the steps described in the "Cloning Oracle Clusterware to Create a New Cluster" section on page 3-5.

The following list describes the steps you perform to extend Oracle Clusterware to additional nodes in the cluster:

1. Prepare the new cluster nodes on page 3-6.

2. Deploy Oracle Clusterware on the destination nodes on page 3-7.

3. Run the clone.pl script on each destination node. The following example is for Linux or UNIX systems:

perl clone.pl ORACLE_BASE=/opt/oracle ORACLE_HOME=$ORACLE_HOME ORACLE_HOME_NAME=Oracle_home_name "sl_tableList={node2:node2_priv:node2-vip}" INVENTORY_LOCATION=central_inventory_location -noConfig

The following example is for Windows systems:

perl clone.pl ORACLE_BASE=D:\oracle ORACLE_HOME=CRS_HOME ORACLE_HOME_name=CRS_HOME_name'-O"sl_tableList={node2:node2-priv:node2-vip}"' '-O-noConfig' '-OPERFORM_PARTITION_TASKS=FALSE'

4. Run the orainstRoot.sh script on each node on page 3-10 on each destination node.

5. Run the addNode script on the source node.

Run the following command on the source node, where new_node is the name of the new node, new_node-priv is the private interconnect protocol address for the new node, and new_node-vip is the virtual interconnect protocol address for the new node:

$ORACLE_HOME/oui/bin/addNode.sh –silent "CLUSTER_NEW_NODES={new_nodes}" "CLUSTER_NEW_PRIVATE_NODE_NAMES={new_node-priv}" "CLUSTER_NEW_VIRTUAL_HOSTNAMES={new_node-vip}" –noCopy

6. On the source node, run a script to instantiate the node:

■ On Linux and UNIX systems, run the rootaddnode.sh script from the CRS_HOME/install directory as root user.

■ On Windows systems, run the crssetup.add.bat script from the %CRS_HOME%\install directory.

Note: Because the clone.pl script has already been run on the new node, this step only updates the inventories on the nodes and instantiates scripts on the local node

Node 1

Cluster 1

Node 2

Cloning Script Variables Reference

3-12 Oracle Clusterware Administration and Deployment Guide

7. Run the CRS_home/root.sh script on page 3-10 on each destination node in Linux and UNIX environments.

8. Run the configuration assistants and the CLUVFY utility.

As the user that owns the clusterware on the source node of the cluster, run the configuration assistants as described in the following steps. You can obtain the remote port from the $CRS_HOME/cfgtoollogs/configToolAllCommands file on the target node.

a. On Linux or UNIX systems, run the following onsconfig command:

onsconfig add_config node2:remote_port node3:remote_port

On Windows systems, run the onsconfig command:

./onsconfig add_config node2:remote_port node3:remote_port

b. On Linux, UNIX, or Windows systems, run the CLUVFY utility in postinstallation verification mode to confirm that the installation of Oracle Clusterware was successful. For example:

CRS_HOME/bin/cluvfy stage -post crsinst -n node1,node2

Cloning Script Variables Reference Table 3–2 describes the variables that can be passed to the clone.pl script when you include the -O option on the command.

Table 3–2 Variables for the clone.pl Script with the -O option

Variable Datatype Description

s_clustername String Set the value for this variable to be the unique name of the cluster that you are creating from a cloning operation. Use a maximum of 15 characters. Valid characters for the cluster name can be any combination of lower and uppercase alphabetic characters A to Z, numerics 0 through 9 , hyphens (-), pound signs (#) and underscores (_).

INVENTORY_LOCATION String The location of the inventory. This directory location must exist and must be owned by the Oracle operating system group: oinstall.

sl_tableList String List A list of the nodes that make up the cluster. The format is a comma-delimited list of public_name:private_name:vip_name:N:Y.

Set the value of this variable to be equal to the information in the cluster configuration information table. This file contains a comma-delimited list of values. The first field designates the public node name, the second field designates the private node name, and the third field designates the virtual host name. The fourth and fifth fields are used only by Oracle Universal Installer and should default to N:Y. Oracle Universal Installer parses these values and assign s_publicname and s_privatename variables accordingly. For example:

{"node1:node1-priv:node1-vip:N:Y","node2:node2-priv:node2-vip:N:Y"}.

Cloning Script Variables Reference

Cloning Oracle Clusterware 3-13

ret_PrivIntrList String List This is the return value from the Private Interconnect Enforcement table. This variable has values in the format {Interface Name, Subnet, Interface Type}. The value for Interface Type can be one of the following:

■ 1 to denote public,

■ 2 to denote private

■ 3 to denote Do Not Use

For example:

{"eth0:10.87.24.0:2","eth1:140.87.24.0:1","eth3:140.74.30.0:3"}

You can run the ipconfig command to identify the initial values from which you can determine the entries for ret_PrivIntrList.

n_storageTypeVDSK Integer If you are using:

■ A single voting disk, set this parameter to 2 (not redundant).

■ Multiple voting disks, set this parameter to 1 (redundant).

n_storageTypeOCR Integer If you are using:

■ A single OCR disk, set this parameter to 2 (not redundant).

■ Multiple OCR disks, set this parameter to 1 (redundant).

s_clustername String This variable contains user-entered cluster name information; allow a maximum of 15 characters.

VdskMirrorNotReqd String This variable is not required in the Oracle Cluster Registry (OCR) dialog.

CLUSTER_CONFIGURATION_FILE

String This variable is used to pass the cluster configuration file information which is the same file as that specified during installation. You may use this file instead of sl_tablelist. This file contains the public node name, private node name, and virtual host name which is white space-delimited information for the nodes of the cluster. For example,

node1 node1-priv node1-vipnode2 node2-priv node2-vip

Note that if you are cloning from an existing installation, then you should use sl_tableList. Do not specify this variable for a clone installation.

s_votingdisklocation String Set the value of this variable to be the location of the voting disk. For example:

/oradbshare/oradata/vdisk

If you are using:

■ A single voting disk, only specify the voting disk location with the s_votingdisklocation parameter.

■ Multiple voting disks, set the s_votingdisklocation, s_OcrVdskMirror1RetVal, and the s_VdskMirror2RetVal parameters.

Table 3–2 (Cont.) Variables for the clone.pl Script with the -O option

Variable Datatype Description

Cloning Script Variables Reference

3-14 Oracle Clusterware Administration and Deployment Guide

s_OcrVdskMirror1RetVal String Set the value of this variable to be the location of the first additional voting disk. You must set this variable if you choose a value of 1 for the n_storageTypeVDSK variable or Not Redundant. For example:

/oradbshare/oradata/vdiskmirror1

s_ocrpartitionlocation String Set the value of this variable to the OCR location. Oracle Database places this value in the ocr.loc file when you run the root.sh script. For example:

/oradbshare/oradata/ocr

If you are using:

■ A single OCR disk, only set the s_ocrpartitionlocation parameter to specify the location of the OCR partition.

■ Multiple OCR disks, set the s_ocrpartitionlocation parameter and the s_ocrMirrorLocation parameter.

s_ocrMirrorLocation String Set the value of this variable to the value for the OCR mirror location. Oracle Database places this value in the ocr.loc file when you run the root.sh script. You must set this variable if you choose a value of 1 for the n_storageTypeOCR variable or Not Redundant. For example:

/oradbshare/oradata/ocrmirror

s_VdskMirror2RetVal String Set the value of this variable to be the location of the second additional voting disk. You must set this variable if you choose a value of 1 for the n_storageTypeVDSK variable or Not Redundant.

/oradbshare/oradata/vdiskmirror2

CLUSTER_NODES String List The value of this variable represents the cluster node names that you selected for installation. For example, if you selected node1:

CLUSTER_NODES = {"node1"}

b_Response Boolean Only set this variable when performing a silent installation with a response file. The valid values are true or false.

Table 3–2 (Cont.) Variables for the clone.pl Script with the -O option

Variable Datatype Description

Locating and Viewing Log Files Generated During Cloning

Cloning Oracle Clusterware 3-15

Locating and Viewing Log Files Generated During CloningThe cloning script runs multiple tools, each of which may generate its own log files. After the clone.pl script finishes running, you can view log files to obtain more information about the cloning process.

The following log files that are generated during cloning are the key log files of interest for diagnostic purposes:

■ Central_Inventory/logs/cloneActions timestamp.log

Contains a detailed log of the actions that occur during the Oracle Universal Installer part of the cloning.

■ Central_Inventory/logs/oraInstall timestamp.err

Contains information about errors that occur when Oracle Universal Installer is running.

■ Central_Inventory/logs/oraInstall timestamp.out

Contains other miscellaneous messages generated by Oracle Universal Installer.

■ $ORACLE_HOME/clone/logs/clone timestamp.log

Contains a detailed log of the actions that occur prior to cloning as well as during the cloning operations.

■ $ORACLE_HOME/clone/logs/error timestamp.log

sl_OHPartitionsAndSpace_valueFromDlg

String List Set the value for this variable using the following format:

1 = disk number

2 = partition number

3 = partition size

4 = format type, 0 for raw and 1 for cluster file system

5 = Drive Letter (this value is not applicable if you use raw devices, use the available drive letter if you are using a cluster file system.

6 Usage type values:

■ 0 = Data or software use only

■ 1 = Primary OCR only

■ 2 = Voting disk only

■ 3 = Primary OCR and voting disk on the same cluster file system partition

■ 4 = OCR mirror only

■ 5 = OCR mirror and voting disk on the same cluster file system partition

For example, to configure the OCR and voting disk on raw devices and to not use a cluster file system for either data or software, set sl_OHPartitionsAndSpace_valueFromDlg to list only the partitions that you intend to use for an Oracle Clusterware installation using the following format:

sl_OhPartitionsAndSpace_valueFromDlg = {Disk,Partition,partition size, 0,N/A,1,Disk,Partition, partition size,0,N/A,2,.....)

Table 3–2 (Cont.) Variables for the clone.pl Script with the -O option

Variable Datatype Description

Locating and Viewing Log Files Generated During Cloning

3-16 Oracle Clusterware Administration and Deployment Guide

Contains information about errors that occur prior to cloning as well as during cloning operations.

Table 3–3 describes how to find the location of the Oracle inventory directory.

Table 3–3 Finding the Location of the Oracle Inventory Directory

Type of System Location of the Oracle Inventory Directory

All UNIX computers except Linux and IBM AIX

/var/opt/oracle/oraInst.loc

IBM AIX and Linux /etc/oraInst.loc file.

Windows Obtain the location from the Windows Registry key: HKEY_LOCAL_MACHINE\SOFTWARE\ORACLE\INST_LOC

Adding and Deleting Oracle Clusterware Homes 4-1

4Adding and Deleting Oracle Clusterware

Homes

This chapter describes how to use the addNode.sh and rootdeletenode.sh scripts to copy the Oracle Clusterware home from an existing Oracle Clusterware home to other nodes. This chapter provides instructions for Linux and UNIX systems, and Windows systems.

You should use the add node procedures described in this chapter to add or delete Oracle Clusterware from nodes in the cluster. If your goal is create new clusters or extend Oracle Clusterware to more nodes in the same cluster, then use the cloning procedures that are described in Chapter 3, "Cloning Oracle Clusterware".

The topics in this chapter include the following:

■ Prerequisite Steps for Adding Oracle Clusterware

■ Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

■ Adding and Deleting Oracle Clusterware Homes on Windows Systems

Prerequisite Steps for Adding Oracle ClusterwareThe following steps assume that you already have an operative Linux or UNIX environment.

Complete the following steps to prepare the new nodes in the cluster:

1. Make physical connections

Connect the new nodes' hardware to the network infrastructure of your cluster. This includes establishing electrical connections, configuring network interconnects, configuring shared disk subsystem connections, and so on. See your hardware vendor documentation for details about this step.

2. Install the operating system

Install a cloned image of the operating system that matches the operating system on the other nodes in your cluster. This includes installing required service patches and drivers. See your hardware vendor documentation for details about this process.

3. Create Oracle users.

Note: Perform this step only for Linux and UNIX systems. For Windows systems, skip to Step 4.

Prerequisite Steps for Adding Oracle Clusterware

4-2 Oracle Clusterware Administration and Deployment Guide

As root user, create the Oracle users and groups using the same user ID and group ID as on the existing nodes.

4. Verify the installation with the Cluster Verification Utility (CVU) using the following steps:

a. From the /bin directory in the CRS_home on the existing nodes, run the CVU command to verify your installation at the post hardware installation stage as shown in the following example, where node_list is a comma-delimited list of nodes you want in your cluster:

cluvfy stage -post hwos -n node_list|all [-verbose]

This command causes CVU to verify your hardware and operating system environment at the post-hardware setup stage. After you have configured the hardware and operating systems on the new nodes, you can use this command to verify the node is reachable, for example, to all of the nodes from the local node. You can also use this command to verify user equivalence to all given nodes the local node, node connectivity among all of the given nodes, accessibility to shared storage from all of the given nodes, and so on.

b. From the /bin directory in the CRS_home on the existing nodes, run the CVU command to obtain a detailed comparison of the properties of the reference node with all of the other nodes that are part of your current cluster environment where ref_node is a node in your existing cluster against which you want CVU to compare, for example, the newly added nodes that you specify with the comma-delimited list in node_list for the -n option, orainventory_group is the name of the Oracle inventory group, and osdba_group is the name of the OSDBA group:

cluvfy comp peer [ -refnode ref_node ] -n node_list[ -orainv orainventory_group ] [ -osdba osdba_group ] [-verbose]

5. Check the installation

To verify that your installation is configured correctly, perform the following steps:

a. Ensure that the new nodes can access the private interconnect. This interconnect must be properly configured before you can complete the procedures described in this chapter.

b. If you are not using a cluster file system, then determine the location on which your cluster software was installed on the existing nodes. Make sure that you have at least 250MB of free space on the same location on each of the new

Note: You can only use the all option with the -n argument if you have set the CV_NODE_ALL variable to represent the list of nodes on which you want to perform the CVU operation.

See Also: Appendix A, "Cluster Verification Utility Reference" for more information

Note: For the reference node, select a node from your existing cluster nodes against which you want CVU to compare, for example, the newly added nodes that you specify with the -n option.

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

Adding and Deleting Oracle Clusterware Homes 4-3

nodes to install Oracle Clusterware. In addition, ensure you have enough free space on each new node to install the Oracle binaries.

c. Ensure that the Oracle Cluster Registry (OCR) and the voting disk are accessible by the new nodes using the same path as the other nodes use. In addition, the OCR and voting disk devices must have the same permissions as on the existing nodes.

d. Verify user equivalence to and from an existing node to the new nodes using rsh or ssh on Linux and UNIX systems, or on Window systems make sure that you can run the following command from all of the existing nodes of your cluster where the hostname is the public network name of the new node:

NET USE \\hostname\C$

You have the required administrative privileges on each node if the operating system responds with:

Command completed successfully.

After completing the procedures in this section, your new nodes are connected to the cluster and configured with the required software to make them visible to Oracle Clusterware.

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

This section explains Oracle Clusterware home addition and deletion on Linux and UNIX systems and it assumes that you have already performed the steps in the "Prerequisite Steps for Adding Oracle Clusterware" section.

For node addition, ensure that you install the required operating system patches and updates on the new nodes. Then configure the new nodes to be part of your cluster at the network level. Use the instructions in this section to extend the Oracle Clusterware home from an existing Oracle Clusterware home to the new nodes

Finally, you can optionally extend the Oracle database software with Oracle Real Application Clusters (Oracle RAC) components to the new nodes and make the new nodes members of the existing Oracle RAC database. See the node addition procedures described in Oracle Real Application Clusters Administration and Deployment Guide.

This section includes the following topics:

■ Adding an Oracle Clusterware Home to a New Node On Linux or UNIX Systems

■ Deleting an Oracle Clusterware Home from a Linux or UNIX System

Adding an Oracle Clusterware Home to a New Node On Linux or UNIX SystemsThis section describes how to use Oracle Universal Installer to add an Oracle Clusterware home to a node in your cluster. This documentation assumes:

■ There is an existing cluster that has a node named node1

Note: Avoid changing host names after you complete the Oracle Clusterware installation, including adding or deleting domain qualifications. Nodes with changed host names must be deleted from the cluster and added back with the new name.

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

4-4 Oracle Clusterware Administration and Deployment Guide

■ You are adding Oracle Clusterware from node2

■ You have already successfully installed Oracle Clusterware on node1 in a nonshared home, where CRS_home represents the successfully installed home

You can use either of the following procedures to add an Oracle Clusterware home to a node:

■ Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installer in Interactive Mode

■ Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installer in Silent Mode

Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installer in Interactive ModeThis procedure assumes that you have performed the tasks outlined in "Prerequisite Steps for Adding Oracle Clusterware" on page 4-1. Oracle Universal Installer requires access to the private interconnect that you verified as part of the installation validation in Step 1. If Oracle Universal Installer cannot make the required connections, you will not be able to complete the following steps to add Oracle Clusterware to other nodes.

1. Ensure that you have successfully installed Oracle Clusterware on at least one node in your cluster environment. Also, for these procedures to complete successfully, you must ensure that CRS_home identifies your successfully installed Oracle Clusterware home.

2. Start Oracle Universal Installer:

Go to CRS_home/oui/bin and run the addNode.sh script on one of the existing nodes. Oracle Universal Installer runs in add node mode and the Welcome page displays. Click Next and the Specify Cluster Nodes for Node Addition page displays.

3. Oracle Universal Installer displays the Node Selection Page on which you should enter the node or nodes that you want to add and click Next.

The upper table on the Specify Cluster Nodes for Node Addition page shows the existing nodes, the private node names, and the virtual IP (VIP) addresses that are associated with Oracle Clusterware. Use the lower table to enter the public, private node names and the virtual hostnames of the new nodes.

4. Verify the entries that Oracle Universal Installer displays on the Summary Page and click Next.

If any verifications fail, Oracle Universal Installer redisplays the Specify Cluster Nodes for Node Addition page with a Status column in both tables indicating errors. Correct the errors or deselect the nodes that have errors and proceed. However, you cannot deselect existing nodes; you must correct problems on nodes that are already part of your cluster before you can proceed with node addition. If all the checks succeed, Oracle Universal Installer displays the Node Addition Summary page.

Note: Instead of performing the first six steps of this procedure, you can alternatively run the addNode.sh script in silent mode as described "Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installer in Silent Mode" on page 4-6.

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

Adding and Deleting Oracle Clusterware Homes 4-5

5. The Node Addition Summary page displays the following information showing the products that are installed in the Oracle Clusterware home that you are extending to the new nodes:

■ The source for the add node process, which in this case is the Oracle Clusterware home

■ The private node names that you entered for the new nodes

■ The new nodes that you entered

■ The required and available space on the new nodes

■ The installed products listing the products that are already installed on the existing Oracle Clusterware home

Click Next and Oracle Universal Installer displays the Cluster Node Addition Progress page.

6. The Cluster Node Addition Progress page shows the status of the cluster node addition process. The table on this page has two columns showing the four phases of the node addition process and the phases' statuses as follows:

■ Instantiate Root Scripts—Instantiates rootaddNode.sh with the public nodes, private node names, and virtual hostnames that you entered on the Cluster Node Addition page.

■ Copy the Oracle Clusterware home to the New Nodes—Copies the Oracle Clusterware home to the new nodes unless the Oracle Clusterware home is on a cluster file system.

■ Save Cluster Inventory—Updates the node list associated with the Oracle Clusterware home and its inventory.

■ Run rootaddNode.sh and root.sh—Displays a dialog prompting you to run the rootaddNode.sh script1 from the local node (the node on which you are running Oracle Universal Installer) and to run the root.sh script2 on the new nodes. If Oracle Universal Installer detects that the new nodes do not have an inventory location, Oracle Universal Installer instructs you to run the orainstRoot.sh script3 on those nodes. The central inventory location is the same as that of the local node. The addNodeActionstimestamp.log file, where timestamp shows the session start date and time, contains information about which scripts you need to run and on which nodes you need to run them.

The Cluster Node Addition Progress page's Status column displays In Progress while the phase is in progress, Suspended when the phase is pending execution, and Succeeded after the phase completes. On completion, click Exit. After Oracle Universal Installer displays the End of Node Addition page, click Exit.

7. From the CRS_home/install directory on an existing node, run the Oracle Notification Service configuration utility (ONSCONFIG) as in the following example where remote_port is the default port number 6251 (or another free port if port 6251 is unavailable) and node2 is the name of the node that you are adding:

1 Run the rootaddNode.sh script from the CRS_home/install/ directory on the node from which you are running Oracle Universal Installer.

2 Run the root.sh script on the new node from the Oracle Clusterware home to start Oracle Clusterware on the new node.

3 Run the orainstRoot.sh script on the new node if Oracle Universal Installer prompts you to do so.

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

4-6 Oracle Clusterware Administration and Deployment Guide

./onsconfig add_config node2:remote_port

8. Check that your cluster is integrated and that the cluster is not divided into partitions by completing the following operations:

■ Run the following CVU command to obtain detailed output for verifying cluster manager integrity on all of the nodes that are part of your Oracle RAC environment:

cluvfy comp clumgr -n all [-verbose]

■ Use the following CVU command to obtain detailed output for verifying cluster integrity on all of the nodes that are part of your Oracle RAC environment:

cluvfy comp clu [-verbose]

■ Use the following command to perform an integrated validation of the Oracle Clusterware setup on all of the configured nodes, both the preexisting nodes and the nodes that you have added:

cluvfy stage -post crsinst -n all [-verbose]

Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installer in Silent Mode1. Ensure that you have successfully installed Oracle Clusterware on at least one

node in your cluster environment. To perform the following procedure, the CRS_home must identify your successfully installed Oracle Clusterware home.

2. Go to CRS_home/oui/bin and run the addNode.sh script using the following syntax where node2 is the name of the new node that you are adding, node2-priv is the private node name for the new node, and node2-vip is the VIP name for the new node:

./addNode.sh –silent "CLUSTER_NEW_NODES={node2}" "CLUSTER_NEW_PRIVATE_NODE_NAMES={node2-priv}" "CLUSTER_NEW_VIRTUAL_HOSTNAMES={node2-vip}"

Alternatively, you can specify the entries shown in Example 4–1 in a response file and run the addNode script as follows:

addNode.sh -silent -responseFile filename

Example 4–1 Response File Entries for Adding Oracle Clusterware Home

CLUSTER_NEW_NODES = {"newnode1","newnode2"}CLUSTER_NEW_PRIVATE_NODE_NAMES = {"newnode1-priv","newnode2-priv"}CLUSTER_NEW_VIRTUAL_HOSTNAMES = {"newnode1-vip","newnode2-vip"}

See Also: Oracle Clusterware Administration and Deployment Guide for more information about enabling and using the CVU

See Also: Oracle Universal Installer and OPatch User's Guide for details about how to configure command-line response files

Note: Command-line values always override response file values.

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

Adding and Deleting Oracle Clusterware Homes 4-7

3. Perform Step 7 and Step 8 in the "Adding an Oracle Clusterware Home to a New Node Using Oracle Universal Installer in Interactive Mode" section.

Deleting an Oracle Clusterware Home from a Linux or UNIX SystemThe procedures for deleting an Oracle Clusterware home assume that you have successfully installed the Oracle Clusterware on the node from which you want to delete the Oracle Clusterware home.

This section includes the following topics:

■ Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Interactive Mode

■ Deleting an Oracle Clusterware Home from a Node that Cannot Be Accessed

Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Interactive ModeUse the following steps to remove Oracle Clusterware from a cluster node.

Step 1 Verify the location of the Oracle Clusterware homeEnsure that CRS_home correctly specifies the full directory path for the Oracle Clusterware home on each node, where CRS_home is the location of the installed Oracle Clusterware software.

Step 2 Remove the stored network configuration

From a node that is going to remain in your cluster, in the CRS_home/bin directory, run the following command where node2 is the name of the node that you are deleting:

./oifcfg delif –node node2

Step 3 Obtain the remote port numberCreate a dump of the OCR by running the ocrdump dump_file_name command. Open the dump file and search for ONS_HOSTS.hostName.PORT. The subsequent line contains the remote port number.

Step 4 Remove the ONS daemon configurationFrom CRS_home/bin on a node that is going to remain in the cluster, run the Oracle Notification Service Utility (RACGONS). In the following example, the remote_port variable represents the ONS remote port number that you obtained in step Step 3 and node2 is the name of the node that you are deleting:

./racgons remove_config node2:remote_port

Note: Oracle recommends that you back up your voting disk and OCR files after you complete the node deletion process.

Note: This step is unnecessary if you ran the Oracle Interface Configuration Tool (OIFCFG) with the -global option during the installation, such as with Oracle Universal Installer.

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

4-8 Oracle Clusterware Administration and Deployment Guide

Step 5 Disable the Oracle Clusterware applicationsOn the node to be deleted, run the rootdelete.sh script as the root user from the CRS_home/install directory to disable the Oracle Clusterware applications and daemons running on the node. If you are deleting Oracle Clusterware from more than one node, then perform this step on each node that you are deleting.

Step 6 Delete the node and update the cluster registryFrom any node that you are not deleting, issue the following command from the CRS_home/install directory as the root user to delete the node from the cluster and to update the Oracle Cluster Registry (OCR). In the following command, the variable node2,node2-number represents the node and the node number that you want to delete:

./rootdeletenode.sh node2,node2-number

If necessary, identify the node number using the following command on the node that you are deleting:

CRS_home/bin/olsnodes -n

Step 7 Remove the node from the node listOn the node that is to be deleted, run the following command from the CRS_home/oui/bin directory where node_to_be_deleted is the name of the node that you are deleting:

./runInstaller -updateNodeList ORACLE_HOME=CRS_home "CLUSTER_NODES={node_to_be_deleted}" CRS=TRUE -local

Step 8 Detach or deinstall the Oracle Clusterware softwareOn the node that you are deleting, run Oracle Universal Installer using the runInstaller command from the CRS_home/oui/bin directory. Depending on whether you have a shared or nonshared Oracle home, complete one of the following procedures:

■ If you have a shared home, then on any node other than the node to be deleted, run the following command from the CRS_home/oui/bin directory:

./runInstaller -detachHome ORACLE_HOME=CRS_home

■ For a nonshared home, deinstall the Oracle Clusterware home from the node that you are deleting using Oracle Universal Installer as follows by issuing the following command from the Oracle_home/oui/bin directory, where CRS_home is the name defined for the Oracle Clusterware home:

./runInstaller -deinstall "REMOVE_HOMES={CRS_home}"

Step 9 Update the node list on the remaining nodesOn any node other than the node you are deleting, run the following command from the CRS_home/oui/bin directory where remaining_nodes_list is a comma-delimited list of the nodes that are going to remain part of your cluster:

./runInstaller -updateNodeList ORACLE_HOME=CRS_home "CLUSTER_NODES={remaining_nodes_list}" CRS=TRUE

Adding and Deleting Oracle Clusterware Homes on Linux and UNIX Systems

Adding and Deleting Oracle Clusterware Homes 4-9

Deleting an Oracle Clusterware Home from a Node that Cannot Be AccessedSometimes a node cannot be accessed for various reasons, such as the local disks have been corrupted, the server has failed, or the server has been physically removed. The definition of the failed node must still be removed from the repository.

To delete a node definition from from the repository, perform the following steps on a healthy node in the cluster:

Step 1 Remove OIFCFG information for the failed nodeRun the following command, as root:

# oifcfg getif

If the command returns global, such as eth0 123.456.0.0 global public, go on to Step 2. Otherwise, run the following command, as root:

# oifcfg delif -node name_of_node_to_be_deleted

Step 2 Remove ONS informationRun the following command, as root, to identify the local port number to use in this procedure:

# cat CRS_home/opmn/conf/ons.config

Run the following command, as root, to remove the ONS information from the failed node:

# CRS_home/bin/racgons remove_config name_of_node_to_be_deleted:local_port_number

Step 3 Remove resourcesResources defined on the failed node must be removed. Resources include the database, the instance, and ASM and can be identified by running the crs_stat -t command. Once identified, use the srvctl remove command to remove the resources.

Step 4 Run rootdeletenode.shIf you do not know the name of the node that you are trying to delete, run the following command, as root:

# CRS_home/bin/olsnodes -p

Otherwise, run the following command, as root:

# CRS_home/install/rootdeletenode.sh name_of_node_to_be_deleted

Step 5 Update the inventoryRun the following command, as owner of the CRS_home, with a comma-seperated list of node names that are going to remain in the cluster as the value for the CLUSTER_NODES parameter:

$ CRS_home/oui/bin/runInstaller -updateNodeList ORACLE_HOME=CRS_HOME "CLUSTER_NODES=remaining_node1, remaining_node2,..." CRS=TRUE

See Also: Oracle Real Application Clusters Administration and Deployment Guide for more information about the srvctl remove command

Adding and Deleting Oracle Clusterware Homes on Windows Systems

4-10 Oracle Clusterware Administration and Deployment Guide

Adding and Deleting Oracle Clusterware Homes on Windows SystemsThis section explains Oracle Clusterware home additions and deletions on Windows systems.

For node additions, ensure that you install the required operating system patches and updates on the new nodes. Then configure the new nodes to be part of your cluster at the network level. Use the instructions in the following sections to extend the Oracle Clusterware home from an existing Oracle Clusterware home to the new nodes

Finally, you can optionally extend the Oracle database software with Oracle RAC components to the new nodes and make the new nodes members of the existing Oracle RAC database. See the node addition procedures described in Oracle Real Application Clusters Administration and Deployment Guide.

This section includes the following topics:

■ Adding an Oracle Clusterware Home to a Windows System

■ Deleting an Oracle Clusterware Home from a Windows System

Adding an Oracle Clusterware Home to a Windows SystemThis section describes how to add new nodes to Oracle Clusterware using Oracle Universal Installer. Oracle Universal Installer requires access to the private interconnect that you checked in the "Prerequisite Steps for Adding Oracle Clusterware" section.

Perform the following steps:

1. On one of the existing nodes go to the CRS_home\oui\bin directory and run the addnode.bat script to start Oracle Universal Installer.

2. Oracle Universal Installer runs in the add node mode and the Welcome page displays. Click Next and the Specify Cluster Nodes for Node Addition page displays.

3. The upper table on the Specify Cluster Nodes for Node Addition page shows the existing nodes, the private node names and the virtual IP (VIP) addresses that are associated with Oracle Clusterware. Use the lower table to enter the public, private node names and the virtual hostnames of the new nodes.

4. Click Next and Oracle Universal Installer verifies connectivity on the existing nodes and on the new nodes. The verifications that Oracle Universal Installer performs include determining whether:

■ The nodes are up

■ The nodes are accessible by way of the network

Note: Do not use the procedures described in this section to extend Oracle Clusterware in configurations where the Oracle database has been upgraded from Oracle Database 10g Release 1 (10.1) on Windows systems. In this scenario, you must use the cloning procedure described in Chapter 3, "Cloning Oracle Clusterware" to extend Oracle Clusterware to additional nodes in the cluster.

Adding and Deleting Oracle Clusterware Homes on Windows Systems

Adding and Deleting Oracle Clusterware Homes 4-11

■ The virtual hostnames are not already in use on the network

■ The user has write permission to create the Oracle Clusterware home on the new nodes

■ The user has write permission to the Oracle Universal Installer inventory in the C:\Program Files\Oracle\Inventory directory

5. If Oracle Universal Installer detects that the new nodes do not have an inventory location, then:

■ The Oracle Universal Installer automatically updates the inventory location in the Registry key

If any verifications fail, Oracle Universal Installer redisplays the Specify Cluster Nodes for Node Addition page with a Status column in both tables indicating errors. Correct the errors or deselect the nodes that have errors and proceed. However, you cannot deselect existing nodes; you must correct problems on nodes that are already part of your cluster before you can proceed with node addition. If all of the checks succeed, Oracle Universal Installer displays the Node Addition Summary page.

6. The Node Addition Summary page displays the following information showing the products that are installed in the Oracle Clusterware home that you are extending to the new nodes:

■ The source for the add node process, which in this case is the Oracle Clusterware home

■ The private node names that you entered for the new nodes

■ The new nodes that you entered

■ The required and available space on the new nodes

■ The installed products listing the products that are already installed on the existing the Oracle Clusterware home

Click Next and Oracle Universal Installer displays the Cluster Node Addition Progress page.

7. The Cluster Node Addition Progress page shows the status of the cluster node addition process. The table on this page has two columns showing the phase of the node addition process and the phase's status according to the following:

This page shows the following three Oracle Universal Installer phases:

Note: If any of the existing nodes are down, then you can proceed with the procedure. However, once the nodes are up, you must run the following command on each of those nodes:

setup.exe -updateNodeList -local "CLUSTER_NODES={available_node_list}"ORACLE_HOME=CRS_home

This operation should be run from the CRS_home\oui\bin directory and the available_node_list value is a comma-delimited list of all of nodes currently in the cluster and CRS_home defines the Oracle Clusterware home directory.

Adding and Deleting Oracle Clusterware Homes on Windows Systems

4-12 Oracle Clusterware Administration and Deployment Guide

■ Copy the Oracle Clusterware Home to New Nodes—Copies the Oracle Clusterware home to the new nodes unless the Oracle Clusterware home is on the Oracle Cluster File System.

■ Perform Oracle Home Setup—Updates the Registry entries for the new nodes, creates the services, and creates folder entries.

■ Save Cluster Inventory—Updates the node list associated with the Oracle Clusterware home and its inventory.

The Cluster Node Addition Progress page's Status column displays In Progress while the phase is in progress, Suspended when the phase is pending execution, and Succeeded after the phase completes. On completion, click Exit. After Oracle Universal Installer displays the End of Node Addition page, click Exit.

8. From CRS_home\install on node1, run the crssetup.add.bat script.

9. Use the following command to perform an integrated validation of the Oracle Clusterware setup on all of the configured nodes, both the preexisting nodes and the nodes that you have added

cluvfy stage -post crsinst -n all [-verbose]

The CVU -post crsinst stage check verifies the integrity of the Oracle Clusterware components. After you have completed the procedures in this section for adding nodes at the Oracle Clusterware layer, you have successfully extended the Oracle Clusterware home from your existing the Oracle Clusterware home to the new nodes. Proceed to Step 3 to prepare the storage for Oracle RAC on the new nodes.

You can optionally run addnode.bat in silent mode, replacing steps 1 through 6 as follows, where nodeI, nodeI+1, and so on are the new nodes that you are adding:

addnode.bat -silent "CLUSTER_NEW_NODES={nodeI,nodeI+1,…nodeI+n}""CLUSTER_NEW_PRIVATE_NODE_NAMES={node-privI,node-privI+1,…node-privI+n}" "CLUSTER_NEW_VIRTUAL_HOSTNAMES={node-vipI,node-vipI+1,…,node-vipI+n}"

You can alternatively specify the entries shown in Example 4–2 in a response file and run addnode as follows:

addnode.bat -silent -responseFile filename

Example 4–2 Response File Entries for Adding Oracle Clusterware Home

CLUSTER_NEW_NODES = {"newnode1","newnode2"}CLUSTER_NEW_PRIVATE_NODE_NAMES = {"newnode1-priv","newnode2-priv"}CLUSTER_NEW_VIRTUAL_HOSTNAMES = {"newnode1-vip","newnode2-vip"}

Command-line values always override response file values.

Perform Step 8 and Step 9 on the local node, or on the node on which you were performing this procedure.

After you have completed the procedures in this section for adding nodes at the Oracle Clusterware layer, you have successfully extended the Oracle Clusterware home from your existing the Oracle Clusterware home to additional nodes.

See Also: Oracle Universal Installer and OPatch User's Guide for details about how to configure command-line response files

Adding and Deleting Oracle Clusterware Homes on Windows Systems

Adding and Deleting Oracle Clusterware Homes 4-13

Deleting an Oracle Clusterware Home from a Windows SystemThis section describes how to delete Oracle Clusterware from Windows systems. These procedures assume that an Oracle Clusterware home is installed on node1 and node2, and that you want to delete node2 from the cluster.

This section contains the following topics:

■ Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Interactive Mode

■ Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Silent Mode

Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Interactive ModePerform the following procedure to use Oracle Universal Installer to delete the Oracle Clusterware home from a node:

1. Perform the delete node operation for database homes as described in Oracle Real Application Clusters Administration and Deployment Guide.

2.

From node1, in the CRS_home\bin directory, run the following command where node2 is the name of the node that you are deleting:

oifcfg delif –node node2

3. From node1 in the CRS_home\bin directory run the following command where remote_port is the ONS remote port number:

racgons remove_config node2:remote_port

To determine the remote port number, from %CRS_HOME%\bin, create an OCR dump by running ocrdump dump_file_name. Open the dump file and search for ONS_HOSTS.hostName.PORT. The subsequent line contains the remote port number for ONS. For example:

ORATEXT : 6251

By default the remote port is 6251.

4. Run srvctl to stop and remove the nodeapps from node 2. From CRS_home/bin run the following commands:

srvctl stop nodeapps –n node2srvctl remove nodeapps –n node2

Note: Oracle recommends that you back up your voting disk and Oracle Cluster Registry files after you complete any node addition or deletion procedures.

Note: This step is unnecessary if you ran the Oracle Interface Configuration Tool (OIFCFG) with the -global option during the installation, such as with Oracle Universal Installer.

Adding and Deleting Oracle Clusterware Homes on Windows Systems

4-14 Oracle Clusterware Administration and Deployment Guide

5. On node1, or on any node that is not being deleted, run the following command from CRS_home\bin where node_name is the node to be deleted and node_number is the node's number as obtained from the output from the olsnodes -n command:

crssetup del –nn node_name,node_number

6. On each node that you want to delete (node2 in this case), run the following command from CRS_home\oui\bin:

setup.exe –updateNodeList ORACLE_HOME=CRS_home "CLUSTER_NODES={node2}" CRS=TRUE –local

7. On node2, using Oracle Universal Installer setup.exe from CRS_home\oui\bin:

■ If you do not have a shared home, then deinstall the Oracle Clusterware installation by running the setup.exe script from CRS_home\oui\bin.

■ If you have a shared home, then do not perform a deinstallation. Instead, run the following command from CRS_home\oui\bin to detach the Oracle Clusterware home on node2:

setup.exe –detachHome –silent ORACLE_HOME=CRS_home

8. On node1, or in the case of a multiple node installation, on any node other than the one to be deleted, run the following command from CRS_home\oui\bin where node_list is a comma-delimited list of nodes that are to remain part of the Oracle Clusterware:

setup.exe –updateNodeList ORACLE_HOME=CRS_home"CLUSTER_NODES={node_list}" CRS=TRUE

9. On node2, stop and delete any services that are associated with this Oracle Clusterware home. In addition, delete any Registry entries and path entries that are associated with this Oracle Clusterware home. Also, delete all of the start menu items associated with this Oracle Clusterware home. Delete the central inventory and the files. For a shared Oracle home, do not delete the CRS_home.

Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Silent ModeUse the following procedure to delete an Oracle Clusterware home by using Oracle Universal Installer in silent mode:

1. Perform steps 1 through 6 from the previous section for using Oracle Universal Installer interactively under the heading "Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Interactive Mode" on page 4-13 to delete nodes at the Oracle Clusterware layer.

2. On node2, using Oracle Universal Installer setup.exe from CRS_home\oui\bin, deinstall the Oracle Clusterware home as follows:

setup.exe –silent -deinstall "REMOVE_HOMES={CRS_home}"

If you have a shared Oracle home, run the following command from CRS_home\oui\bin to detach the Oracle Clusterware home on node2:

setup.exe –detachHome –silent ORACLE_HOME=CRS_home

Adding and Deleting Oracle Clusterware Homes on Windows Systems

Adding and Deleting Oracle Clusterware Homes 4-15

3. Perform Step 8 and Step 9 from the previous procedure under "Deleting an Oracle Clusterware Home Using Oracle Universal Installer in Interactive Mode" to update the node list.

Adding and Deleting Oracle Clusterware Homes on Windows Systems

4-16 Oracle Clusterware Administration and Deployment Guide

Making Applications Highly Available Using Oracle Clusterware 5-1

5Making Applications Highly Available Using

Oracle Clusterware

This chapter explains how you can extend the high availability of the Oracle Clusterware framework to your applications. You do this by wrapping your applications with Oracle Clusterware commands. That is, you can use the same high availability mechanisms of Oracle Database and Oracle Real Application Clusters (Oracle RAC) to make your custom applications highly available. You can use Oracle Clusterware to monitor, relocate, and restart your applications as described in this chapter under the following topics:

■ Overview of Managing Custom Applications with Oracle Clusterware Commands

■ Creating Application Profiles

■ Example of Using Oracle Clusterware Commands to Create Application Resources

■ Oracle Clusterware Action Program Guidelines

■ Using Oracle Clusterware Commands

Overview of Using Oracle Clusterware Commands to Enable High Availability

Oracle Clusterware includes a high availability framework that provides an infrastructure to protect any application. Oracle Clusterware ensures that applications that it manages start when the system starts. Oracle Clusterware also monitors the applications to make sure that they are always available. For example, if a process fails, then Oracle Clusterware attempts to restart the process based on scripts that you customize. If a node in the cluster fails, then you can program processes that normally run on the failed node to restart on another node. The monitoring frequency, starting, and stopping of the applications and the application dependencies are configurable.

To make applications highly available, first create an application profile that identifies your application. The application profile uses a second component, an action program, that describes how Oracle Clusterware should monitor your application and how Oracle Clusterware should respond to changes in your application’s status. Oracle

See Also: Appendix D, "High Availability Oracle Clusterware Command-Line Reference and C API" for detailed information about Oracle Clusterware commands

Note: The Oracle Clusterware API demo is not supported for Windows.

Overview of Managing Custom Applications with Oracle Clusterware Commands

5-2 Oracle Clusterware Administration and Deployment Guide

Database stores application profile attributes in the OCR. The definitions of an application profile, action program, and the other primary Oracle Clusterware high availability components are as follows:

■ Action Program—A program that defines the location of your program, the monitoring frequency, and the start and stop actions that Oracle Clusterware should perform on the application. The start action starts the application, the stop action stops the application, and the monitoring or check action checks the application’s status.

■ Application Profile—An Oracle Clusterware resource file that describes the attributes of your application. An application profile influences your application’s behavior and it identifies other programs or scripts that Oracle Clusterware should run to perform various actions.

■ Privileges—Access and usage privileges that enable Oracle Clusterware to control all of the components of your application for high availability operations, including the right to start processes under other user identities. Oracle Clusterware must run as a privileged user to control applications with the correct start and stop processes. On Linux and UNIX platforms, this usually implies that Oracle Clusterware must run as the root user and on Windows platforms Oracle Clusterware must run as Administrator.

■ Resource—An entity that Oracle Clusterware manages for high availability such as your application.

■ Resource Dependency—A relationship among resources or applications that implies an operational ordering. For example, during a start operation, parent resources are started before resources that have dependencies. Stopping a resource is prevented if resources that depend on it are running,. However, you can force the termination with the crs_stop -f command in which case all resources that depend on the resource being stopped are stopped first.

■ Oracle Cluster Registry (OCR)—A mechanism that stores configuration information that Oracle Clusterware and other Oracle RAC manageability systems use. The OCR uses a hierarchical name space for key value pairs. Keys and subkeys have enforced user, group, and other permissions.

■ Template—A text file generated by the crs_profile command that contains the default values for application profile attributes.

Overview of Managing Custom Applications with Oracle Clusterware Commands

You can use Oracle Clusterware commands to start, stop, relocate, and check the status of your custom applications. Do this by defining your application with an application profile. The profile defines attributes that affect how Oracle Clusterware manages your

Note: A resource in the Oracle Clusterware context is not the same as a resource in the Oracle database sense, such as "Resource Manager." A resource that Oracle Clusterware refers to any entity managed by Oracle Clusterware, including application programs.

See Also: Appendix D, "High Availability Oracle Clusterware Command-Line Reference and C API" for more information about using Oracle Clusterware commands to make your applications highly available

Creating Application Profiles

Making Applications Highly Available Using Oracle Clusterware 5-3

application. Then register your application information in the OCR using the crs_register command. Use the following steps to create an application profile:

1. Create an application profile by running the crs_profile command. See Table 5–1 for a listing of required and optional entries for the Oracle Clusterware profiles.

2. Register the application profile using the crs_register command.

3. Run the crs_start command to initiate the application profile and then Oracle Clusterware runs the start command that you have included in the profile to start your application.

4. Oracle Clusterware periodically runs the action program command to check an application’s status.

5. In the event of a check or node failure, Oracle Clusterware recovers the application either by restarting it on the current node or by relocating the application to another node.

6. If you run the crs_stop command to stop the application, then Oracle Clusterware runs the stop action program command to stop it.

You can manage application availability as follows:

■ Specify starting resources during cluster or node start up

■ Restart applications that fail

■ Relocate applications to other nodes if they cannot run in their current location

Full administrative privileges are not required when using Oracle Clusterware. Any user can create resources or applications. However, the creator or owner must grant permission to other users or user groups in order for others to be able to use Oracle Clusterware on those applications. Additionally, profiles that have privileges defined can only be modified by privileged users. The following sections provide further details about application profiles.

Creating Application ProfilesApplication profiles have attributes that define how Oracle Clusterware starts, manages, and monitors applications. One attribute is the location of the action program that Oracle Clusterware uses to manipulate the application. Oracle Clusterware uses the action program to monitor or check the application status and to start and stop it. Oracle Database reads application profiles from files stored in specified locations and stores the information in the OCR. You use Oracle Clusterware commands in profiles to designate resource dependencies and to determine what happens to an application or service when it loses access to a resource on which it depends.

The following section describes profiles in more detail. The recommended method for creating profiles is to use the crs_profile command, which is described in detail in

Note: Do not use Oracle Clusterware commands prefixed with crs_ (except for crs_stat) on resources that have names beginning with the prefix ora unless Oracle Support Services asks you to. Instead, use the Server Control (SRVCTL) utility on Oracle resources. You can create resources that depend on resources that Oracle has defined. When creating resources, do not use an ora prefix in the resource name. This prefix is reserved for Oracle use only.

Creating Application Profiles

5-4 Oracle Clusterware Administration and Deployment Guide

Appendix D, "High Availability Oracle Clusterware Command-Line Reference and C API" on page D-4.

Application Resource ProfilesAttributes are defined by name=value entries in profile files and these entries can be in any order in the file. The following are some of the primary attributes of an application profile:

■ Resources that are required by an application which are defined by settings for the REQUIRED_RESOURCES parameter. Oracle Clusterware relocates or stops an application if a required resource becomes unavailable. Required resources are defined for each node.

■ Rules for choosing the node on which to start or restart an application are defined by settings for the PLACEMENT parameter. The application must be accessible by the nodes that you nominate for placement.

■ A list of nodes to use in order of preference when Oracle Clusterware starts or fails over an application which is defined by settings for the HOSTING_MEMBERS parameter. This list is used if the placement policy defined by the PLACEMENT parameter is favored or restricted.

■ The filenames of application profiles must be in the form resource_name.cap where resource_name is the name that you or the system assigns to an application and cap is the file suffix. The Oracle Clusterware commands in profiles refer to applications by name, such as resource_name, but not by the full filename.

Required and Optional Profile AttributesApplication profiles have optional and required profile attributes. Optional profile attributes may be left unspecified in the profile. Optional profile attributes that have default values are merged at registration time with the values that are stored in the template for that resource type and for the generic template. Default values are derived from the template.

Each resource type has a template file named TYPE_resource_type.cap that is stored in the template subdirectory under the crs directory of the Oracle Clusterware home. A generic template file for values that are used in all types of resources is stored in the same location in the file named TYPE_generic.cap.

Application Profile AttributesTable 5–1 lists the Oracle Clusterware application profile attributes in alphabetical order. For each attribute, the table shows whether the attribute is required, its default value, and an attribute description.

Creating Application Profiles

Making Applications Highly Available Using Oracle Clusterware 5-5

Table 5–1 Application Profile Attributes

Attribute Required Default Range Description

ACTION_SCRIPT

Yes None N/A The resource-specific script for starting, stopping, and checking a resource. You may specify a full path for the action program file. Otherwise, the default paths are used: CRS_home/crs/script for privileged, and CRS_home/crs/public for public. You may also specify a relative path with this default path as the starting point.

ACTIVE_PLACEMENT

No 0 0, 1 When set to 1, Oracle Clusterware reevaluates the placement of a resource during addition or restart of a cluster node.

AUTO_START No restore N/A Indicates whether Oracle Clusterware should automatically start a resource after a cluster restart. Valid AUTO_START values are:

■ always—Restarts the resource when the node restarts regardless of the state of the resource when the node stopped.

■ restore—Restores the resource to the same state that it was in when the node went down. If the state of the resource was offline (STATE=OFFLINE, TARGET=OFFLINE) when the node went down, then the resource remains offline when the node comes back up. The resource is started only if it was online before the node went down.

■ never—Oracle Clusterware never restarts the resource regardless of the state of the resource when the node stopped.

Note: Oracle only supports lower-case values for always, restore, and never.

CHECK_INTERVAL

No 60 0 or any positive integer

The time interval, in seconds, between repeated executions of the check entry point of a resource's action program. There can be some overhead associated if you set the check interval to a low value and enable frequent checks. Set to 0 to disable the attribute.

DESCRIPTION No Name of the resource

N/A A description of the resource.

FAILOVER_DELAY

No 0 0 or any positive integer

The amount of time, in seconds, that Oracle Clusterware waits before attempting to restart or fail over a resource. Set to 0 to enable immediate failover.

FAILURE_INTERVAL

No 0 0 or any positive integer

The interval, in seconds, during which Oracle Clusterware applies the failure threshold. If the value is zero (0), tracking of failures is disabled.

FAILURE_THRESHOLD

No 0 0 or any positive integer

The number of failures detected within a specified FAILURE_INTERVAL before Oracle Clusterware marks the resource as unavailable and no longer monitors it. If a resource's check script fails this number of times, then the resource is stopped and set offline. If the value is zero (0), tracking of failures is disabled.

Creating Application Profiles

5-6 Oracle Clusterware Administration and Deployment Guide

HOSTING_MEMBERS

Sometimes None N/A An ordered list of cluster nodes separated by blank spaces that can host the resource. This attribute is required only if PLACEMENT equals favored or restricted. This attribute must be empty if PLACEMENT equals balanced.

Enter node names as values for the HOSTING_MEMBERS attribute, not virtual host names or physical host names. Use the node names that you used when you installed Oracle Clusterware. The resources that you mention should contain the node name for the node on which they run. Run the olsnodes commands to see your node names. The HOSTING_MEMBERS Oracle Clusterware attribute is set automatically when these Oracle Clusterware resources are created; you do not need to take further action to ensure that the attribute is set.

The node name is usually the same as the physical host name. However, it can be different. For example, when vendor clusterware is present, the Oracle Clusterware nodes are named the same as the vendor clusterware nodes. Not all vendor clusterware implementations use the physical node names as node names. Use the lsnodes command to display vendor clusterware node names. When there is no vendor clusterware, then the Oracle Clusterware node names must be the same as the physical hostname.

NAME Yes None N/A The name of the application. The application name is a string that contains a combination of letters a-z or A-Z, and digits 0-9. The naming convention is to start with an alphanumeric prefix, such as sky1, and complete the name with an identifier to describe it. The name can contain any platform-supported characters except the exclamation point (!). However, the application name cannot begin with a period.

OPTIONAL_RESOURCES

No None N/A An ordered list of resource names separated by blank spaces that this resource uses during placement decisions. Up to 58 user-defined resources can be listed.

PLACEMENT No balanced N/A The placement policy (balanced, favored, or restricted) specifies how Oracle Clusterware chooses the cluster node on which to start the resource. Also, see "Application Placement Policies" on page 5-11.

REQUIRED_RESOURCES

No None N/A An ordered list of resource names separated by blank spaces that this resource depends on. Each resource to be used as a required resource in this profile must be registered with Oracle Clusterware or the resource’s profile registration will fail.

RESTART_ATTEMPTS

No 1 0 or any positive integer

The number of times that Oracle Clusterware attempts to restart a resource on a single cluster node before attempting to relocate the resource. A value of 1 means that Oracle Clusterware only attempts to restart the resource once on a node. A second failure causes an attempt to relocate the resource. If set to 0, there is no attempt to restart but always try to failover.

RESTART_COUNT

N/A The counter maintained by the Oracle Clusterware daemon for the number of times that a resource had been restarted. It goes from zero to RESTART_ATTEMPTS. This is also written to the OCR.

Table 5–1 (Cont.) Application Profile Attributes

Attribute Required Default Range Description

Creating Application Profiles

Making Applications Highly Available Using Oracle Clusterware 5-7

Default Profile LocationsProfiles may be located anywhere and need not be on a cluster-visible file system. Oracle RAC provides default locations for profiles as described in the next subsection. The default location for profiles with root privileges on Linux and UNIX systems, or Administrator privileges on Windows systems is the profile subdirectory under the crs directory of the Oracle Clusterware home. The default location for profiles with non-root or non-Administrator privileges is the public subdirectory under the crs directory of the Oracle Clusterware home. The action script must be located in the same directory on all nodes and must be the same file.

Using Entry Points to Manage Resources Using Oracle ClusterwareYou can use entry points to specify how to start a resource, stop a resource, and check a resource. You can implement entry points in various ways, such as by using shell or Perl scripts, C++ functions, or Java functions. Oracle Clusterware has the following entry points:

■ START—The start (online) entry point brings the resource online.

■ STOP—The stop (offline) entry point takes the resource offline.

■ CHECK—The check (monitor) entry point monitors the health of a resource.

SCRIPT_TIMEOUT

No 60 Any positive integer

The maximum time (in seconds) for an action script to execute. An error message is returned if the script does not complete within the time specified. The timeout applies to all action script entry points (start, stop, and check). If you do not specify a value, Oracle Clusterware assumes a default value of 60 seconds.

START_TIMEOUT

N/A The maximum time (in seconds) in which a start action script can run. An error message is returned if the script does not complete within the time specified. If you do not specify this attribute or you specify 0 seconds, then Oracle Clusterware uses the SCRIPT_TIMEOUT value.

STOP_TIMEOUT N/A The maximum time (in seconds) in which a stop action script can run. An error message is returned if the script does not complete within the time specified. If you do not specify this attribute or if you specify 0 seconds, then Oracle Clusterware uses the SCRIPT_TIMEOUT value.

TYPE Yes None N/A Must be set to application.

UPTIME_THRESHOLD

Any positive integer

The value for UPTIME_THRESHOLD represents the length of time that a resource must be up before Oracle Clusterware considers the resource to be stable. By setting a value for the UPTIME_THESHOLD attribute, you can indicate a resource's stability. The form of this attribute is xd where x is a positive integer and d is one of the following denominations:

s: seconds (the default)m: minutesh: hoursd: daysw: weeks

For example: 30 is 30 seconds (because s is the default), 10m is 10 minutes, and so on.

Table 5–1 (Cont.) Application Profile Attributes

Attribute Required Default Range Description

Example of Using Oracle Clusterware Commands to Create Application Resources

5-8 Oracle Clusterware Administration and Deployment Guide

Example of Using Oracle Clusterware Commands to Create Application Resources

The example in this section creates an application named postman. Oracle Clusterware uses the script /opt/email/bin/crs_postman to start, stop, and monitor whether the application is running (action_script). Oracle Clusterware checks postman every five seconds as specified by the setting for the check_interval attribute. Oracle Clusterware restarts postman no more than once if it fails. When deciding on which node to place the postman application, Oracle Clusterware considers the value for the optional_resources parameter. If possible, Oracle Clusterware places postman on the same node. Finally, for postman to run, the resource network1 must be running on the same node as specified by the setting for the required_resources parameter. If network1 fails or if it is relocated to another node, then Oracle Clusterware stops or moves the postman application.

Using the crs_profile Command to Create An Application Resource ProfileTo create an action profile, you use the crs_profile command. Example 5–1 uses the crs_profile command to create an application profile for the postman action script that is used to monitor email.

Example 5–1 Using the crs_profile Command to Create an Action Profile

$ crs_profile -create postman -t application -B /opt/email/bin/crs_postman \-d "Email Application" -r network1 -l application2 \-a postman.scr -o ci=5,ft=2,fi=12,ra=2

The contents of the application profile file that the example creates are as follows:

NAME=postmanTYPE=applicationACTION_SCRIPT=/oracle/crs/script/postman.scrACTIVE_PLACEMENT=0AUTO_START=alwaysCHECK_INTERVAL=5DESCRIPTION=email appFAILOVER_DELAY=0FAILURE_INTERVAL=12FAILURE_THRESHOLD=2HOSTING_MEMBERS=OPTIONAL_RESOURCES=application2PLACEMENT=balancedREQUIRED_RESOURCES=network1RESTART_ATTEMPTS=2

Note: Do not use the Oracle Clusterware commands prefixed with crs_ (except for crs_stat) on resources that have names beginning with the prefix ora unless either Oracle Support Services ask you to, or unless Oracle has certified you as described in https://metalink.oracle.com.

Instead, use the Server Control (SRVCTL) utility on Oracle resources. You can create resources that depend on resources that Oracle has defined. When creating resources, do not use an ora prefix in the resource name. This prefix is reserved for Oracle use only.

You can also use the Oracle Clusterware commands to inspect the configuration and status.

Example of Using Oracle Clusterware Commands to Create Application Resources

Making Applications Highly Available Using Oracle Clusterware 5-9

SCRIPT_TIMEOUT=60

A good example of an action script is the xclock script, which is a simple action script that is a default binary on all Linux and UNIX platforms. Example 5–2 shows the contents of the xclock action script.

Example 5–2 Action Script Example: xclock

#!/bin/bash# start/stop/check script for xclock example# To test this change BIN_DIR to the directory where xclock is based# and set the DISPLAY variable to a server within your network. BIN_DIR=/usr/X11R6/binLOG_DIR=/tmpBIN_NAME=xclockDISPLAY=yourhost.domain.com:0.0export DISPLAY if [ ! -d $BIN_DIR ]then echo "start failed" exit 2fi PID1=`ps -ef | grep $BIN_NAME | grep -v grep | grep -v xclock_app | awk '{ print $2 }'` case $1 in'start') if [ "$PID1" != "" ] then status_p1="running" else if [ -x $BIN_DIR/$BIN_NAME ] then umask 002 ${BIN_DIR}/${BIN_NAME} & 2>${LOG_DIR}/${BIN_NAME}.log status_p1="started" else echo `basename $0`": $BIN_NAME: Executable not found" fi fi echo "$BIN_NAME: $status_p1" ;; 'stop') if [ "${PID1}" != "" ] then kill -9 ${PID1} && echo "$BIN_NAME daemon killed" else echo "$BIN_NAME: no running Process!" fi ;;'check') if [ "$PID1" != "" ] then echo "running"

Example of Using Oracle Clusterware Commands to Create Application Resources

5-10 Oracle Clusterware Administration and Deployment Guide

exit 0 else echo "not running" exit 1 fi ;;*) echo "Usage: "`basename $0`" {start|stop|check}" ;;esac

Oracle Clusterware Required Resources ListOracle Clusterware uses the required resources list, with the placement policy and hosting nodes list, to determine the cluster nodes that are eligible to host an application. Required resources must be ONLINE on the nodes on which the application is running or started.

The failure of a required resource on a hosting node causes Oracle Clusterware to attempt to restart the application on the current node. If RESTART_ATTEMPTS is not set to 0, and if the application cannot start on the current node, then Oracle Clusterware attempts to fail the application over to another node that provides the required resource. Alternatively, Oracle Clusterware stops the application if there is no suitable node. In this case, Oracle Clusterware posts a not restarting event notification.

You can also use required resource lists to start, stop, and relocate groups of interdependent applications when you use the crs_start, crs_stop, or crs_relocate commands with the force (-f) option. In other words, you can configure a set of resources to have other required resources. For example you can configure resources A, B, and C where A and B depend on C. Then if you stop resource C with -force option, this action will stop all three resources. The same is true for the crs_relocate command.

In addition, using the -force option can relocate an online dependency, if necessary, to enable the starting of a resource. For instance, assume that resource VIP_A that has a primary node assignment of node A is running instead on node B. If you perform a crs_start on the instance resource for node A, then this operation will fail. This is because the VIP_A resource is required for the instance on node A and the VIP_A resource is online. However, the resource is on a node where the resource cannot run. Performing a crs_start -f on instance B, however, forces the VIP_A resource to relocate and then start the instance.

Creating VIPs for ApplicationsIf your application is accessed by way of a network, then Oracle recommends that you create a virtual internet protocol address for the application as a dependent resource. In the previous example, network1 is an application VIP address. Create application VIP addresses as follows:

crs_profile –create network1 -t application \-a CRS_home/bin/usrvip \-o oi=eth0,ov=138.3.83.78,on=255.255.240.0

See Also: Appendix D, "High Availability Oracle Clusterware Command-Line Reference and C API" for an explanation of the command options

Example of Using Oracle Clusterware Commands to Create Application Resources

Making Applications Highly Available Using Oracle Clusterware 5-11

In this example, CRS_home is the home directory for the Oracle Clusterware installation. In addition, eth0 is the name of the public network adapter, 138.3.83.78 which resolves by way of DNS to a new hostname that will locate your application regardless of the node on which it is running. Finally, 255.255.240.0 is the netmask for the public IP address. As the oracle user, register the VIP address with Oracle Clusterware as follows:

crs_register network1

On Linux and UNIX operating systems, the application VIP address script must run as the root user. As the root user, change the owner of the resource as follows:

crs_setperm network1 –o root

As the root user, enable the oracle user to run this script:

crs_setperm network1 –u user:oracle:r-x

As the oracle user, start the VIP address as follows:

crs_start network1

Application Placement PoliciesThe placement policy specifies how Oracle Clusterware selects a node on which to start an application and where to relocate the application after a node failure. Only cluster nodes on which all of the required resources are available, as listed in an application's profile, are eligible to be considered as hosting nodes for an application. Oracle Clusterware supports the following placement policies:

■ balanced—Oracle Clusterware favors starting or restarting the application on the node that is currently running the fewest resources. A placement that is based on optional resources is considered first. Next, the host with the fewest resources running is chosen. If no node is favored by these criteria, then any available node is chosen.

■ favored—Oracle Clusterware refers to the list of nodes in the HOSTING_MEMBERS attribute of the application profile. Only cluster nodes that are in this list and that satisfy the resource requirements are eligible for placement consideration. Placement due to optional resources is considered first. If no node is eligible based on optional resources, then the order of the hosting nodes determines which node runs the application. If none of the nodes in the hosting node list are available, then Oracle Clusterware places the application on any available node. This node may or may not be included in the HOSTING_MEMBERS list.

■ restricted—Similar to favored except that if none of the nodes on the hosting list are available, then Oracle Clusterware does not start or restart the application. A restricted placement policy ensures that the application never runs on a node that is not on the list, even if you manually relocate it to that node.

You must specify hosting nodes in the HOSTING_MEMBERS attribute to use a favored or restricted placement policy. Do not specify hosting nodes in the HOSTING_MEMBERS attribute with a balanced placement policy. Otherwise, the application will not validate and you cannot register it. If ACTIVE_PLACEMENT is set to 1, then the placement of the application is reevaluated whenever you add a node to the cluster or

Note: In the case of a user VIP, you must use the usrvip action script that Oracle Database provides in the CRS home/bin directory.

Oracle Clusterware Action Program Guidelines

5-12 Oracle Clusterware Administration and Deployment Guide

if the cluster node restarts. This enables Oracle Clusterware to relocate applications to a preferred node after the node recovers from a failure.

Optional Resources in Placement DecisionsOracle Clusterware uses optional resources to choose a hosting node based on the number of optional resources that are in an ONLINE state on the hosting node. If each node has an equal number of optional resources in an ONLINE state, then Oracle Clusterware considers the order of the optional resources as follows:

■ Oracle Clusterware compares the state of the optional resources on each node starting at the first resource that you list in the application profile and then proceeds through the list.

■ For each consecutive resource in your list, if the resource is ONLINE on one node, then any node that does not have the resource ONLINE is not considered.

■ Oracle Clusterware evaluates each resource in the list in this manner until only one node is available to host the resource.

■ The maximum number of optional resources is 58.

If this algorithm results in multiple preferred nodes, then the resource is placed on one of these nodes chosen according to its placement policy.

Oracle Clusterware Action Program GuidelinesThis section provides the following guidelines for writing Oracle Clusterware action programs that interpret Oracle Clusterware start, stop, and check commands:

■ Oracle Clusterware relies on a status code upon exiting from an action program to set the resource state to ONLINE or OFFLINE. On Windows systems, the program should be nonblocking, which may imply a Windows service or a Windows resource that does not block during console interactions.

■ Action programs must return a status code to indicate success or failure. For example, on Linux or UNIX systems, if the script is written in Bourne Shell, then the program should issue exit(1) to indicate failure, and exit(0) to indicate success. Similarly, on Windows systems, the action program should return a status of (0) to indicate success and (1) to indicate failure.

■ After application failure, Oracle Clusterware calls the action program with a check parameter. The action program replies to Oracle Clusterware with a status of (1) to indicate failure. After receiving this failure status, Oracle Clusterware calls the action program with a stop parameter. It is important that the action program returns a status of (0) to indicate a successful stop of the application, even though the application was not running when the stop request was called.

■ Oracle Clusterware sets the resource state to UNKNOWN if an action program's stop entry point fails to exit within the number of seconds in the SCRIPT_TIMEOUT value, or if the action program returns with a code that indicates failure. This may occur during a start, relocation, or stop operation. Ensure that the action program’s stop entry point exits with a value that indicates success if the resource is successfully stopped or if the resource is not running.

■ When a daemon or service starts, it usually needs to start as a background process, depending on the platform. However, a resource started in this way always returns success during a start attempt. This means that the default scripts cannot detect failures caused by minor errors, such as misspelled command paths.

Oracle Clusterware Action Program Guidelines

Making Applications Highly Available Using Oracle Clusterware 5-13

■ On Linux and UNIX systems, if a resource does not move to the background immediately upon startup, then you can start the application in the background by adding an ampersand (&) to the end of the line that starts the application.

■ On Windows systems, you can use net start to start a service that needs to start in the background.

■ When using commands to start daemons or services in the background, interactively make test runs of the commands used in the script to eliminate errors before using the script with Oracle Clusterware.

How Oracle Clusterware Runs Action Programs This section describes how Oracle Clusterware runs action programs. The first argument to an action program is the command start, stop, or check depending on which action Oracle Clusterware is running. The second argument is the Oracle Clusterware resource name of the application. This enables a script to determine which instance of the resource that Oracle Clusterware is starting, stopping, or checking.

An action program can retrieve any of its Oracle Clusterware resource attributes from the environment by using $_CAA_attribute_name. For example, $_CAA_NAME contains the application name, the second argument to the script, and $_CAA_HOSTING_MEMBERS contains its HOSTING_MEMBERS attribute.

User Defined AttributesOracle Clusterware supports user-defined attributes in Oracle Clusterware applications, which are attributes having names that contain USR. User-defined attributes are stored as part of the Oracle Clusterware application profile for the application. You can reference them in an action program using $_USR_attributename.

To add a user-defined attribute, add it to the file CRS_home/crs/template/application.tdf using the following syntax:

## an example user-defined attribute##!===========================attribute: USR_EXAMPLEtype: stringswitch: -o exampledefault:required: no

The attribute parameter contains the name of the new attribute. The type parameter defines the type of the user-defined attribute and can be one of the following:

■ string

■ boolean

■ integer—a numeric attribute

■ positive_integer—a numeric attribute that must be positive

■ name string

■ name_list—a comma-delimited list of names

Using Oracle Clusterware Commands

5-14 Oracle Clusterware Administration and Deployment Guide

The switch parameter describes how the attribute is specified for the crs_profile command. Set the required field to no for user-defined attributes.

Windows crsuser ProgramThis section describes the Windows crsuser program. The syntax for the crsuser command is:

crsuser add [domain\]username

For example, on a Windows system you could issue the following command as an operating system user that is part of ORA_DBA group and the Local Administrator group:

C:\> crsuser add oracledomain\oracluster

Provide the user's Windows password. This creates the OracleCRSToken_user service that Oracle Clusterware needs to start the Oracle Clusterware resources under the given user ID (when they are not running as the LocalSystem account). You can also use the crsuser command: remove [domain\]username to remove a token service and crsuser list to list a registered users.

Using Oracle Clusterware CommandsThis section describes how to use the Oracle Clusterware commands under the following topics:

■ Registering Application Resources

■ Starting Application Resources

■ Relocating Applications and Application Resources

■ Stopping Applications and Application Resources

■ Unregistering Applications and Application Resources

■ Displaying Clusterware Application and Application Resource Status Information

Registering Application ResourcesEach application that you manage with Oracle Clusterware must have an application profile and the profile must be added to OCR. Use the crsctl add resource command to add applications to OCR. For example, enter the following command to add the mail monitoring application from the previous example:

# crsctl add resource postman

If you modify a profile, then update the OCR by running the crs_register -u command again.

Starting Application ResourcesTo start an application resource that is registered with Oracle Clusterware, use the crs_start command. For example:

# crs_start postman

Note: User-defined attribute names that begin with USR_ORA are reserved for use by Oracle.

Using Oracle Clusterware Commands

Making Applications Highly Available Using Oracle Clusterware 5-15

The following text is an example of the command output:

Attempting to start ’postman’ on node ’rac1’Start of ’postman’ on node ’rac1’ succeeded.

The application now runs on the node named rac1.

The command waits for the amount of time specified by the setting for the SCRIPT_TIMEOUT parameter to receive a notification of success or failure from the action program each time the action program is called. Application resources can be started if they have stopped due to exceeding their failure threshold values. You must register a resource with crs_register before you can start it.

To start and stop the resources, use the crs_start and crs_stop commands. Manual starts or stops outside of Oracle Clusterware can invalidate the resource status. In addition, Oracle Clusterware may attempt to restart a resource on which you perform a manual stop operation.

All required resources must be online on the node where you start the resource. If the resources that the REQUIRED_RESOURCES parameter identifies are offline, then the command crs_start resource_name will start the required resources before starting the resource.

Running the crs_start command on a resource sets the resource target value to ONLINE. Oracle Clusterware attempts to change the state to match the target by running the action program with the start parameter. When a resource is running, both the target state and current state are ONLINE.

Starting an Application on an Unavailable NodeWhen starting an application on a cluster node that is unavailable, crs_start can give indeterminate results. In this scenario, the start section of the action program is run, but the cluster node fails before notification of the start is displayed on the command line. The crs_start command returns a failure with the error Remote start for resource_name failed on node node_name. The application is actually ONLINE but fails over to another node making the application appear as though it were started on the incorrect node.

If a cluster node fails while you are starting a resource on that node, then check the state of the resource on the cluster by using the crs_stat command to determine the state of that resource.

Relocating Applications and Application ResourcesUse the crs_relocate command to relocate applications and application resources. For example, to relocate the mail monitoring application to the node known as rac2, enter the following command:

# crs_relocate postman -c rac2

Note: The name of the application resource may or may not be the same as the name of the application.

See Also : Appendix D, "High Availability Oracle Clusterware Command-Line Reference and C API" for examples of Oracle Clusterware command output

Using Oracle Clusterware Commands

5-16 Oracle Clusterware Administration and Deployment Guide

Each time that the action program is called, the crs_relocate command waits for the duration identified by the value for the SCRIPT_TIMEOUT parameter to receive notification of success or failure from the action program. A relocation attempt fails if:

■ The application has required resources that are ONLINE on the initial node

■ Applications that require the specified resource are ONLINE on the initial node

To relocate an application and its required resources, use the -f option with the crs_relocate command. Oracle Clusterware relocates or starts all resources that are required by the application regardless of their state.

Stopping Applications and Application ResourcesTo stop applications and application resources, use the crs_stop command. Immediately after the crs_stop command completes, the application status converts to OFFLINE. Because Oracle Clusterware always attempts to match a resource's state to its target, the Oracle Clusterware subsystem stops the application. The following example stops the mail application from the example:

# crs_stop postman

The following text is an example of the command output:

Attempting to stop `postman` on node `rac1`Stop of `postman` on node `rac1` succeeded.

You cannot stop an application if the application is a required resource for another online application unless you use the force (-f) option. If you use the crs_stop -f resource_name command on an application that is required by other resources and if those resources are online, then Oracle Clusterware stops the application. In addition, all of the resources that require that application that are online are also stopped.

Managing Automatic Oracle Clusterware Resource Operations for Action ScriptsThe following section explains additional information for controlling how Oracle Clusterware manages restarts. You can prevent Oracle Clusterware from automatically restarting a resource by setting several action program attributes. You can also control how Oracle Clusterware manages the restart counters for your action programs. In addition, you can customize the timeout values for the start, stop, and check actions that Oracle Clusterware performs on action scripts. These topics are described under the following headings:

■ Preventing Automatic Restarts

■ Automatically Manage Restart Attempts Counter for Resources

■ Implications of Restart and Timeout Features for Previous Releases

Preventing Automatic RestartsWhen a node stops and restarts, Oracle Clusterware starts the resources as soon as the node starts. This may not be desirable because resource startup might fail if system components on which the resource depends, such as a volume manager or a file

Note: Oracle Clusterware can only stop applications and application resources. Oracle Clusterware cannot stop network, tape, or media changer resources.

Using Oracle Clusterware Commands

Making Applications Highly Available Using Oracle Clusterware 5-17

system, are not running. This is especially true if Oracle Clusterware does not manage the system components on which the resource depends. To manage automatic restarts, you can use the AUTO_START attribute to specify whether Oracle Clusterware should automatically start a resource when a node restarts.

Valid AUTO_START values are:

■ always—Causes the resource to restart when the node restarts regardless of the resource’s state when the node stopped.

■ restore—Does not start the resource at restart time if it was in an offline state, such as STATE=OFFLINE, TARGET=OFFLINE, when the node stopped. The resource is restored to its state when the node went down. The resource is started only if it was online before and not otherwise.

■ never—Oracle Clusterware never restarts the resource regardless of the resource’s state when the node stopped.

Automatically Manage Restart Attempts Counter for ResourcesWhen a resource fails, Oracle Clusterware restarts the resource for only the number of times specified in the profile attribute RESTART_ATTEMPTS regardless of how often the resource fails. The CRSD process maintains an internal counter to track how often a resource has been restarted. There is a mechanism by which Oracle Clusterware can automatically manage the restart attempts counter based on the stability of a resource. Use the UPTIME_THESHOLD attribute to indicate resource stability.

You can specify the time for the UPTIME_THRESHOLD attribute in different units of measure, such as seconds (s), minutes (m), hours (h), days (d) or weeks (w). Examples of valid values for this attribute are: 7d for seven days, 5h for five hours, 180m for 180 minutes, and so on. Specify the time period as a numeric value and the unit of measure as the last character, s, m, h, d, or w.

After the time period that you have indicated by the setting for UPTIME_THRESHOLD has elapsed, Oracle Clusterware resets the value for RESTART_COUNTS to 0. Oracle Clusterware can alert you when the value for RESTART_COUNT reaches the value that you have set for RESTART_ATTEMPTS.

RESTART_ATTEMPTS and RESTART_COUNT Failure Scenarios Some of the failure scenarios for the RESTART_ATTEMPTS and RESTART_COUNT attributes are:

■ When a resource keeps restarting—The resource does not meet the uptime threshold criteria and it will be stopped after restarting for the number of attempts set by the value for RESTART_ATTEMPTS.

■ When a node fails or restarts—Oracle Clusterware resets the value for RESTART_COUNTER to 0 either when the resource relocates or when it restarts on the same node.

Note: Oracle only supports lower-case values for always, restore, and never.

Note: Oracle Clusterware writes an alert to the CRSD log file when the value for RESTART_COUNT reaches the value that you have set for RESTART_ATTEMPTS. Oracle Clusterware does not write this message to the alert file.

Using Oracle Clusterware Commands

5-18 Oracle Clusterware Administration and Deployment Guide

■ If the crsd process fails—Because both RESTART_COUNT and RESTART_ATTEMPTS are stored in OCR, the behavior is not affected.

Implications of Restart and Timeout Features for Previous Releases If you have an installation that is prior to Oracle Database 11g, add the release 1 (11.1) attributes to your profiles by doing one of the following:

■ Modify your resources with the Oracle Database 11g attributes and reregister the resources. This causes the separate timeouts to be effective.

■ Do not modify your resources. This retains the pre-release behavior of Oracle Database 11g using the value in SCRIPT_TIMEOUT as the timeout for all start, stop, and check actions.

Unregistering Applications and Application ResourcesTo unregister an application or an application resource, use the crs_unregister command. You cannot unregister an application or resource that is ONLINE or required by another resource. The following example unregisters the mail application:

# crs_unregister postman

The unregistration process frees space on the OCR. Additionally, run the crs_unregister command as a clean-up step when a resource is no longer managed by Oracle Clusterware. Generally, you should unregister all permanently stopped applications.

Displaying Clusterware Application and Application Resource Status InformationTo display status information about applications and resources that are on cluster nodes, use the crs_stat command. The following example displays the status information for the postman application:

# crs_stat postmanNAME=postmanTYPE=applicationTARGET=ONLINESTATE=ONLINE on rac2

Enter the following command to view information about all applications and resources in tabular format:

# crs_stat -t

The following text is an example of the command output:

Name Type Target State Host----------------------------------------------------------------cluster_lockd application ONLINE ONLINE rac2dhcp application OFFLINE OFFLINE

Enter the following command to determine:

■ How many times an application resource has been restarted

■ How many times an application resource has failed within the failure interval

■ The maximum number of times that an application or resource can be restarted or fail

■ The target state of the application or resource and the normal status information

Using Oracle Clusterware Commands

Making Applications Highly Available Using Oracle Clusterware 5-19

# crs_stat -v

To view verbose content in tabular format, enter the following command:

# crs_stat -v -t

The following text is an example of the command output:

Name Type R/RA F/FT Target State Host----------------------------------------------------------------------cluster_lockd application 0/30 0/0 ONLINE ONLINE rac2dhcp application 0/1 0/0 OFFLINE OFFLINEnamed application 0/1 0/0 OFFLINE OFFLINEnetwork1 application 0/1 0/0 ONLINE ONLINE rac1

Enter the following command to view the application profile information that is stored in the OCR:

# crs_stat -p

The following text is an example of the command output:

NAME=cluster_lockdTYPE=applicationACTION_SCRIPT=cluster_lockd.scrACTIVE_PLACEMENT=0AUTO_START=restoreCHECK_INTERVAL=5DESCRIPTION=Cluster lockd/statdFAILOVER_DELAY=30FAILURE_INTERVAL=60FAILURE_THRESHOLD=1HOSTING_MEMBERS=OPTIONAL_RESOURCES=PLACEMENT=balancedREQUIRED_RESOURCES=RESTART_ATTEMPTS=2SCRIPT_TIMEOUT=60 ...

See the crs_stat command for more information.

See Also: Appendix D, "High Availability Oracle Clusterware Command-Line Reference and C API" for detailed information about Oracle Clusterware commands

Using Oracle Clusterware Commands

5-20 Oracle Clusterware Administration and Deployment Guide

Cluster Verification Utility Reference A-1

ACluster Verification Utility Reference

Oracle provides Cluster Verification Utility (CVU) to perform system checks in preparation for installation, patch updates, or other system changes. Learning how to use CVU can ensure that you have completed the required system configuration and preinstallation steps so that your installation, update, or patch operation completes successfully.

This appendix describes the CVU under the following topics:

■ Using the Cluster Verification Utility

■ Cluster Verification Utility Requirements

■ Understanding CVU Commands, Help, Output, and Nodelist Shortcuts

■ Performing Various CVU Tests

■ Known Issues for the Cluster Verification Utility

Using the Cluster Verification UtilityThe CVU can verify the primary cluster components during an operational phase or stage. A component can be basic, such as free disk space, or it can be complex, such as checking Oracle Clusterware integrity. For example, CVU can verify multiple Oracle Clusterware subcomponents across Oracle Clusterware layers. Additionally, CVU can check disk space, memory, processes, and other important cluster components. A stage could be, for example, database installation, for which CVU can verify whether your system meets the criteria for an Oracle Real Application Clusters (Oracle RAC) installation. Other stages include the initial hardware setup and the establishing of system requirements through the fully operational cluster setup.

When verifying stages, CVU uses entry and exit criteria. In other words, each stage has entry criteria that define a specific set of verification tasks to be performed before initiating that stage. This check prevents you from beginning a stage, such as installing Oracle Clusterware, unless you meet the Oracle Clusterware stage's prerequisites.

The exit criteria for a stage define another set of verification tasks that you need to perform after the completion of the stage. Post-checks ensure that the activities for that stage have been completed. Post-checks identify stage-specific problems before they propagate to subsequent stages.

The node list that you use with CVU commands should be a comma-delimited list of host names without a domain. The CVU ignores domains while processing node lists.

See Also: Your platform-specific Oracle Clusterware and Oracle Real Application Clusters installation guide for information about how to manually install CVU

Cluster Verification Utility Requirements

A-2 Oracle Clusterware Administration and Deployment Guide

If a CVU command entry has duplicate node entries after removing domain information, then CVU eliminates the duplicate node entries. Wherever supported, you can use the -n all option to verify all of your cluster nodes that are part of a specific Oracle RAC installation. You do not have to be the root user to use the CVU and the CVU assumes that the current user is the oracle user.

For network connectivity verification, the CVU discovers all of the available network interfaces if you do not specify an interface on the CVU command line. For storage accessibility verification, the CVU discovers shared storage for all of the supported storage types if you do not specify a particular storage identification on the command line. The CVU also discovers the Oracle Clusterware home if one is available.

Run the CVU command-line tool using the cluvfy command. Using cluvfy does not adversely affect your cluster environment or your installed software. You can run cluvfy commands at any time, even before the Oracle Clusterware installation. In fact, the CVU is designed to assist you as soon as your hardware and operating system are operational. If you run a command that requires Oracle Clusterware on a node, then the CVU reports an error if Oracle Clusterware is not yet installed on that node.

You can enable tracing by setting the environment variable SRVM_TRACE to true. For example, in tcsh an entry such as setenv SRVM_TRACE true enables tracing. The CVU trace files are created in the CV_HOME/cv/log directory. Oracle Database automatically rotates the log files and the most recently created log file has the name cvutrace.log.0. You should remove unwanted log files or archive them to reclaim disk place if needed. The CVU does not generate trace files unless you enable tracing.

Cluster Verification Utility RequirementsThe CVU requirements are:

■ At least 30MB free space for the CVU software on the node from which you run the CVU

■ A location for the current JDK, Java 1.4.1 or later

■ A work directory with at least 25MB free space on each node

Understanding CVU Commands, Help, Output, and Nodelist Shortcuts This section describes the following Cluster Verification Utility topics:

■ Using CVU Help

■ Verbose Mode and UNKNOWN Output

Note: The CVU only supports an English-based syntax and English online help.

Note: When using the CVU, the CVU attempts to copy any needed information to the CVU work directory. Make sure that the CVU work directory exists on all of the nodes in your cluster database and that the directory on each node has write permissions established for the CVU user. Set this directory using the CV_DESTLOC environment variable. If you do not set this variable, then the CVU uses /tmp as the work directory on Linux and UNIX systems, and C:\temp on Windows systems.

Understanding CVU Commands, Help, Output, and Nodelist Shortcuts

Cluster Verification Utility Reference A-3

■ Cluster Verification Utility Nodelist Shortcuts

Using CVU HelpThe cluvfy commands have context sensitive help that shows their usage based on the command-line arguments that you enter. For example, if you enter cluvfy, then the CVU displays high-level generic usage text describing the stage and component syntax. If you enter cluvfy comp -list, then the CVU shows the valid components with brief descriptions about each of them. If you enter cluvfy comp -help, then the CVU shows detailed syntax for each of the valid component checks. Similarly, cluvfy stage -list and cluvfy stage -help display valid stages and their syntax for their checks respectively.

If you enter an invalid CVU command, then the CVU shows the correct usage for that command. For example, if you type cluvfy stage -pre dbinst, then CVU shows the correct syntax for the precheck commands for the dbinst stage. Enter the cluvfy -help command to see detailed CVU command information.

Verbose Mode and UNKNOWN OutputAlthough by default the CVU reports in nonverbose mode by only reporting the summary of a test, you can obtain detailed output by using the -verbose argument. The -verbose argument produces detailed output of individual checks and where applicable shows results for each node in a tabular layout.

If a cluvfy command responds with UNKNOWN for a particular node, then this is because the CVU cannot determine whether a check passed or failed. The cause of this could be a loss of reachability or the failure of user equivalence to that node. The cause could also be any system problem that was occurring on that node at the time that CVU was performing a check.

If you run the CVU using the -verbose argument and the CVU responds with UNKNOWN for a particular node, then this is because the CVU cannot determine whether a check passed or failed. The following is a list of possible causes for an UNKNOWN response:

■ The node is down

■ Executables that the CVU requires are missing in CRS_home/bin or the Oracle home directory

■ The user account that ran the CVU does not have privileges to run common operating system executables on the node

■ The node is missing an operating system patch or a required package

■ The node has exceeded the maximum number of processes or maximum number of open files, or there is a problem with IPC segments, such as shared memory or semaphores

Cluster Verification Utility Nodelist ShortcutsYou can use the following nodelist shortcuts:

To provide the CVU a list of all of the nodes of a cluster, enter -n all. CVU attempts to obtain the node list in the following order:

1. If vendor clusterware is available, then the CVU selects all of the configured nodes from the vendor clusterware using the lsnodes utility.

Understanding CVU Commands, Help, Output, and Nodelist Shortcuts

A-4 Oracle Clusterware Administration and Deployment Guide

2. If Oracle Clusterware is installed, then the CVU selects all of the configured nodes from Oracle Clusterware using the olsnodes utility.

3. If neither the vendor nor Oracle Clusterware is installed, then the CVU searches for a value for the CV_NODE_ALL key in the configuration file.

4. If vendor and Oracle Clusterware are not installed and no key named CV_NODE_ALL exists in the configuration file, then the CVU searches for a value for the CV_NODE_ALL environmental variable.

If you have not set this variable, then the CVU reports an error.

To provide a partial node list, you can set an environmental variable and use it in the CVU command. For example, on Linux or UNIX systems you can enter:

setenv MYNODES node1,node3,node5cluvfy comp nodecon -n $MYNODES [-verbose]

Cluster Verification Utility Configuration File You can use the CVU configuration file to define specific inputs for the execution of the CVU. The path for the configuration file is CV_HOME/cv/admin/cvu_config. You can modify this using a text editor. The inputs to the tool are defined in the form of key entries. You must follow these rules when modifying the CVU configuration file:

■ Key entries have the syntax name=value

■ Each key entry and the value assigned to the key only defines one property

■ Lines beginning with the number sign (#) are comment lines and are ignored

■ Lines that do not follow the syntax name=value are ignored

The following is the list of keys supported by CVU:

■ CV_NODE_ALL—If set, it specifies the list of nodes that should be picked up when Oracle Clusterware is not installed and a -n all option has been used in the command line. By default, this entry is commented out.

■ CV_RAW_CHECK_ENABLED—If set to TRUE, it enables the check for accessibility of shared disks on RedHat release 3.0. This shared disk accessibility check requires that you install a cvuqdisk rpm on all of the nodes. By default, this key is set to TRUE and shared disk check is enabled.

■ CV_XCHK_FOR_SSH_ENABLED—If set to TRUE, it enables the X-Windows check for verifying user equivalence with ssh. By default, this entry is commented out and X-Windows check is disabled.

■ ORACLE_SRVM_REMOTESHELL—If set, it specifies the location for ssh/rsh command to override the CVU default value. By default, this entry is commented out and the tool uses /usr/sbin/ssh and /usr/sbin/rsh.

■ ORACLE_SRVM_REMOTECOPY—If set, it specifies the location for the scp or rcp command to override the CVU default value. By default, this entry is commented out and CVU uses /usr/bin/scp and /usr/sbin/rcp.

If CVU does not find a key entry defined in the configuration file, then the CVU searches for the environment variable that matches the name of the key. If the environment variable is set, then the CVU uses its value, otherwise the CVU uses a default value for that entity.

Performing Various CVU Tests

Cluster Verification Utility Reference A-5

Performing Various CVU TestsYou can perform the following tests using CVU as described under the following topics:

■ Cluster Verification Utility System Requirements Verifications

■ Cluster Verification Utility Storage Verifications

■ Cluster Verification Utility Connectivity Verifications

■ Cluster Verification Utility User and Permissions Verifications

■ Cluster Verification Utility Node Comparisons and Verifications

■ Cluster Verification Utility Installation Verifications

■ Cluster Verification Utility Oracle Clusterware Component Verifications

■ Cluster Verification Utility Cluster Integrity Verifications

■ Cluster Verification Utility Argument and Option Definitions

Cluster Verification Utility System Requirements VerificationsTo verify the minimal system requirements on the nodes prior to installing Oracle Clusterware or Oracle RAC, use the sys component verification command as follows:

cluvfy comp sys [ -n node_list ] -p { crs | database } } [-r { 10gR1 | 10gR2 | 11gR1} ] [ -osdba osdba_group ] [ -orainv orainventory_group ] [-verbose]

To check the system requirements for installing Oracle RAC, use the -p database argument, and to check the system requirements for installing Oracle Clusterware, use the -p crs argument. To check the system requirements for installing Oracle Clusterware or Oracle RAC from Oracle Database 11g Release 1 (11.1), use the -r 11gR1 argument. For example, verify the system requirements for installing Oracle Clusterware on the cluster nodes known as node1,node2 and node3 by running the following command:

cluvfy comp sys -n node1,node2,node3 -p crs -verbose

Cluster Verification Utility Storage VerificationsTo verify whether storage is shared among the nodes in your cluster database or to identify all of the storage that is available on the system and can be shared across the cluster nodes, use the component verification command ssa as follows:

cluvfy comp ssa [ -n node_list ] [ -s storageID_list ] [-verbose]

For example, discover all of the shared storage systems available on your system by running the following command:

cluvfy comp ssa -n all -verbose

You can verify the accessibility of a specific storage location, such as /dev/sda, across the cluster nodes by running the following command:

See Also: Table A–1 for details about the arguments and options used in the following CVU examples

See Also: "Known Issues for the Cluster Verification Utility" on page A-9 for the types of storage that CVU supports

Performing Various CVU Tests

A-6 Oracle Clusterware Administration and Deployment Guide

cluvfy comp ssa -n all -s /dev/sda

To verify whether a certain amount of free space is available on a specific location in the nodes of your cluster database, use the component verification command space.

cluvfy comp space [ -n node_list ] -l storage_location -z disk_space {B|K|M|G} [-verbose]

For example, you can verify the availability of at least 2 GB of free space at the location /home/dbadmin/products on all of the cluster nodes by running the following command:

cluvfy comp space -n all -l / home/dbadmin/products –z 2G -verbose

To verify the integrity of your Oracle Cluster File System (OCFS) on platforms on which OCFS is available, use the component verification command cfs as follows:

cluvfy comp cfs [ -n node_list ] -f file_system [-verbose]

For example, you can verify the integrity of the cluster file system /oradbshare on all of the nodes by running the following command:

cluvfy comp cfs -f /oradbshare –n all -verbose

Cluster Verification Utility Connectivity VerificationsTo verify the cluster nodes can be reached from the local node or from any other cluster node, use the component verification command nodereach as follows:

cluvfy comp nodereach -n node_list [ -srcnode node ] [-verbose]

To verify the connectivity between the cluster nodes through all of the available network interfaces or through specific network interfaces, use the component verification command nodecon as follows:

cluvfy comp nodecon -n node_list [ -i interface_list ] [-verbose]

Use the nodecon command without the -i option as follows to use CVU to:

■ Discover all of the network interfaces that are available on the cluster nodes

■ Review the interfaces’ corresponding IP addresses and subnets

■ Obtain the list of interfaces that are suitable for use as VIPs and the list of interfaces to private interconnects

■ Verify the connectivity between all of the nodes through those interfaces

cluvfy comp nodecon -n all [-verbose]

You can run this command in verbose mode to identify the mappings between the interfaces, IP addresses, and subnets. To verify the connectivity between all of the nodes through specific network interfaces, use the comp nodecon command with the -i option. For example, you can verify the connectivity between the nodes

Note: The space component does not support block or raw devices.

Note: The sharededness check for the file system is supported for OCFS2 version 1.2.1 or higher.

Performing Various CVU Tests

Cluster Verification Utility Reference A-7

node1,node2, and node3, through interface eth0 by running the following command:

cluvfy comp nodecon -n node1,node2,node3 –i eth0 -verbose

Cluster Verification Utility User and Permissions VerificationsTo verify user accounts and administrative permissions-related issues, use the component verification command admprv as follows:

cluvfy comp admprv [ -n node_list ] [-verbose] | -o user_equiv [-sshonly] | -o crs_inst [-orainv orainventory_group ] | -o db_inst [-orainv orainventory_group ] [-osdba osdba_group ] | -o db_config -d oracle_home

To verify whether user equivalence exists on specific nodes, use the -o user_equiv argument. On Linux and UNIX platforms, this command verifies user equivalence first using ssh and then using rsh, if the ssh check fails. To verify the equivalence only through ssh, use the -sshonly option. By default, the equivalence check does not verify X-Windows configurations, such as whether you have disabled X-forwarding, whether you have the proper setting for the DISPLAY environment variable, and so on.

To verify X-Windows aspects during user equivalence checks, set the CV_XCHK_FOR_SSH_ENABLED key to TRUE in the configuration file that resides in the path CV_HOME/cv/admin/cvu_config before you run the admprv -o user_equiv command. Use the -o crs_inst argument to verify whether you have permissions to install Oracle Clusterware.

You can use the -o db_inst argument to verify the permissions that are required for installing Oracle RAC and the -o db_config argument to verify the permissions that are required for creating an Oracle RAC database or for modifying an Oracle RAC database's configuration. For example, you can verify user equivalence for all of the nodes by running the following command:

cluvfy comp admprv -n all -o user_equiv -verbose

On Linux and UNIX platforms, this command verifies user equivalence by first using ssh and then using rsh if the ssh check fails. To verify the equivalence only through ssh, use the -sshonly option. By default, the equivalence check does not verify X-Windows configurations, such as when you have disabled X-forwarding with the setting of the DISPLAY environment variable. To verify X-Windows aspects during user equivalence checks, set the CV_XCHK_FOR_SSH_ENABLED key to TRUE in the configuration file CV_HOME/cv/admin/cvu_config before you run the admprv -o user_equiv command.

To verify the existence of node applications, namely VIP, ONS and GSD, on all of the nodes, use the component nodeapp command:

cluvfy comp nodeapp [ -n node_list ] [-verbose]

Cluster Verification Utility Node Comparisons and VerificationsUse the component verification peer command to compare the nodes as follows:

cluvfy comp peer [ -refnode node ] -n node_list [-r { 10gR1 | 10gR2 | 11gR1} ] [ -orainv orainventory_group ] [ -osdba osdba_group ] [-verbose]

Performing Various CVU Tests

A-8 Oracle Clusterware Administration and Deployment Guide

The following command lists the values of several preselected properties on different nodes from Oracle Database 11g Release 1 (11.1):

cluvfy comp peer -n node_list [-r 11gR1] [-verbose]

You can also use the comp peer command with the -refnode argument to compare the properties of other nodes against the reference node.

Cluster Verification Utility Installation VerificationsTo verify whether your system meets all of the criteria for an Oracle Clusterware installation, use the -pre crsinst command for the Oracle Clusterware installation stage as follows:

cluvfy stage -pre crsinst -n node_list [ -c ocr_location ] [-r { 10gR1 | 10gR2 | 11gR1} ][ -q voting_disk ] [ -osdba osdba_group ] [ -orainv orainventory_group ] [-verbose]

After you have completed phase one, verify that Oracle Clusterware is functioning properly before proceeding with phase two of your Oracle RAC installation by running the -post crsinst command for the Oracle Clusterware installation stage:

cluvfy stage -post crsinst -n node_list [-verbose]

To verify whether your system meets all of the criteria for an Oracle RAC installation, use the pre dbinst command for the Database Installation stage:

cluvfy stage -pre dbinst -n node_list [-r { 10gR1 | 10gR2 | 11gR1} ] [ -osdba osdba_group ] [ -orainv orainventory_group ] [-verbose]

To verify whether your system meets all of the criteria for creating a database or for making a database configuration change, use the pre dbcfg command for the Database Configuration stage:

cluvfy stage -pre dbcfg -n node_list -d oracle_home [-verbose]

Cluster Verification Utility Cluster Integrity VerificationsTo check the integrity of your entire cluster, which means to verify that all of the nodes in the cluster have the same view of the cluster configuration, use the component verification command comp clu, as follows:

cluvfy comp clu

Cluster Verification Utility Oracle Clusterware Component VerificationsTo verify the integrity of all of the Oracle Clusterware components, use the component verification comp crs command:

cluvfy comp crs [ -n node_list ] [-verbose]

To verify the integrity of each individual Cluster Manager subcomponent, use the component verification command comp clumgr:

cluvfy comp clumgr [ -n node_list ] [-verbose]

To verify the integrity of the Oracle Cluster Registry, use the component verification command comp ocr:

cluvfy comp ocr [ -n node_list ] [-verbose]

Known Issues for the Cluster Verification Utility

Cluster Verification Utility Reference A-9

Cluster Verification Utility Argument and Option DefinitionsTable A–1 describes the CVU arguments and options used in the previous examples:

Known Issues for the Cluster Verification UtilityThis section describes the following known limitations for CVU:

■ Database Versions Supported by Cluster Verification Utility

■ Linux Shared Storage Accessibility (ssa) Check Reports Limitations

■ Shared Disk Discovery on Red Hat Linux

Database Versions Supported by Cluster Verification UtilityThe current CVU release supports only Oracle Database 10g or higher, Oracle RAC, and Oracle Clusterware and CVU is not backward compatible. In other words, CVU cannot check or verify Oracle Database products prior to Oracle Database 10g.

Linux Shared Storage Accessibility (ssa) Check Reports LimitationsThe current release of cluvfy has the following limitations on Linux regarding shared storage accessibility check.

Table A–1 Cluster Verification Utility Arguments and Options

Argument or Option Definition

-n node_list The comma-delimited list of nondomain qualified node names on which the test should be conducted. If all is specified, then all of the nodes in the cluster will be used for verification.

-i interface_list The comma-delimited list of interface names.

-f file_system The name of the file system.

-s storageID_list The comma-delimited list of storage identifiers.

-l storage_location The storage path.

-z disk_space The required disk space, in units of bytes (B), kilobytes (K), megabytes (M), or gigabytes (G).

-osdba osdba_group The name of the OSDBA group. The default is dba.

-orainv orainventory_group The name of the Oracle inventory group. The default is oinstall.

-verbose Makes CVU print detailed output.

-o user_equiv Checks user equivalence between the nodes.

-sshonly Check user equivalence for ssh setup only.

-o crs_inst Checks administrative privileges for installing Oracle Clusterware.

-o db_inst Checks administrative privileges for installing Oracle RAC.

-o db_config Checks administrative privileges for creating or configuring a database.

-refnode The node that will be used as a reference for checking compatibility with other nodes.

-srcnode The node from which the reachability to other nodes should be checked.

-r { 10gR1 | 10gR2 | 11gR1} The Oracle Database release for which the requirements for installation of Oracle Clusterware or Oracle RAC are to be verified. If this option is not specified, then Oracle Database 11g Release 1 (11.1) is assumed.

Known Issues for the Cluster Verification Utility

A-10 Oracle Clusterware Administration and Deployment Guide

■ Currently NAS storage (r/w, no attribute caching) and OCFS2 (version 1.2.1 or higher) are supported.

■ For sharedness checks on NAS, cluvfy commands require you to have write permission on the specified path. If the cluvfy user does not have write permission, cluvfy reports the path as not shared.

Shared Disk Discovery on Red Hat LinuxTo perform discovery and shared storage accessibility checks for SCSI disks on Red Hat Linux 3.0 (or higher) and SUSE Linux Enterprise Server, CVU requires the CVUQDISK package. If you attempt to use CVU and the CVUQDISK package is not installed on all of the nodes in your Oracle RAC environment, then CVU responds with an error.

Perform the following procedure to install the CVUQDISK package:

1. Login as the root user.

2. Copy the rpm, cvuqdisk-1.0.1-1.rpm, to a local directory. You can find this rpm in the rpm subdirectory of the top-most directory in the Oracle Clusterware installation media. For example, you can find cvuqdisk-1.0.1-1.rpm in the directory /mountpoint/clusterware/rpm/ where mountpoint is the mounting point for the disk on which the directory is located.

3. Set the environment variable to a group that should own the CVUQDISK package binaries. If CVUQDISK_GRP is not set, then by default the oinstall group is the owner’s group.

4. Determine whether previous versions of the CVUQDISK package are installed by running the command rpm -q cvuqdisk. If you find previous versions of the CVUQDISK package, then remove them by running the command rpm -e cvuqdisk previous_version where previous_version is the identifier of the previous CVUQDISK version.

5. Install the latest CVUQDISK package by running the command rpm -iv cvuqdisk-1.0.1-1.rpm.

OLSNODES Command Reference B-1

BOLSNODES Command Reference

This appendix describes the syntax and command options for the OLSNODES command. The OLSNODES command provides the list of nodes and other information for all nodes participating in the cluster. The syntax for the OLSNODES command is:

olsnodes [-n] [-i] [-l] [-v] [-g] [-p]

If you issue the OLSNODES command without any command parameters, the command returns a listing of the nodes in the cluster:

[user1@node1 myoracle]# olsnodesnode1node2node3node4

Table B–1 describes the options you can include on the OLSNODES command to obtain additional cluster-related information.

Table B–1 OLSNODES Command Options

Option Description

-g Logs cluster verification information with more details.

-i Lists all nodes participating in the cluster and includes the Virtual Internet Protocol (VIP) address assigned to each node.

-l Displays the local node name.

-n Lists al nodes participating in the cluster and includes the assigned node numbers.

-p Lists all nodes participating in the cluster and includes the private interconnect assigned to each node.

-v Logs cluster verification information in verbose mode.

B-2 Oracle Clusterware Administration and Deployment Guide

Oracle Interface Configuration (OIFCFG) Command Reference C-1

COracle Interface Configuration (OIFCFG)

Command Reference

The Oracle Interface Configuration (OIFCFG) command-line interface helps you to define and administer network interfaces. You can issue OIFCFG commands in single-instance and Oracle Clusterware environments to:

■ Allocate and deallocate network interfaces to components

■ Direct components to use specific network interfaces

■ Retrieve component configuration information

This appendix contains the following main topics:

■ Starting the OIFCFG Command-Line Interface

■ Summary of the OIFCFG Usage

Starting the OIFCFG Command-Line InterfaceBefore you invoke OIFCFG, ensure that you have started Oracle Clusterware on at least the local node and preferably on all nodes if you intend to include the -global option on the command.

You invoke OIFCFG from the CRS_home/bin/ directory as the CRS user who installed the Oracle Clusterware software.

For example:

% ./oifcfg

Issue the oifcfg -help command to display online help for OIFCFG.

oifcfg iflist oifcfg setif {-node node_name | -global} {if_name/subnet:if_type}... oifcfg getif [-node node_name | -global] [-if if_name[subnet [-type if_type]] oifcfg delif [-node node_name | -global] [if_name/subnet] oifcfg [-help] node_name - name of the host, as known to a communications network if_name - name by which the interface is configured in the system subnet - subnet address of the interface if_type - type of the interface {cluster_interconnect|public|storage}

Summary of the OIFCFG UsageThis section contains the following topics:

Summary of the OIFCFG Usage

C-2 Oracle Clusterware Administration and Deployment Guide

■ OIFCFG Command Format

■ OIFCFG Commands

■ OIFCFG Usage Notes

■ OIFCFG Examples

OIFCFG Command Formatoifcfg iflistoifcfg setif {-node node_name | -global} {if_name/subnet:if_type} ...oifcfg getif [-node node_name | -global] [-if if_name/subnet [-type if_type]]oifcfg delif [-node node_name | -global] [if_name/subnet]oifcfg [-help]

OIFCFG CommandsYou can enter any of the OIFCFG commands listed in Table C–1.

OIFCFG Command ParametersThis section lists the parameters for the OIFCFG commands. Note that some of the parameters are optional, depending on which command you issue.

-node node_nameThe name of the Oracle Clusterware node as listed in the output from the olsnodes command. The OLSNODES command is described in Appendix B, "OLSNODES Command Reference".

-globalA network interface can be stored as a global interface (as reported by the iflist command) or as a node-specific interface:

■ An interface is stored as a global interface when all of the nodes of an Oracle Real Application Clusters (Oracle RAC) cluster have the same interface connected to the same subnet. The global interface (and configuring all nodes with the same network interface for each public subnet and the same network interface for each private subnet) is not only the recommended configuration, but it is also the default installation configuration.

■ An interface can be stored as a node-specific (local) interface.

Table C–1 OIFCFG Commands

Command Description

oifcfg iflist Shows the available interfaces that you can configure with setif. The iflist command queries the operating system to find which network interfaces are present on this node.

oifcfg setif Sets an interface type (public or cluster interconnect) for an interface.

oifcfg getif Displays the interfaces for which an interface type has been defined with the setif command, along with the type for that interface.

oifcfg delif Deletes the stored network configuration for global or node-specific interfaces.

Summary of the OIFCFG Usage

Oracle Interface Configuration (OIFCFG) Command Reference C-3

-if if_nameThe name by which the interface is configured in the system.

subnetThe subnet number of the interface.

-type if_typeThe type of interface: public or cluster interconnect.

-helpDisplay online help for OIFCFG commands.

OIFCFG Usage Notes■ A network interface specification takes the following form:

interface_name/subnet:interface_type

The specification uniquely identifies the network interface using the:

– Interface name

– Associated subnet

– Interface type

The interface type indicates the purpose for which the network is configured. The supported interface types are:

* Public—An interface that can be used for communication with components external to Oracle RAC instances, such as Oracle Net and Virtual Internet Protocol (VIP) addresses.

* Cluster_interconnect—A private interface used for the cluster interconnect to provide interinstance or Cache Fusion1 communication.

If you set the interface type to cluster_interconnect, it affects instances as they start up and changes do not take effect until you restart the instances.

For example, the following specification identifies qfe0 as a cluster interconnect located at the address 204.152.65.0:

qfe0/204.152.65.0:cluster_interconnect

■ The Oracle Universal Installer uses OIFCFG to identify and display available interfaces.

■ The effect of changing the interface names depends on which name you are changing, and whether or not you are also changing the IP address. In cases where you change only the interface names, the ramifications are minor. If you change

Note: Oracle currently does not support having different network interfaces for each node in the cluster. The best practice is to configure all nodes with the same network interface for each public subnet and the same network interface for each private subnet. See "Changing Network Addresses" on page 2-11 for information about changing interface names.

1 Cache Fusion is a diskless cache coherency mechanism that provides copies of blocks directly from a holding instance's memory cache to a requesting instance's memory cache.

Summary of the OIFCFG Usage

C-4 Oracle Clusterware Administration and Deployment Guide

the name for the public interface that is stored in the OCR, you must modify the nodeapps for each node. Therefore, you must stop the nodeapps for this change to take effect.

■ You must restart Oracle Clusterware on all members of the cluster when you make global changes. For local changes, you only need to perform a node restart. Interconnect changes for the database occur at instance startup. However, the interconnect for Oracle Clusterware might be different.

■ Because interconnects are chosen when instances start, just issuing OIFCFG commands does not have an immediate effect on the running system. Instead, changes take effect after restarting the component that might be affected by the command.

OIFCFG ExamplesThe following examples show some common uses for the OIFCFG commands.

Example 1 Listing the Names of Network InterfacesYou can use OIFCFG to list the interface names and the subnets of all of the interfaces available on the local node by executing the iflist keyword, as shown in this example:

oifcfg iflisthme0 139.185.141.0qfe0 204.152.65.0

Example 2 Retrieving Network InformationYou can also retrieve specific OIFCFG information with a getif command.

oifcfg getif [ [-global | -node nodename] [-if if_name[/subnet]] [-type if_type] ]

For example, after you install Oracle Clusterware, you can verify that the public and cluster interconnect have been set to the desired values by entering the following commands as root:

$ oifcfg getif

This command should return values for global public and global cluster_interconnect. For example:

en0 144.25.68.0 global publichme0 139.185.141.0 global cluster_interconnect

Example 3 Storing a New Global InterfaceTo store a new interface, use the setif keyword. For example, to store the interface hme0, with the subnet 139.185.141.0, as a global interface (to be used as an interconnect for all of the Oracle RAC instances in your cluster), you would use the command:

oifcfg setif -global hme0/139.185.141.0:cluster_interconnect

For a cluster interconnect that exists between only two nodes, for example rac1 and rac2, you could create the cms0 interface with the following commands, assuming 139.185.142.0 is the subnet number for the interconnect on rac1 and rac2, respectively:

oifcfg setif -global cms0/139.185.142.0:cluster_interconnect

Summary of the OIFCFG Usage

Oracle Interface Configuration (OIFCFG) Command Reference C-5

Example 4 Deleting the Stored InterfaceUse the OIFCFG delif command to delete the stored configuration for global or node-specific interfaces. A specific node-specific or global interface can be deleted by supplying the interface name, with an optional subnet, on the command line. Without the -node or -global options, the delif keyword deletes either the given interface or all of the global and node-specific interfaces on all of the nodes in the cluster.

For example, the following command deletes the global interface named qfe0 for the subnet 204.152.65.0:

oifcfg delif -global qfe0/204.152.65.0

The following command deletes all of the global interfaces assigned with OIFCFG:

oifcfg delif -global

Summary of the OIFCFG Usage

C-6 Oracle Clusterware Administration and Deployment Guide

High Availability Oracle Clusterware Command-Line Reference and C API D-1

DHigh Availability Oracle Clusterware

Command-Line Reference and C API

This appendix describes the Oracle Clusterware application program interface (API) command reference and includes the following topics:

■ Using Oracle Clusterware Commands

■ Oracle Clusterware Commands

■ C Application Programming Interface to Oracle Clusterware

■ Functions for Managing Resource Structures

Using Oracle Clusterware CommandsThis section explains how to use the Oracle Clusterware commands to manage applications and application resources under the Oracle Clusterware framework.

Application Profile SyntaxThe examples in this appendix show the command syntaxes that you use in application profiles. Lines starting with a pound sign (#) are comment lines and are not processed as part of a profile. A backslash (\) at the end of a line indicates that the next line is a continuation of the previous line. See the section titled "Using the crs_profile Command to Create An Application Resource Profile" on page 5-8 to see a profile example.

Security and PermissionsOracle Clusterware uses security similar to Linux or UNIX systems, where permissions may be specified for the owner, nodes of a group, or holders of specific privileges. There may also be default permissions for other users. These permissions are read, write, and run on Windows systems, and rwx on Linux or UNIX systems. Default ownership and permissions are set when you create an application profile. Table D–1 shows the Oracle Clusterware commands and their owners and describes

See Also: Chapter 5, "Making Applications Highly Available Using Oracle Clusterware" for detailed information about using Oracle Clusterware to make applications highly available

Note: The Oracle Clusterware API demo is not supported for Windows.

Oracle Clusterware Commands

D-2 Oracle Clusterware Administration and Deployment Guide

which commands require write permission and which commands require run permission.

Oracle Clusterware CommandsTable D–2 alphabetically lists the Oracle Clusterware commands that this appendix describes. Note that the -q option is available for all commands and enables the command to run in quiet mode. This means that no messages are displayed on the console.

Table D–1 Oracle Clusterware Command Owner and Permissions Matrix

Owner Read Write Permission RequiredRun/Execute Permission Required

User crs_stat {any options} crs_register {no arguments}

crs_register -u

crs_unregister

crs_start

crs_stop

crs_relocate

crs_stat {no arguments}

crs_stat -l -t -r

Group crs_stat {any options} crs_register {no arguments}

crs_register -u

crs_unregister

crs_start

crs_stop

crs_relocate

crs_stat {no arguments}

crs_stat -l -t -r

Other crs_stat {any options} crs_register {no arguments}

crs_register -u

crs_unregister

crs_start

crs_stop

crs_relocate

crs_stat {no arguments}

crs_stat -l -t -r

Note: On platforms that do not have a group concept, the group permissions are NULL or not set.

Note: Do not use the Oracle Clusterware commands prefixed with crs_ (except for crs_stat) on resources that have names beginning with the prefix ora unless either Oracle Support asks you to, or unless Oracle has certified you as described in https://metalink.oracle.com. Server Control (SRVCTL) is the correct utility to use on Oracle resources. You can create resources that depend on resources that Oracle has defined. You can also use the Oracle Clusterware commands to inspect the configuration and status.

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-3

crs_getpermLists the permissions defined for the resource. If any user other than root needs access to a resource that was created by the root user, you need to explicitly give them access. However, resources registered by users other than root are owned by the registering user, and can see their own resources. The access definitions are similar to Unix.

Syntax and Options for crs_getpermUse the crs_getperm command with the following syntax:

crs_getperm resource_name [-u user|-g group] [-q]

To obtain the permissions associated with a resource, use the following syntax:

crs_getperm resource_name

Example of crs_getpermTo list the permissions associated with the postman application, use the following command:

crs_getperm postman

====================Name: postmanowner:root:rwx,pgrp:oinstall:r-x,other::r--,user:oracle:r-x,=====================

Table D–2 Oracle Clusterware Commands Summary

Command Description

crs_getperm on page D-3 Lists the permissions associated with a resource.

crs_profile on page D-4 Creates, validates, deletes, and updates an Oracle Clusterware application profile

crs_register on page D-17 Registers configuration information for an application with the OCR.

crs_relocate on page D-9 Relocates an application profile to another node.

crs_setperm on page D-12 Sets permissions associated with a resource.

crs_stat on page D-12 Lists the status of an application profile.

crs_start on page D-15 Starts applications that have been registered.

crs_stop on page D-16 Stops an Oracle Clusterware application.

crs_unregister on page D-17 Removes the configuration information for an application profile from the OCR.

Note: On platforms that do not have a group concept, the group permissions are NULL or not set.

See Also: The crs_setperm command on page D-12 to modify the permissions associated with a resource

Oracle Clusterware Commands

D-4 Oracle Clusterware Administration and Deployment Guide

crs_profileCreates, validates, deletes, and updates an Oracle Clusterware application profile. It works on the user’s copy of a profile.

You can also use crs_profile to generate a template script. The crs_profile command creates new application profiles and validates, updates, deletes, or lists existing profiles. An application profile assigns values to attributes that define how a resource should be managed or monitored in a cluster. For the root user, profiles are written to the CRS_home/crs/profile directory. For nonprivileged users, the profile is written to the CRS_home/crs/public directory. Values in the profile that are left blank are ignored and may be omitted if not required. Omitted profile variables that are required for a resource type can cause validation or registration to fail.

After you have created an application profile and registered the application with Oracle Clusterware using the crs_register command, you can use other Oracle Clusterware commands, such as crs_stat, crs_start, crs_stop, crs_relocate, and crs_unregister, on the application. You must register applications using the crs_register command before the other Oracle Clusterware commands are available to manage the application. The crs_profile command may have other options for defining user-defined attributes in a profile.

Syntax and Options for the crs_profile CommandUse the crs_profile command with the following syntax to create an application profile template:

crs_profile -create resource_name -t application [-a action_script] [-B executable_pathname] [-dir directory] [-d description] [-p placement_policy] [-h hosting_nodes] [-r required_resources] [-l optional_resources] [-o option,[...]] [attribute_flag attribute_value] [...] [-f] [-q]

crs_profile -create resource_name -t application [-dir directory_path] [-a action_script] [-B binary_pathname] [-d description] [-h hosting_members] [-r required_resources] [-l optional_resources] [-p placement_policy] [-o as=auto_start,ci=check_interval,ft=failure_threshold, fi=failure_interval,ra=restart_attempts,fd=failover_delay, st=script_timeout,ap=active_placement,bt=rebalance, ut=uptime_threshold,rt=start_timeout,pt=stop_timeout] [-f] [-q]

To create an application profile from an application profile template you use crs_profile to create a template (a cap) file. This file is then read automatically by the crs_register command which reads the data from the cap file and puts it in the OCR. For example:

crs_profile -create resource_name -I template_file [-f] [-q]

To validate the application profile syntax of a profile, enter the following command:

Table D–3 crs_getperm Options

Option Description

-g group Obtains the permissions associated with the specified (operating system) group.

-q Runs the command in quiet mode (no messages are displayed on the console).

resource_name Obtains the permissions associated with a specific resource.

-u user Obtains the permissions associated with the specified user.

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-5

crs_profile -validate resource_name [-q]

To list one or more application profiles:

crs_profile -print [resource_name [...]] [-q]

To create an application profile template from an existing application profile:

crs_profile -template resource_name [-O template_file] [-q]

To update an application profile:

crs_profile -update resource_name [option [...]] [-q]

To delete an application profile and its associated action program:

crs_profile -delete resource_name [-q]

Note: The crs_profile -delete command deletes the resource profile file but does not delete the action script file.

Table D–4 Options for the crs_profile Command

Option Description

[attribute_flag attribute_value]

All crs_profile commands take the following format:

crs_profile attribute_flag attribute_value

This command sets the specified attribute (whose flag is specified by attribute_flag) to the given value (whose value is specified by attribute_value) in the profile that gets generated. For example, the following command sets the -validate attribute to the value resource_name:

crs_profile -validate resource_name

[-a action_script] Specifies the action program for the application. An action program has start, stop, and check entry points that are called by Oracle Clusterware. You can specify either a full path name for the script file or its filename.

[-B executable_pathname]

Specifies the location of the application executable and causes the crs_profile command to generate an action program file that contains the application executable path name.

-create resource_name

Creates a resource profile for the indicated application according to the specified options. A resource_name is a string containing a combination of characters [a-z, A-Z, 0-9, '.','_']. The resource name may not start with a period (.). The maximum length for a resource name is 128 characters. Resource profiles are created in the directory that you specify.

[-d description] Specifies the description of the application. Enclose the description in double quotation marks ("") if it contains white space. If you do not specify an application description, then Oracle Clusterware uses the application name.

[-dir directory] Specifies the file location where the profile is located.

[-h hosting_nodes] Specifies an ordered list of nodes, separated by white space, that can host the application. If there is more than one node, then the list must be enclosed in double quotation marks (""). If you specify the -p option with a placement policy of preferred or unavailable, then you must also specify the -h option.

[-l optional_resources]

Specifies an ordered list of optional resources, separated by white space. If there are multiple optional resources specified, then the list must be enclosed in double quotation marks ("").

Oracle Clusterware Commands

D-6 Oracle Clusterware Administration and Deployment Guide

[-o option[,...]] Specifies a comma-delimited list of attribute option names and their values. To see the list of attribute option names, see Table D–5, " Options for the -o Flag" on page D-6 or run the crs_profile -help command.

[-p placement_policy] Specifies the policy according to which Oracle Clusterware selects the node on which to start or restart the application. You can specify balanced, favored, or restricted as placement policies. All policies except for balanced require you to also specify the -h flag.

-q Runs the command in quiet mode (no messages are displayed on the console).

[-r required_resources]

Specifies an ordered list of resources, separated by spaces, on which the application depends. These resources must be active on any node on which the application is running. If there are multiple required resources, then the list must be enclosed in double quotation marks (""). If you do not specify a required resources list, then Oracle Clusterware imposes no required dependencies upon the application resource.

-t application Indicates an application or application resource type.

-validate Validates the application profile syntax of a profile.

Table D–5 Options for the -o Flag

Option Description

ap=active_placement When set to 1, it reevaluates the placement policy when a new cluster node becomes available. With placement policies of FAVORED and RESTRICTED,a highly available application relocates to a more highly favored cluster node. The default is 0.

as=auto_start Indicates whether Oracle Clusterware automatically starts a resource after a cluster restart. Valid AUTO_START values are:

■ always—Restarts the resource when the node restarts regardless of the state of the resource when the node stopped.

■ restore—Restores the resource to the same state that it was in when the node went down. If the state of the resource was offline (STATE=OFFLINE, TARGET=OFFLINE) when the node went down, then the resource remains offline when the node comes back up. The resource is started only if it was online before the node went down.

■ never—Oracle Clusterware never restarts the resource regardless of the state of the resource when the node stopped.

Note: Oracle only supports lower-case values for always, restore, and never.

ci=check_interval Sets the time (in seconds) at which the check entry point of the application's action program runs. The check interval is the maximum amount of time that an application can be unavailable to clients before Oracle Clusterware attempts to restart it. The default check interval is 60 seconds.

fd=failover_delay Specifies the number of seconds that Oracle Clusterware waits before attempting to relocate the application resource. An application resource that was running on a cluster node that failed restarts immediately on that node if it becomes available again within the failover delay period.

If application resources are dependent on each other due to required resources defined in application profiles, then all interdependent application resources wait for the largest failover delay that you have defined for any of the resources. The default failover delay is 0 (zero) seconds.

fi=failure_interval Specify the time (in seconds) during which the failure threshold is calculated and applied. The default failure interval is 0 (zero). Oracle Clusterware uses a default failure interval of 0, which turns off monitoring of failure threshold and failure intervals. Specifying a nonzero failure interval has no effect unless the failure threshold is also nonzero.

Table D–4 (Cont.) Options for the crs_profile Command

Option Description

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-7

Examples of the crs_profile CommandThe following is an example of using the crs_profile command to create the application profile for the postman application:

crs_profile -create postman \-t application \-a postman.sh \-o ci=5,ra=2"

The following is an example of using the crs_profile command to validate the application profile named dtcalc:

crs_profile -validate dtcalc

If you do not specify either the -a or -B options, then the profile is created without an action program value. You should create a readable, executable script and update the profile with this script's value before attempting to start the application. If you specify the -a option alone, then the action program specified must exist at the specified location or in the default directory if no path is specified. Otherwise, the command fails.

crs_registerThe crs_register command registers one or more applications specified with the resource_name parameter for each application. This command requires write access to the target application. This command only succeeds if the profile is found either in the default location or in the directory specified by the -dir option. An application must be registered for Oracle Clusterware to monitor the resource or to start, restart, or relocate a highly available application associated with an application resource. An application registration must be updated for any changes to an application profile to take effect. Also, see the "Security and Permissions" section on page D-1 for information about the privileges required for this command.

An application can be registered or updated only if the CRS daemon is active and an Oracle Clusterware application profile exists in the profile directory for this application. If fields are missing from an application profile, then the profile is merged with the default profile template and Oracle uses the values in the default profile template.

ft=failure_threshold Specifies the number of times that Oracle Clusterware may detect an application or application resource failure within the failure interval before it marks the application or application resource as unavailable and stops monitoring it. The value must be in the range 0-20. Setting the value to 0 (zero) turns off failure threshold monitoring. If you do not specify a failure threshold, then Oracle Clusterware uses a default failure threshold of 0 (zero).

ra=restart_attempts Specifies the number of times that Oracle Clusterware attempts to restart the application or application resource on the current node before attempting to relocate it. The default number of restart attempts is 1.

st=script_timeout Sets the maximum number of seconds within which an action program can run. The Oracle Clusterware commands and daemon return an error message and post an event to Event Manager (EVM) when they invoke action program entry points and the script does not complete its execution within this time. The default is 60 seconds.

Table D–5 (Cont.) Options for the -o Flag

Option Description

Oracle Clusterware Commands

D-8 Oracle Clusterware Administration and Deployment Guide

The ownership and default permissions for the application are set during the registration process. You can register any .cap file as long as the permissions for the file permit read and write, for example, crs_profile and crs_register must be able to read the file. By default, the user who registers the application is the owner of the application. If the profile cannot be registered, then Oracle Clusterware displays messages explaining why. Use the crs_stat command to verify that the application is registered.

Syntax and Options for the crs_register CommandYou can use the crs_register command to register and update applications. Use the following crs_register syntax to register an application:

crs_register resource_name [-dir directory_path] [...] [-u] [-f] [-q]

crs_register resource_name -update [-a action_script][-d description] [-h hosting_members] [-r required_resources][-l optional_resources] [-p placement_policy][-o as=auto_start,ci=check_interval,ft=failure_threshold,fi=failure_interval,ra=restart_attempts,fd=failover_delay,st=script_timeout,ap=active_placement,bt=rebalance,ut=uptime_threshold,rt=start_timeout,pt=stop_timeout] [-q]

The resource_name [...] parameter can be the name of one or more application resources as specified in an application profile. If you do not specify any options, then the crs_relocate command relocates each specified application resource according to its placement policy and required resource lists. Oracle Clusterware does not relocate a resource if there are interdependent application resources unless you specify the -f option. A profile must exist for the application that you are registering.

Table D–6 Options on the crs_register Command

Option Description

[-dir directory_path]

Specifies the directory where the .cap file is located if the .cap file is not located in the default directory.

-f Enables you to register a resource even though you have no execute permission on one or more required resources.

-u Ensures the changes take effect immediately. Use crs_register -u immediately after issuing a crs_profile -update command or after manually editing an application profile.

[attribute_flag attribute_value]

All crs_profile commands take the following format:

crs_profile attribute_flag attribute_value

This command sets the specified attribute (whose flag is specified by attribute_flag) to the given value (whose value is specified by attribute_value) in the profile that gets generated. For example, the following command sets the -validate attribute to the value resource_name:

crs_profile -validate resource_name

[-a action_script] Specifies the action program for the application. An action program has start, stop, and check entry points that are called by Oracle Clusterware. You can specify either a full path name for the script file or its filename.

[-d description] Specifies the description of the application. Enclose the description in double quotation marks ("") if it contains white space. If you do not specify an application description, then Oracle Clusterware uses the application name.

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-9

Use the following crs_register command syntax to update a registered application:

crs_register resource_name -update [option ...] [-o option,...] [-q]

Example of the crs_register CommandThe following example registers an application named postman for which an application profile exists in the CRS_home/crs/profile directory:

CRS_home/bin/crs_register postman

Related Commands for the crs_register Commandcrs_stat, crs_profile, crs_relocate, crs_start, crs_stop, crs_unregister

crs_relocateThe crs_relocate command relocates applications and application resources as specified by the command options that you use and the entries in your application profile. The specified application or application resource must be registered and running under Oracle Clusterware in the cluster environment before you can relocate

[-h hosting_nodes] Specifies an ordered list of nodes, separated by white space, that can host the application. If there is more than one node, then the list must be enclosed in double quotation marks (""). If you specify the -p option with a placement policy of preferred or unavailable, then you must also specify the -h option.

[-l optional_resources]

Specifies an ordered list of optional resources, separated by white space. If there are multiple optional resources specified, then the list must be enclosed in double quotation marks ("").

[-o option[,...]] Specifies a comma-delimited list of attribute option names and their values. To see the list of attribute option names, see Table D–5, " Options for the -o Flag" on page D-6 or run the crs_profile -help command.

[-p placement_policy]

Specifies the policy according to which Oracle Clusterware selects the node on which to start or restart the application. You can specify balanced, favored, or restricted as placement policies. All policies except for balanced require you to also specify the -h flag.

-q Runs the command in quiet mode (no messages are displayed on the console).

[-r required_resources]

Specifies an ordered list of resources, separated by spaces, on which the application depends. These resources must be active on any node on which the application is running. If there are multiple required resources, then the list must be enclosed in double quotation marks (""). If you do not specify a required resources list, then Oracle Clusterware imposes no required dependencies upon the application resource.

See Also: Table D–4 and Table D–5 for the listing of options and their definitions for using -update; the options for -update are the same as for the crs_profile command

Note: The profile will be in the crs/profile directory if the profile is created by the root user. The profile will be in the crs/public directory if the profile is created by any other user.

Table D–6 (Cont.) Options on the crs_register Command

Option Description

Oracle Clusterware Commands

D-10 Oracle Clusterware Administration and Deployment Guide

it. The command displays a message if you specify a cluster node that is unavailable or if the attempt to relocate failed. Also, see the "Security and Permissions" section on page D-1 for information about the privileges required for this command.

When you issue a crs_relocate command, Oracle Clusterware first runs the stop entry point of the action program on the node on which it is currently running. Oracle Clusterware then performs the start entry point of the action program to start it on a new node.

If Oracle Clusterware fails to stop the application or application resource on the current node due to an action program error, then it marks it as UNKNOWN. You cannot issue the crs_relocate command on a resource in this state. Instead, run a crs_stop -f command on the resource and restart it by issuing he crs_start command to return it to the ONLINE state before you attempt to relocate it again. If Oracle Clusterware fails to restart an application resource, then you may need to check the resource action program.

If the action program start entry point fails to run successfully, then the stop entry point is run. If the stop entry point fails to run successfully, then the state is marked as UNKNOWN and relocation attempts are stopped. If the stop entry point succeeds, then the state is set to OFFLINE. The target state remains ONLINE however, so subsequent cluster node failures or restarts can cause Oracle Clusterware to attempt to restart the application. If you have not specified the node to which to relocate it and if there are available cluster nodes that satisfy the placement criteria, then Oracle Clusterware attempts to start the application on one of these available nodes.

If one or more user-defined attributes have been defined for application resources, then you can specify values for these attributes when relocating an application with the crs_relocate command. The specified value is passed to the action program as an environment variable with the attribute name.

The actions that Oracle Clusterware takes while relocating an application resource are echoed on the command line. You can also monitor them using the Event Manager (EVM). Standard error and standard output from a resource action program that is started by the crs_relocate command are redirected to the standard error and standard output for the crs_relocate command. Note that if the Oracle Clusterware daemon starts an application, then standard error and standard output of the action program is lost. In an action program, you can check for user invocation of the action program using reason codes.

Syntax and Options for the crs_relocate CommandUse the crs_relocate command with the following syntax:

crs_relocate resource_name [...] [-c cluster_node] [-f] [-q]

crs_relocate resource_name [-c cluster_node] [-q]

crs_relocate [USR_attribute_name=value] [...] resource_name [-c cluster_node] [-q]

crs_relocate -s source_node [-c cluster_node] [-q]

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-11

Example of the crs_relocate CommandThe following example relocates an application resource to the node known as rac1:

crs_relocate postman -c rac1Attempting to stop `postman` on node `rac2`Stop of `postman` on node `rac1` succeededAttempting to start `postman` on node `rac1`Start of `postman` on node `rac1` succeeded

The following example attempts to relocate all application resources from node rac2 to node rac1:

crs_relocate -s rac2 -c rac1Attempting to stop `postman` on node `rac2`Stop of `postman` on node `rac2` succeeded.Attempting to start `postman` on node `rac1`Start of `postman` on node `rac1` succeeded.Attempting to stop `calc` on node `rac2`Stop of `calc` on node `rac2` succeeded.Attempting to start `calc` on node `rac1`Start of `calc` on node `rac1` succeeded.

If a user-defined attribute USR_DEBUG has been defined, then the following example runs the stop and start entry point of the action program with the USR_DEBUG environment variable set to FALSE. This overrides any value set in the application profile. In the corresponding action program, if you add the following line to the appropriate section of the action program, then you can view the value:

echo $USR_DEBUG

Then issue the following command:

# crs_relocate USR_DEBUG=false database

Table D–7 Options on the crs_relocate Command

Option Description

-c cluster_node Relocates each indicated application or application resource to the specified node regardless of its placement policy. If required resources are not available on the destination node or if the application resource is restricted from that node, then the crs_relocate command fails and the application or application resource remains on the current node.

-f Forces relocation of the specified applications, all applications dependent on them and all applications that they are dependent upon. This option is necessary for relocating any application that requires another application or one that is required by any ONLINE application.

-s source_node Relocates all running applications or application resources from the source_node. If you do not also specify the -c option, then the crs_relocate command relocates each resource according to its placement policy and required resource lists.

-t Lists information for all resources in a tabular form.

-q Runs the command in quiet mode (no messages are displayed on the console).

[USR_attribute_name=value]

When starting the resource, this option sets the named attribute to the given value in the CRS resource's environment.

Oracle Clusterware Commands

D-12 Oracle Clusterware Administration and Deployment Guide

crs_setpermModifies the permissions associated with a resource. This command is similar to the chmod command for Linux and UNIX systems or the Windows desktop options, in this order: File, Properties, Security, and Permissions.

Syntax and Options for the crs_setperm CommandUse the crs_setperm command with the following syntax:

crs_setperm resource_name -u aclstring [-q]

crs_setperm resource_name -x aclstring [-q]

crs_setperm resource_name -o user_name [-q]

crs_setperm resource_name -g group_name [-q]

Example of the crs_setperm CommandThe following example modifies the permissions on the admin1 user for the postman application:

crs_setperm postman -u user:admin1:r-x

crs_statThe crs_stat command provides status information for resources on the cluster nodes. To query resources with the crs_stat command, the CRS resource permissions must have read and execute permissions (r and x permissions on Linux and UNIX systems). An exception is the -g option, that anyone can use to verify whether a resource exists.

Resources are either ONLINE or OFFLINE as shown in the STATE attribute. An application resource in the ONLINE state is running successfully on a cluster node. This cluster node is shown indicating its state.

The TARGET value shows the state to which Oracle Clusterware attempts to set the resource. If the TARGET value is ONLINE and a cluster node fails, then Oracle Clusterware attempts to restart the application on another node if possible. If there is a

See Also: The crs_getperm command on page D-3 to obtain the permissions associated with a resource

Table D–8 Options on the crs_setperm Command

Option Description

-u Updates the ACL string.

-x Deletes the ACL string.

-o Changes the owner of the resource.

Note: Only the root user can change the owner.

-g Changes the primary group of the resource.

-q Runs the command in quiet mode (no messages are displayed on the console).

aclstring Is one of the following:

user:username:rwxgroup:groupname:r-xother::r--

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-13

condition forcing a resource STATE to be OFFLINE, such as a required resource that is OFFLINE, then the TARGET value remains ONLINE and Oracle Clusterware attempts to start the application or application resource once the condition is corrected.

A TARGET value for all non-application resources should be ONLINE unless the resource has a failure count higher than the failure threshold, in which case the TARGET is changed to OFFLINE. Oracle Clusterware then treats the resource as if its STATE were OFFLINE. If the STATE is ONLINE and the TARGET is OFFLINE, then you can reset the target value to ONLINE using the crs_start command.

The verbose status -v gives additional information that may be useful, especially for troubleshooting. The RESTART_COUNT value shows how many times an application resource has been restarted on a single cluster node. The maximum number of restarts before Oracle Clusterware stops restarting the application is equal to RESTART_ATTEMPTS. FAILURE_COUNT shows the number of times that a resource has failed within the FAILURE_INTERVAL defined in the application profile. The maximum number of failures before Oracle Clusterware stops attempting to restart an application is equal to the value set for the FAILURE_THRESHOLD parameter. If a cluster node fails and applications are waiting to be relocated due to the profile FAILOVER_DELAY attribute, then the verbose status also shows the FAILOVER_STATUS field. The FAILOVER_STATUS field is not shown at any other time. The FAILOVER_STATUS field shows the node the application failed on and how much time is left waiting for that node to restart before restarting on another node.

Syntax and Options for the crs_stat CommandUse the crs_stat command with the following syntax:

crs_stat [resource_name [...]] [-v] [-l] [-q] [-c cluster_node]

crs_stat [resource_name [...]] -t [-v] [-q] [-c cluster_node]

crs_stat -p [resource_name [...]] [-q]

crs_stat [-a] resource_name -g

crs_stat [-a] resource_name -r [-c cluster_node]

crs_stat -f [resource_name [...]] [-q] [-c cluster_node]

crs_stat -ls resource_name

To view information about all resources, enter the command with no arguments.

Table D–9 Options on the crs_stat Command

Option Description

-a resource_name Used with the -g or -r option to verify whether the specified resource is registered or running under Oracle Clusterware. Specify the name of the resource as listed in its application profile.

-c cluster_node Displays information about applications or application resources on the specified cluster node.

-f Displays all information about the resource including extended information (-v) and the in-memory profile (-p).

-g Returns 0 (zero) if the specified application or application resource is registered with Oracle Clusterware; returns 1 if it is not. This option only works if a single resource is specified with the -a option.

-l Displays the status in a list (nontabular) format. This is the default format.

Oracle Clusterware Commands

D-14 Oracle Clusterware Administration and Deployment Guide

Examples of the crs_stat CommandThe following example displays the status information for the postman application:

crs_stat postman

NAME=postmanTYPE=applicationTARGET=ONLINESTATE=ONLINE on rac2

The following example displays the status information in a tabular form:

crs_stat -t

Name Type Target State Host----------------------------------------------------------------cluster_lockd application ONLINE ONLINE rac2dhcp application OFFLINE OFFLINE

The following example shows the output for the crs_stat command with the -v option for the node rac2:

crs_stat -v

NAME=cluster_lockdTYPE=applicationRESTART_ATTEMPTS=30RESTART_COUNT=0FAILURE_THRESHOLD=0FAILURE_COUNT=0TARGET=ONLINESTATE=ONLINE on rac2

-ls Lists resources, owners, and permissions for all resources.

-ls resource_name Lists resources, owners, and permissions for a specified resource.

-p resource_name [...]]

Displays the status of the in-memory profile for the specified resources. If no resources are specified, then the status of the in-memory profile for all registered resources is displayed. If a resource is not registered, its status is not displayed.

-q Runs the command in quiet mode (no messages are displayed on the console).

-r Returns 0 (zero) if the specified application or application resource is running under Oracle Clusterware; returns 1 if it is not. This option can only be successful if a single resource is specified with the -a option.

-t Lists information for all resources in a tabular form.

-v Lists how many times a resource has been restarted or has failed within the resource failure interval, the maximum number of times that a resource can be restarted or can fail, and the target state of the application. Also, this option lists normal status information.

This option displays extended information about the status of resources. Extra attributes shown include RESTART_COUNT and FAILURE_COUNT. The RESTART_COUNT attribute shows the number of restarts of the application that have been attempted since it was started. The FAILURE_COUNT attribute shows how many failures have occurred during the last FAILURE_THRESHOLD in seconds. The attribute FAILOVER_STATUS shows the time at which an application resource started while waiting to relocate due to a cluster node failure if the resource has a FAILOVER_DELAY value greater than 0. It is not displayed otherwise.

Table D–9 (Cont.) Options on the crs_stat Command

Option Description

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-15

crs_startThe crs_start command sets an application or application resource target state to ONLINE and attempts to start specified registered applications or application resources. The target state is the state that Oracle Clusterware attempts to achieve. Also, see the "Security and Permissions" section on page D-1 for information about the privileges required for this command.

There are several user-defined attributes available for starting an application with crs_start. The specified value is passed to the action program as an environment variable with the attribute name.

If you do not specify one node on which to start the application and there are available cluster nodes that satisfy the placement criteria, then Oracle Clusterware attempts to start the application on one of the available nodes. You must stop a resource that has failed before restarting it. You should also confirm why your action program is failing.

The crs_start command starts dependent resources as defined in the application profile REQUIRED_RESOURCE fields. After a registered resource is started, you must use the crs_stop command to end its execution.

The crs_start command displays status messages and if there are problems starting an application, then feedback is displayed. Standard error and standard output from a resource action program invoked by the crs_start command are redirected to the standard error and standard output for crs_start. Note that if the CRS daemon starts an application, then standard error and standard output of the action program is lost. Within an action program, you can check for user invocation of the action program using reason codes.

Syntax and Options for the crs_start CommandUse the crs_start command with the following syntax:

crs_start resource_name [...] [-c cluster_node] [-q] [-f]

crs_start -all [-q]

crs_start [USR_attribute_name=value] [...] resource_name [-c node_name] [-q]

For resource_name [...], the names are as specified in an application profile of one or more resources to be started. The resource must be registered with Oracle Clusterware.

Table D–10 Options On the crs_start Command

Option Description

-all Starts all registered Oracle Clusterware applications or application resources on active cluster nodes, according to their placement policies and required resource lists.

-c node_name Starts each indicated resource on the specified node if the cluster node is enabled by the placement policy and resource dependencies. If the cluster node specified is not enabled by the placement policy and resource dependencies, then the crs_start command fails. If the specified node is not available, the command fails. If one of the specified resources fails to start, then the command still attempts to start other resources listed on the command line.

Oracle Clusterware Commands

D-16 Oracle Clusterware Administration and Deployment Guide

Examples of the crs_start CommandThe following example starts an application named postman:

crs_start postmanAttempting to start `postman` on node `rac1`Start of `postman` on node `rac1` succeeded.

The following example starts the application on a specific cluster node rac2:

crs_start -c rac2 postmanAttempting to start `postman` on node `rac2`Start of `postman` on node `rac2` succeeded.

crs_stopStops an application on the specified node. You can run the crs_stat command to verify that the application you specify is stopped.

Syntax and Options for the crs_stop CommandUse the crs_stop command with the following syntax:

crs_stop resource_name [...] [-f] [-q]

crs_stop -c cluster_node [...] [-q]

crs_stop -all [-q]

crs_stop [USR_attribute_name=value] [...] resource_name [-q]

-c cluster_node [...]

Specifies one or more active cluster nodes on which to stop all of the applications or application resources. You can use the options described in Table D–11:

-f Forces starting of a specified application or application resource if all of the required resources are available or can be started. Any applications or application resources in the specified application profile's required resource list are started if not currently running or are relocated if they are running on another cluster node. Use this option with only one specified application resource at a time.

-q Runs the command in quiet mode (no messages are displayed on the console).

[USR_attribute_name=value]

Table D–10 (Cont.) Options On the crs_start Command

Option Description

Oracle Clusterware Commands

High Availability Oracle Clusterware Command-Line Reference and C API D-17

Examples of the crs_stop CommandTo stop an application named dtcalc, enter the following command:

crs_stop dtcalc

Status Messages for the crs_stop CommandOracle Clusterware displays the following status messages:

Attempting to stop `dtcalc` on node `rac2`Stop of `dtcalc` on node `rac2` succeeded.

crs_unregisterThe crs_unregister command removes the registration information of Oracle Clusterware resources from the binary Oracle Clusterware registry database. Oracle Clusterware will no longer acknowledge this resource. An application associated with a resource that is unregistered is no longer highly available. Also, see the "Security and Permissions" section on page D-1 for information about the privileges required for this command.

Upon successful completion of the crs_unregister command, the resource is removed from the online Oracle Clusterware environment. You cannot unregister a resource that is a required resource for another resource. You must stop the resource by using the crs_stop command before unregistering it.

Syntax and Options for the crs_unregister CommandUse the crs_unregister command with the following syntax:

crs_unregister resource_name [...] [-q]

The -q option runs the command in quiet mode (no messages are displayed on the console).

Example of the crs_unregister CommandThe following example unregisters a highly available application called postman:

crs_unregister postman

Table D–11 Options on the crs_stop Command

Option Description

-all Stops all registered Oracle Clusterware applications or application resources on active cluster nodes.

-c cluster_node Stops applications or application resources on the specified cluster node.

-f Forces termination of the indicated applications or application resources and all application resources dependent on the specified resources. This option is useful to stop all application resources when the crs_stop command fails because of application resources that are dependent on the specified resource or if the application is in the UNKNOWN state because of an unsuccessful stop. This option makes the crs_stop command ignore any errors that are reported by an action program.

-q Runs the command in quiet mode (no messages are displayed on the console).

C Application Programming Interface to Oracle Clusterware

D-18 Oracle Clusterware Administration and Deployment Guide

C Application Programming Interface to Oracle ClusterwareThe APIs described in this section provide access to operational control of resources that Oracle Clusterware manages. The API is used to register user applications to the Oracle Clusterware system so that they can be managed by Oracle Clusterware and made highly available. When an application is registered, the application can be started and its state queried. If the application is no longer to be run, it can be stopped and unregistered from Oracle Clusterware. The Oracle Clusterware services are provided by another process, such as the CRSD process, running outside the database server. These APIs communicate with the CRSD process using an IPC mechanism.

clscrs_init_crsInitializes a context for communications with Oracle Clusterware.

Parameters■ ctx—Context to initialize, caller allocated storage

■ errf—A Callback function for the error info, or NULL

■ errCtx—Context for the error callback func, or NULL

■ flags—CLSCRS_FLAG_TRACE, to enable tracing of lower layers

■ CLSCRS_FLAG_USETHREADS—Enables using the clscrs context in multiple threads

ReturnsStatus enum.

SyntaxCLSCRS_STAT clscrs_init_crs(clscrs_ctx **ctx, clscrs_msgf errf, void *errCtx, ub4 flags);

clscrs_term_crsReleases a context for communications with Oracle Clusterware.

Parameterctx—Context previously initialized with clscrs_init_crs.

Returnsstatus enum

SyntaxCLSCRS_STAT clscrs_term_crs( clscrs_ctx **ctx);

Note: You can install the Oracle Clusterware high availability Application Programming Interface (API) from the Oracle Database client installation media.

C Application Programming Interface to Oracle Clusterware

High Availability Oracle Clusterware Command-Line Reference and C API D-19

clscrs_getnodenameReturns the correct node name. If you issue this command within a configured cluster, then this is the name as known to the clusterware. If not in a cluster, then the name is as known through other clusterware means. Getting a node name may require a lot of resources in the first call, especially if CLSCRS was not initialized with a preexisting CSS context.

Parameters ■ ctx—Context previously initialized with clscrs_init

■ node_name—Filled in node name

SyntaxCLSCRS_STAT clscrs_getnodename(clscrs_ctx *ctx, oratext *node_name);

clscrs_env_createCreates a call environment describing the parameters to the Oracle Clusterware calls. The memory is owned by clscrs which is released by a call to clscrs_env_delete.

Parameters■ context—Input context

■ env—Output environment pointer

Returnsstatus enum

SyntaxCLSCRS_STAT clscrs_env_create( clscrs_ctx *ctx, clscrs_env *env );

clscrs_env_setSets an attribute or value argument in an environment for the Oracle Clusterware calls; the value may be NULL.

Parameters■ env—Previously created environment

■ attr—Attribute name

■ val—Attribute value, may be NULL pointer

Returnsenvironment status enum

SyntaxCLSCRS_ENVCODE clscrs_env_set(clscrs_env env, const oratext *attr, const oratext *val);

C Application Programming Interface to Oracle Clusterware

D-20 Oracle Clusterware Administration and Deployment Guide

clscrs_env_deleteDeletes an environment for the Oracle Clusterware calls. The environment must be created again before use. You cannot delete an environment and then set new parameters.

Parameters■ env—The environment whose contents are deleted

Returnsenvironment status enum.

SyntaxCLSCRS_ENVCODE clscrs_env_delete(clscrs_env env);

clscrs_env_formatCreates a callback with formatted environment lines.

Parameters■ env—Env to dump

■ msgf—Callback function

■ msgp—Argument to msgp, uninterpreted

ReturnsNone.

Syntaxvoid clscrs_env_format( clscrs_env env, clscrs_msgf msgf, void *msgp );

clscrs_start_resourceDirect Oracle Clusterware to start a named set of resources. If a host is specified, then start there, otherwise start according to the placement policy for the resources in question. If the flag is async, then msgf is never called and Oracle returns the OK status after initiating the call to Oracle Clusterware, and the actual starts are performed asynchronously, and msgf will never be called. If flag is not async and msgf is not NULL, then msgf will be driven one line at a time with collected output from the start programs.

Parameters■ ctx—The context

■ names—The resource(s) to start

■ num_names—The number of resources in names[] to start

■ host—The host name on which to start the resource, may be NULL

■ env—Environment arguments to start

■ msgf—User message callback, may be NULL

■ msgp—User callback argument, may be NULL

C Application Programming Interface to Oracle Clusterware

High Availability Oracle Clusterware Command-Line Reference and C API D-21

■ msgf—Callback function

■ msgf—Callback function

Returnsstatus enum

SyntaxCLSCRS_STAT clscrs_start_resource( clscrs_ctx *ctx, const oratext *name[], sword num_names, const oratext *host, clscrs_env env, clscrs_msgf msgf, void *msgp, uword flag );

clscrs_stop_resourceDirects Oracle Clusterware to stop a named set of resources. The flag enables async stop.

Parameters■ ctx— the context

■ names—The resource(s) to stop

■ num_names—Number of resources in names[] to stop

■ flag—Async, force options

Returnsstatus enum

Syntaxclscrs_stop_resource( clscrs_ctx *ctx, const oratext *names[], sword num_names,clscrs_env env, clscrs_msgf msgf, void *msgarg, uword flag );

clscrs_check_resourceDirect Oracle Clusterware to run the check actions for the identified resources. A delay can be specified to indicate a time to wait in seconds before the check action is performed on the server. This occurs after the call returns. This can be used to let something run before performing the check. For each resource identified by the in_splist, a status will be returned in the op_status, indicating whether the resource exists and whether the caller has permission to invoke operations on the resource. A resource that is explicitly named with no pattern matching will have a NOTFOUND error. A pattern that has no matches will return no error. There will be no valid attributes returned for the resources. There is no way of knowing exactly when the checks will actually be run or be completed. Returns SUCCESS if the operation status of all of the identified resources is SUCCESS, otherwise FAILURE and specific status will be found in the op_status entries. The caller must create the op_status list before the call and destroy it when done.

Parameters■ in_splist [in]—List of resources to check

■ delay_secs [in]—Time to wait before the check is run, as appropriate to the resource

■ flags [in]—None

C Application Programming Interface to Oracle Clusterware

D-22 Oracle Clusterware Administration and Deployment Guide

■ op_status [out]—Resource list structure holding the status of the check operation for each resource

Returnsstatus enum.

Syntaxclscrs_check_resource( clscrs_splist *in_splist, ub4 delay_seconds, uword flags, clscrs_reslist *op_status);

clscrs_register_resourceRegister the resources in the input resource list. The attributes for the resources are encapsulated in the input resource list. The output op_status list contains the results of the register operation for each resource and contains no valid resource attributes.

The caller must create and populate the in_reslist and must create the op_status list. Both the lists must be destroyed by the caller. The op_status list cannot be reused with another API call, you must create and destroy it for each API call.

One or more attributes of an already registered resource can be modified by passing the CLSCRS_FLAG_REG_UPDATE flag. The flags apply to all resources in the input reslist.

Parameters ■ in_reslist [in]—List of resources to be registered

■ flags [in]—CLSCRS_FLAG_REG_UPDATE or 0

■ op_status [out]—Resource list holding the status of the register operation for each resource

Returns■ CLSCRS_STAT_SUCCESS—If all input resources are successfully registered

■ CLSCRS_STAT_FAIL—If atleast one resource could not be registered

■ CLSCRS_STAT_CONNECTION—If there is a communication error with the crsd process

Syntaxclscrs_register_resource(clscrs_reslist *in_reslist, uword flags, clscrs_reslist *op_status);

clscrs_unregister_resourceUnregisters the resources in the input list of resource names. The output op_status list contains the results of the unregister operation for each resource. The caller must create and populate the rqlist and must create the op_status list. Both the lists must be destroyed by the caller. The op_status list cannot be reused with another API call. You must create and destroy it for each API call.

Parameters■ rqlist [in]—List of resources names to be unregistered

■ flags [in]—Option flags

C Application Programming Interface to Oracle Clusterware

High Availability Oracle Clusterware Command-Line Reference and C API D-23

■ op_status [out]—Resource list holding the status of the unregister operation for each resource

Returns■ CLSCRS_STAT_SUCCESS—If all input resources are successfully unregistered

■ CLSCRS_STAT_FAIL—If atleast one resource could not be unregistered

■ CLSCRS_STAT_CONNECTION—If there is a communication error with the crsd process

Syntaxclscrs_unregister_resource(clscrs_splist *rqlist, uword flags, clscrs_reslist *op_status);

clscrs_statObtains information about the resources identified in rqlist. The resources can be restricted to a specific node given in the nodename parameter. The information is returned in the output out_reslist.

If the rqlist is NULL, information about all registered resources is returned. If the node name is not NULL, the result is restricted to resources running on the specific node. Otherwise, all nodes in the cluster are considered.

The res_flags are used to scope the resource domain that is matched. There are no values for this parameter. The res_attrs identifies the attributes to be returned in the out_reslist. The caller may create a specific list of attributes of interest or use a predefined list returned by one of the macros CLSCRS_ATTRS_NONE, CLSCRS_ATTRS_ALL, CLSCRS_ATTRS_STATE, CLSCRS_ATTRS_SHORT. The attribute list applies to all resources identified by the rqlist. Any errors for a resource are returned in the output out_reslist.

Parameters■ rqlist [in]—List of resource names to query

■ nodename [in]—Query resources running only on the given node (optional, may be NULL)

■ res_attrs [in]—List to specify the attributes to be returned for each resource

■ res_flags [in]—Flags to restrict the resources being matched

■ out_reslist [out]—List holding the returned resource information

Returns■ CLSCRS_STAT_SUCCESS if information is obtained for all input resources

■ CLSCRS_STAT_FAIL if no information is available for atleast one resource

■ CLSCRS_STAT_CONNECTION if there is an error communicating with the crsd process

Syntaxclscrs_stat(clscrs_splist *rqlist, oratext *nodename, uword res_flags, clscrs_splist *res_attrs, clscrs_reslist *out_reslist);

Functions for Managing Resource Structures

D-24 Oracle Clusterware Administration and Deployment Guide

Functions for Managing Resource StructuresThis section describes functions for creating, managing, and destroying resources and the attributes associated with resources. The following structures are used in this section:

■ clscrs_sp—This is a stringpair (sp) structure. It contains a name and a value string. The value may be NULL. It is created and destroyed and its contents can be examined and the value replaced. An sp may be a member of exactly one stringpair list (splist).

■ clscrs_splist—This stringpair list (splist) is a list of zero or more stringpairs used in various contexts. Sometimes the list contains names and values, sometimes the list contains only names. The list is created and destroyed; members can be retrieved by name, added, deleted, replaced, and the list can be traversed with first and next operations.

■ clscrs_res—This represents a resource abstraction that contains the name of a resource and additional data about the context in which it is used. Sometimes it contains resource attribute data and other times it contains status and return message about an operation. A resource may be in exactly one resource list.

■ clscrs_reslist—This resource list is an abstraction used to contain information about zero or more resources. A resource list is created, appended, iterated over, and destroyed.

Export OperationsThis section describes the following export operations:

■ Stringpair Operations

■ Splist Operations

■ Resource Operations

■ Resource List Operations

Stringpair OperationsThis section describes the stringpair operations.

clscrs_sp_set Changes the value part of a stringpair. The new value may be NULL. After the call returns, the memory of the value can be recycled.

Parameters

■ sp [in]—Stringpair for which you want to set a value

■ value [in]—Value component of the stringpair (may be NULL)

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the sp argument is NULL

Syntax

clscrsret clscrs_sp_set(clscrs_sp *sp, const oratext *value);

clscrs_sp_get Gets the name and value components of a stringpair.

Parameters

Functions for Managing Resource Structures

High Availability Oracle Clusterware Command-Line Reference and C API D-25

■ sp [in]—Stringpair for which the name and value are being obtained

■ name [out]—Name component of the stringpair

■ value [out]—Value component of the stringpair (may be NULL)

Returns

■ clscrsretSUCC on success

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretBADARG if the sp argument is NULL

Syntax

clscrsret clscrs_sp_get(clscrs_sp *sp, oratext **name, oratext **value);

clscrs_sp_get_value Gets the value component of a stringpair.

Parameters

■ sp [in]—Stringpair for which the name and value are being obtained

■ value [out]—Value component of the stringpair (may be NULL)

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the sp argument is NULL

Syntax

clscrsret clscrs_sp_get_value(clscrs_sp *sp, oratext **value);

Splist OperationsThis section describes the splist operations.

clscrs_splist_create Creates a new stringpair list. The memory for the splist is allocated by the function.

Parameters

■ ctx [in]—clscrs context

■ splist [out]—The new stringpair list created

Returns

■ clscrsretSUCC on success

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretBADCTX if the context is NULL

Syntax

clscrsret clscrs_splist_create(clscrs_ctx *ctx, clscrs_splist **splist);

clscrs_splist_create_and_set Creates a new stringpair list (splist) and set the name and value for the first stringpair in the list. The memory for the splist is allocated by the function.

Parameters

■ ctx [in]—clscrs context

■ name [in]—Name component of the stringpair

Functions for Managing Resource Structures

D-26 Oracle Clusterware Administration and Deployment Guide

■ value [in]—Value component of the stringpair (may be NULL)

■ sp [out]—The new stringpair created

Returns

■ clscrsretSUCC on success

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretBADCTX if the context is NULL

■ clscrsretBADARG if the name argument is NULL

Syntax

clscrsret clscrs_splist_create_and_set(clscrs_ctx *ctx, const oratext *name, const oratext *value, clscrs_splist **splist);

clscrs_splist_append Adds a new stringpair (sp) to a stringpair list (splist).

Parameters

■ splist [in]—splist to add the new {name, value} stringpair.

■ name [in]—Name component of the stringpair

■ value [in]—Value component of the stringpair (may be NULL)

Returns

■ clscrsretSUCC on success

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretBADARG if the name argument is NULL

Syntax

clscrsret clscrs_splist_append(clscrs_splist *splist, const oratext *name, const oratext *value);

clscrs_splist_first Gets the first stringpair (sp) from a stringpair list (splist).

Parameters

■ name [in]—splist to get the first sp from

■ sp [out]—The first sp from the given splist

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the splist is NULL

■ clscrsretEMPTY if there are no stringpair elements in the splist

Syntax

clscrsret clscrs_splist_first(clscrs_splist *splist, clscrs_sp **sp);

clscrs_splist_next Gets the current next stringpair (sp) from a stringpair list (splist). This function is called to iterate over the stringpairs in a splist.

Parameters

■ name [in]—splist to get the next sp from

■ sp [out]—The next sp {name, value} in the given splist

Functions for Managing Resource Structures

High Availability Oracle Clusterware Command-Line Reference and C API D-27

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the splist is NULL

■ clscrsretENDLIST if there are no more stringpair elements in the splist

Syntax

clscrsret clscrs_splist_next(clscrs_splist *splist, clscrs_sp **sp);

clscrs_splist_replace Replaces the value for a stringpair (sp) in a stringpair list (splist).

Parameters

■ splist [in]—Splist to add the new {name, value} stringpair.

■ name [in]—Name for which the value is being replaced.

■ value [in]—Replacement value for the name (may be NULL)

Returns

■ clscrsretSUCC on successclscrsretBADARG if the name argument is NULL

■ clscrsretBADARG if the name argument is NULL

■ clscrsretBADARG if the name argument is NULL

Syntax

clscrsret clscrs_splist_replace(clscrs_splist *splist, const oratext *name, const oratext *value);

clscrs_splist_delete_sp Deletes a stringpair (sp) from a stringpair list (splist).

Parameters

■ splist [in]—Splist from which to delete the {name, value} stringpair.

■ splist [in]—Name to be deleted from the given splist.

Returns

■ clscrsretSUCC on success

■ clscrsretNONAME if there is no stringpair matching the given name

■ clscrsretBADARG if the name argument is NULL

Syntax

clscrsret clscrs_splist_delete_sp(clscrs_splist *splist, const oratext *name);

clscrs_splist_find Finds the value for a stringpair (sp) in a stringpair list (splist).

Parameters

■ splist [in]—Splist to look into

■ name [in]—Name for which the value is looked up

■ value [out]—Value associated with the given name in the given splist

Returns

■ clscrsretSUCC on success

■ clscrsretNONAME if there is no stringpair matching the given name

Functions for Managing Resource Structures

D-28 Oracle Clusterware Administration and Deployment Guide

■ clscrsretBADARG if the name argument is NULL

Syntax

clscrsret clscrs_splist_find(clscrs_splist *splist, const oratext *name, oratext **value);

clscrs_splist_count Counts the number of stringpairs (sp) in a stringpair list (splist).

Parameters

■ splist [in]—Splist to count the number of stringpairs.

■ count [out]—The number of stringpairs in the given splist.

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the splist is NULL

Syntax

clscrsret clscrs_splist_count(clscrs_splist *splist, ub4 *count);

clscrs_splist_destroy Frees the memory for a stringpair list (splist).

Parameter

■ splist [in]—Splist to destroy

Returns

■ clscrsretSUCC on success

Syntax

clscrsret clscrs_splist_destroy(clscrs_splist **splist);

clscrs_res_create Creates a new resource. The memory for the resource structure is allocated by the function. The memory is freed when a resource list (clscrs_reslist) is destroyed through clscrs_reslist_destroy().

Parameters

■ ctx [in]—clscrs context

■ resname [in]—Name of the resource

■ res [out]—The new resource created

Returns

■ clscrsretSUCC on success

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretBADCTX if the context is NULL

■ clscrsretBADARG if the resource name is NULL

Syntax

clscrsret clscrs_res_create(clscrs_ctx *ctx, const oratext *resname, clscrs_res **res);

clscrs_res_get_name Gets the name of a resource.

Parameters

Functions for Managing Resource Structures

High Availability Oracle Clusterware Command-Line Reference and C API D-29

■ res [in]—Resource for which the name is obtained

■ name [out]—Name of the resource

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the resource argument is NULL

Syntax

clscrsret clscrs_res_get_name(clscrs_res *res, oratext **name);

clscrs_res_set_attr Sets a resource attribute for a resource.

Paramters

■ res [in]—Resource for which the attribute is set

■ attrname [in]—Name of the resource attribute

■ value [in]—Value for the resource attribute (may be NULL)

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the attribute name is NULL

■ clscrsretNOMEM if no memory can be allocated

Syntax

clscrsret clscrs_res_set_attr(clscrs_res *res, const oratext *attrname, const oratext *value);

clscrs_res_get_attr Gets a resource attribute for a resource.

Parameters

■ res [in]—Resource for which the attribute is obtained

■ attrname [in]—Name of the resource attribute

■ value [out]—Value for the resource attribute

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the attribute name is NULL

■ clscrsretNOMEM if no memory can be allocated

Syntax

clscrsret clscrs_res_get_attr(clscrs_res *res, const oratext *attrname, oratext **value);

clscrs_res_get_attr_list Gets the attribute list for a resource. The attribute list is a list of stringpairs. The client does not allocate the memory for attribute list.

Paramters

■ res [in]—Resource for which the attribute list

■ attrlist [out]—List of attributes for the given resource

Returns

Functions for Managing Resource Structures

D-30 Oracle Clusterware Administration and Deployment Guide

■ clscrsretSUCC on success

■ clscrsretBADARG if the resource is NULL

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretNOATTRS if there are no attributes set for the resource

Syntax

clscrsret clscrs_res_get_attr_list(clscrs_res *res, clscrs_splist **attrlist);

clscrs_res_set_attr_list Sets the attribute list for a resource. The attribute list is a list of stringpairs. The list is created from the clscrs_splist_create call.

Parameters

■ res [in]—Resource for which the attribute list is set

■ attrlist [in]—List of attributes to be set for the given resource

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the resource is NULL

Syntax

clscrsret clscrs_res_set_attr_list(clscrs_res *res, clscrs_splist *attrlist);

clscrs_res_attr_count Gets the number of attributes for a resource.

Parameters

■ res [in]—Resource for which number of attributes is obtained

■ count [out]—Number of attributes for the given resource

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the resource is NULL

Syntax

clscrsret clscrs_res_attr_count(clscrs_res *res, ub4 *count);

clscrs_res_get_op_status Gets the status of an operation for a resource. The memory for the msg is allocated by the function.

Parameters

■ res [in]—Resource for which the operation status is obtained

■ status [out]—Status of an operation on the given resource

■ msg [out]—Text message for the result of an operation on the resource

Returns

■ clscrsretSUCC on success

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretNOMSG if there is no msg available

■ clscrsretBADARG if the resource is NULL

Syntax

Functions for Managing Resource Structures

High Availability Oracle Clusterware Command-Line Reference and C API D-31

clscrsret clscrs_res_get_op_status(clscrs_res *res, CLSCRS_STAT *status, oratext **msg);

clscrs_res_get_registered Gets the registration status of a resource.

Parameters

■ res [in]—Resource for which the operation status is set

■ registered [out]—Boolean indicating whether the resource is registered

Returns

■ clscrsretSUCC on success

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretBADARG if the resource is NULL

Syntax

clscrsret clscrs_res_get_registered(clscrs_res *res, boolean *registered);

clscrs_res_get_node_list Gets the nodelist currently hosting the resource, or NULL if there is no host for the resource or there are no attributes. The caller need not allocate memory for the nodelist.

Parameters

■ res [in]—Resource for which the nodelist is obtained

■ nodelist [out]—Splist holding the node(s)

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if the resource is NULL

Syntax

clscrsret clscrs_res_get_node_list(clscrs_res *res, clscrs_splist **nodelist);

clscrs_res_destroy Frees the memory for a resource.

Parameter

■ res [in]—Resource for which the memory is freed

Returns

■ clscrsretSUCC on success

Syntax

clscrsret clscrs_res_destroy(clscrs_res **res);

clscrs_reslist_create Creates a new resource list. The memory for the resource list is allocated by the function.

Parameters

■ ctx [in]—clscrs context

■ reslist [out]—Resource list (empty) that is created

Returns

■ clscrsretSUCC on success

Functions for Managing Resource Structures

D-32 Oracle Clusterware Administration and Deployment Guide

■ clscrsretNOMEM if no memory can be allocated

■ clscrsretBADCTX if the context is NULL

Syntax

clscrsret clscrs_reslist_create(clscrs_ctx *ctx, clscrs_reslist **reslist);

clscrs_reslist_append Adds a resource to a resource list.

Parameters

■ reslist [in]—Resource list to add the resource to

■ res [in]—Resource to add

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if reslist is NULL

■ clscrsretRESEXISTS if the resource already exists in reslist

Syntax

clscrsret clscrs_reslist_append(clscrs_reslist *reslist, clscrs_res *res);

clscrs_reslist_first Gets the first resource on a resource list.

Parameters

■ reslist [in]—Resource list for which the first resource is obtained

■ res [out]—The first resouce on the resource list

Returns

■ clscrsretSUCC on success.

■ clscrsretBADARG if reslist is NULL

■ clscrsretEMPTY if there are no resources in the list

Syntax

clscrsret clscrs_reslist_first(clscrs_reslist *reslist, clscrs_res **res);

clscrs_reslist_next Gets the current next resource from the resource list. This function is called to iterate over the resources in a resource list.

Parameters

■ reslist [in]—Resource list for which the first resource is obtained

■ res [out]—The next resouce on the resource list

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if reslist is NULL

■ clscrsretENDLIST if there are no more resources in the list

Syntax

clscrsret clscrs_reslist_next(clscrs_reslist *reslist, clscrs_res **res);

clscrs_reslist_find Finds a resource in a resource list.

Functions for Managing Resource Structures

High Availability Oracle Clusterware Command-Line Reference and C API D-33

Parameters

■ reslist [in]—Resource list (empty) that is created

■ name [in]—Name of the resource that is being obtained

■ res [out]—The resouce corresponding to the given name

Returns

■ clscrsretSUCC on success

■ clscrsretNORES if the resource is not found

■ clscrsretBADARG if reslist or name is NULL

Syntax

clscrsret clscrs_reslist_find(clscrs_reslist *reslist, const oratext *name, clscrs_res **res);

clscrs_reslist_count Counts the number of resources in a resource list.

Parameters

■ reslist [in]—Resource list for which the count is obtained

■ count [out]—Number of resources in the resource list

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if reslist is NULL

Syntax

clscrsret clscrs_reslist_count(clscrs_reslist *reslist, ub4 *count);

clscrs_reslist_delete_res Deletes a resource from a resource list.

Parameters

■ reslist [in]—Resource list from which to delete the resource

■ name [in]—Name of the resource to delete

Returns

■ clscrsretSUCC on success

■ clscrsretBADARG if reslist or name is NULL

■ clscrsretNORES if the resource is not in reslist

Syntax

clscrsret clscrs_reslist_delete_res(clscrs_reslist *reslist,

clscrs_reslist_destroy Frees the memory for a resource list.

Parameters

■ reslist [in]—Resource list for which the memory is to be freed

Returns

■ clscrsretSUCC on success

Syntax

clscrsret clscrs_reslist_destroy(clscrs_reslist **reslist);

Functions for Managing Resource Structures

D-34 Oracle Clusterware Administration and Deployment Guide

clscrs_get_error_message Retrieves the error message corresponding to the return codes from a clscrs API. The memory for the error message is allocated by the caller. If the buffer is not large enough, the length is returned in msg_len.

Parameters

■ ctx [in]—clscrs context

■ err_code [in]—Error code returned from the clscrs API

■ msg [out]—Message corresponding to err_code

■ msg_len [inout]—Length of the message buffer.

Returns

■ clscrsretSUCC on success

Syntax

clscrsret clscrs_get_error_message(clscrs_ctx *ctx, clscrsret err_code, oratext *msg, sb4 msg_len);

clscrs_get_fixed_attrlist Gets the list of attributes corresponding to an attribute group identifier.

Parameters

■ ctx [in]—clscrs context

■ attrgrp [in]—Attribute group that identifies a group of attributes

■ attrlist [out]—List of attributes returned that corresponds to the attribute group

Returns

■ clscrsretSUCC on success

Syntax

clscrs_splist* clscrs_get_fixed_attrlist(clscrs_ctx *ctx, clscrs_attr_grp attrgrp);

Resource OperationsThis section describes resource functions. The clscrs_res resource abstraction contains the name of a resource and additional data appropriate to the context in which it is used. Sometimes it carries status and error return information about an operation. Other times it contains attribute data as input to an operation. A resource may be in exactly one resource list. If so, its successor may be found with the NEXT operation.

CLSCRSRET clscrs_res_create(clscrs_cts *ctx, const oratext *resname, clscrs_res **res)Creates a single resource and fills in a handle to it. The resname must be provided, and cannot be NULL.

CLSCRSRET clscrs_res_get_name (clscrs_res *res, oratext **name);Returns a pointer to the name of a resource. The returned name pointer is only valid as long as the resource exists.

Functions for Managing Resource Structures

High Availability Oracle Clusterware Command-Line Reference and C API D-35

CLSCRSRET clscrs_res_get_op_status (clscrs_res *res, CLSCRS_STAT *stat, oratext **msg)If there is a valid operation error value in the resource, fills in the stat and the pointer to an error message and returns SUCCESS. May fill in a NULL for the msg. If there is no valid operation status, returns INVALID.

clscrs_splist *clscrs_res_get_node_list(clscrs_res *res);Returns an splist holding the nodes currently hosting the resource, or NULL if there is no host for the resource or there are no attributes. The count of the list may obtained and the list iterated. The list is owned by the resource and will be destroyed when the resource is destroyed. This operation is a special case, interpreting the semantics of the attribute or attributes that may hold the current hosting member list. There is no specified ordering of the list.

CLSCRSRET clscrs_res_get_attr(clscrs_res *res, const oratext *attrname, oratext **value);Fills in a pointer to the value of the attribute with the given name and returns SUCCESS if found, or FAILURE if the name is not present in the attribute set of the resource, and INVALID if there is no attribute list in the resource. May assert if given an invalid resource handle.

CLSCRSRET clscrs_res_set_attr(clscrs_res *res, oratext *attrname, oratext *value);Sets the value of the attribute with the given name and returns SUCCESS or FAILURE if the name is not present in the attribute set of the resource. If the attribute already exists, then its current value will be replaced. On return, the memory for the name and value may be recycled.

CLSCRSRET clscrs_res_attr_count(clscrs_res *res, ub4 *count);Fills in the number of attributes that will be returned with a scan using an attrIter. Returns SUCCESS if there are attributes. If there are no attributes, returns INVALID, but still sets the count to zero.

CLSCRSRET clscrs_res_get_attr_list(clscrs_res *res, clscrs_splist **attrlist);Returns an splist for the attributes of the resource, allowing next() operations for a scan. The list is owned by the resource, which will destroy it when the resource is destroyed. Returns SUCCESS if there is an attribute list, or INVALID if there is not one in the resource. There is no specified ordering of the list.

Resource List OperationsThis section describes the resource list operations. The clscrs_reslist resource list is an abstraction that contains information about zero or more resources. A list is created, appended, iterated over, and destroyed.

CLSCRSRET clscrs_reslist_create(clscrs_ctx *ctx, clscrs_reslist **reslist)Creates a resource list and fills a handle.

CLSCRSRET clscrs_reslist_count(clscrs_reslist *reslist, ub4 *count)Fills a count of the number of resources in a list.

CLSCRSRET clscrs_reslist_first(clscrs_reslist *reslist, clscrs_res **first)Fills a handle to the first resource in the list of resources. If the list is empty, fill in NULL.

Functions for Managing Resource Structures

D-36 Oracle Clusterware Administration and Deployment Guide

CLSCRSRET clscrs_reslist_next(clscrs_reslist *reslist, clscrs_res **next)Fills a handle to the next resource in the list of resources. If the list is empty, fill in NULL.

CLSCRSRET clscrs_reslist_find(clscrs_reslist *reslist, const oratext *name, clscrs_res **res)Finds a resource by name in a reslist, and fill in a handle to the one located. If the resource is not found, fill in NULL. Does an exact match lookup with no pattern matching.

CLSCRS_RES *clscrs_reslist_destroy(clscrs_res *res)Deletes a resource list and all of the resources that it currently contains.

E

Managing the Oracle Cluster Registry E-1

E Managing the Oracle Cluster Registry

This appendix describes the syntax of the Oracle Cluster Registry (OCR) configuration utility, OCRCONFIG. It also explains how to troubleshoot OCR using the OCRDUMP and OCRCHECK utilities. You can also use these utilities on Oracle Local Registry (OLR).

This appendix contains the following topics:

■ About OCRCONFIG

– Overview

– Operational Notes

■ OCRCONFIG Command Reference

■ Troubleshooting and Diagnostic Output

– Oracle Cluster Registry Troubleshooting

– Using the OCRCHECK Utility

– Using the OCRDUMP Utility to View Oracle Cluster Registry Content

– Diagnostic Output for OCRCONFIG

About OCRCONFIG

E-2 Oracle Clusterware Administration and Deployment Guide

About OCRCONFIG

This section contains topics which relate to using the OCRCONFIG utility.

■ Overview

■ Operational Notes

OverviewUse the ocrconfig command to manage OCR. Using this utility you can import, export, add, delete, restore, overwrite, backup, repair, replace, move, upgrade, or downgrade OCR.

Operational Notes

Usage InformationThe ocrconfig command syntax is as follows:

ocrconfig -option

Using Utility HelpTo display the help output for the OCRCONFIG utility:

ocrconfig -help

Privileges and SecurityTo use the OCRCONFIG utility you must be logged into the operating system as a user with administrative privileges.

Log FilesWhen you use the OCRCONFIG utility, a log file called ocrconfig_pid.log is created in the $ORACLE_HOME/log/host_name/client directory. Ensure that you have write privileges in this directory before running the OCRCONFIG utility.

OCRCONFIG Command Reference

Managing the Oracle Cluster Registry E-3

OCRCONFIG Command Reference

This section lists the following OCRCONFIG commands:

■ ocrconfig -add

■ ocrconfig -backuploc

■ ocrconfig -delete

■ ocrconfig -downgrade

■ ocrconfig -export

■ ocrconfig -import

■ ocrconfig -manualbackup

■ ocrconfig -overwrite

■ ocrconfig -repair

■ ocrconfig -replace

■ ocrconfig -restore

■ ocrconfig -showbackup

■ ocrconfig -upgrade

ocrconfig -addUse the ocrconfig -add command to add an OCR device or file.

Syntaxocrconfig -add file_name

Usage Notes■ You must run this command as root.

■ The file_name variable can be a device name, a file name, or the name of an ASM disk group. For example:

– /dev/raw/raw1

– /oradbocfs/crs/data.ocr

– d:\oracle\mirror.ocr

– +newdg

If you specify an ASM disk group, the name of the disk group must be preceded by a plus sign (+).

ExampleTo add an OCR file to the default location in ASM, data:

# ocrconfig -add +data

ocrconfig -backuplocUse the ocrconfig -backuploc command to specify an OCR backup directory location.

OCRCONFIG Command Reference

E-4 Oracle Clusterware Administration and Deployment Guide

Syntaxocrconfig [-local] -backuploc file_name

Usage Notes■ You must run this command as root.

■ Use the -local option to back up OLR.

■ The file_name variable can be a full directory path name that is accessible by all nodes. For example:

– Grid_home/cdata/cluster3/

– d:\cdata\cluster3

■ The default location for generating OCR backups on Linux or UNIX systems is Grid_home/cdata/cluster_name, where cluster_name is the name of your cluster. The Windows default location for generating OCR backups uses the same path structure.

■ The default location for generating OLR backups on Linux or UNIX systems is Grid_home/cdata/host_name, where host_name is the name of the node on which the OLR resides that you want to back up. The Windows default location for generating OLR backups uses the same path structure.

ExampleTo specify an OCR backup location in a directory:

# ocrconfig -backuploc $Grid_home/cdata/cluster3

ocrconfig -deleteUse the ocrconfig -delete command to remove an OCR device or file.

Syntaxocrconfig -delete file_name

Usage Notes■ You must run this command as root.

■ The file_name variable can be a device name, a file name, or the name of an ASM disk group. For example:

– /dev/raw/raw1

– /oradbocfs/crs/data.ocr

– d:\oracle\mirror.ocr

– +olddg

If you specify an ASM disk group, the name of the disk group must be preceded by a plus sign (+).

ExampleTo remove an OCR location:

# ocrconfig -delete +olddg

OCRCONFIG Command Reference

Managing the Oracle Cluster Registry E-5

ocrconfig -downgradeUse the ocrconfig -downgrade command to downgrade OCR to an earlier specified version.

Syntaxocrconfig -downgrade [-version version_string]

Usage Notes■ You must run this command as root.

ExampleTo downgrade OCR to an earlier version:

# ocrconfig -downgrade -version

ocrconfig -exportUse the ocrconfig -export command to export the contents of OCR to a target file.

Syntaxocrconfig [-local] -export file_name

Usage Notes■ You must run this command as root.

■ Use the -local option to export the contents of OLR.

■ The file_name variable can be a full path name or the name of an ASM disk group that is accessible by all nodes. For example:

– /oradbocfs/crs/data.ocr

– d:\oracle\

ExampleTo export the contents of OCR to a file:

# ocrconfig -export d:\tmp\a

ocrconfig -importUse the ocrconfig -import command to import the contents of a target file into which you exported the contents of OCR back into OCR.

Syntaxocrconfig [-local] -import file_name

Usage Notes■ You must run this command as root.

■ Use the -local option to import the contents of OLR from a file.

■ The file_name variable can be a full path name or the name of an ASM disk group that is accessible by all nodes. For example:

– /oradbocfs/crs/data.ocr

– d:\oracle\

OCRCONFIG Command Reference

E-6 Oracle Clusterware Administration and Deployment Guide

– +backupdg

If you specify an ASM disk group, the name of the disk group must be preceded by a plus sign (+).

■ You must shut down Oracle Clusterware before running this command.

ExampleTo import the contents a file back into OCR:

# ocrconfig -import d:\tmp\a

ocrconfig -manualbackupUse the ocrconfig -manualbackup command to back up OCR on demand in the location you specify with the -backuploc option.

Syntaxocrconfig [-local] -manualbackup

Usage Notes■ You must run this command as root.

■ Use the -local option to perform a manual backup of OLR.

ExampleTo back up OCR:

# ocrconfig -manualbackup

ocrconfig -overwriteUse the ocrconfig -overwrite command to overwrite an OCR configuration in the OCR metadata with the current OCR configuration information that is found on the node from which you run this command.

Syntaxocrconfig -overwrite

Usage Notes■ You must run this command as root.

ExampleTo overwrite an OCR configuration:

# ocrconfig -overwrite

ocrconfig -repairUse the ocrconfig -repair command to repair an OCR configuration on the node from which you run this command. Use this command to add, delete, or replace an OCR configuration on a node that may have been stopped while you made changes to the OCR configuration in the cluster.

Syntaxocrconfig -repair -add file_name | -delete file_name | -replace current_file_name -replacement new_file_name

OCRCONFIG Command Reference

Managing the Oracle Cluster Registry E-7

Usage Notes■ You must run this command as root.

■ The file_name variable can be a device name, a file name, or the name of an ASM disk group. For example:

– /dev/raw/raw1

– /oradbocfs/crs/data.ocr

– d:\oracle\mirror.ocr

– +newdg

If you specify an ASM disk group, the name of the disk group must be preceded by a plus sign (+).

■ You can only use one option with ocrconfig -repair at a time.

ExampleTo repair an OCR configuration:

# ocrconfig -repair -delete +olddg

ocrconfig -replaceUse the ocrconfig -replace command to replace an OCR device or file on the node from which you run this command.

Syntaxocrconfig -replace current_file_name -replacement new_file_name

Usage Notes■ You must run this command as root.

■ The *_file_name variable can be a device name, a file name, or the name of an ASM disk group. For example:

– /dev/raw/raw1

– /oradbocfs/crs/data.ocr

– d:\oracle\mirror.ocr

– +newdg

If you specify an ASM disk group, the name of the disk group must be preceded by a plus sign (+).

ExampleTo replace an OCR device or file:

# ocrconfig -replace /dev/raw/raw1 -replacement +newdg

ocrconfig -restore

E-8 Oracle Clusterware Administration and Deployment Guide

ocrconfig -restore

Use the ocrconfig -restore command to restore OCR from an automatically created OCR backup file.

Syntaxocrconfig [-local] -restore file_name

Usage Notes■ You must run this command as root.

■ Use the -local option to restore a backup of OLR.

■ The file_name variable can be a file name or an ASM file name from an ASM disk group. For example:

– /oradbocfs/crs/BACKUP00.ocr

– d:\oracle\BACKUP01.ocr

– +backupdg:BACKUP02

If you specify an ASM disk group and file name, the name of the disk group must be preceded by a plus sign (+) and the disk group and file name must be separated by a colon (:).

ExampleTo restore OCR from a file on ASM:

# ocrconfig -restore +backupdg:BACKUP02

OCRCONFIG Command Reference

Managing the Oracle Cluster Registry E-9

ocrconfig -showbackup

Use the ocrconfig -showbackup command to display the backup location, timestamp, and the originating node name of the backup files. By default, this command displays information for both automatic and manual backups unless you specify auto or manual.

Syntaxocrconfig [-local] -showbackup [auto | manual]

Usage Notes■ Use the -local option to show manual OLR backup information. The -local

flag functions only with the manual option.

■ You can optionally specify auto or manual to display information about only automatic backups or only manual backups, respectively:

– auto: Displays information about automatic backups that Oracle Clusterware created in the past 4 hours, 8 hours, 12 hours, and in the last day and week.

– manual: Displays information about manual backups that you invoke using the ocrconfig -manualbackup command.

ExampleTo display manual backup information for OLR:

$ ocrconfig -local -showbackup manual

ocrconfig -upgrade

E-10 Oracle Clusterware Administration and Deployment Guide

ocrconfig -upgrade

Use the ocrconfig -upgrade command to upgrade OCR from a previous version.

Syntaxocrconfig [-local] -upgrade [user [group]]

Usage Notes■ You must run this command as root.

■ Use the -local option to upgrade OLR.

ExampleTo display manual backup information for OLR:

# ocrconfig -local -showbackup manual

Troubleshooting and Diagnostic Output

Managing the Oracle Cluster Registry E-11

Troubleshooting and Diagnostic Output

This section describes various methods for troubleshooting problems with OCR, and obtaining diagnostic information from the utilities used to manage the OCR.

This section contains the following topics:

■ Oracle Cluster Registry Troubleshooting

■ Using the OCRCHECK Utility

■ Using the OCRDUMP Utility to View Oracle Cluster Registry Content

■ Diagnostic Output for OCRCONFIG

Oracle Cluster Registry Troubleshooting

E-12 Oracle Clusterware Administration and Deployment Guide

Oracle Cluster Registry Troubleshooting

Table E–1 describes common OCR problems with corresponding resolution suggestions.

Table E–1 Common Oracle Cluster Registry Problems and Solutions

Problem Solution

Not currently using OCR mirroring and would like to enable it.

Run the ocrconfig command with the -replace option as described on page E-7.

OCR failed and you must replace it. Error messages in Oracle Enterprise Manager or OCR log file.

Run the ocrconfig command with the -replace option as described on page E-7.

OCR has a misconfiguration. Run the ocrconfig command with the -repair option as described on page E-6.

You are experiencing a severe performance effect from OCR processing or you want to remove OCR for other reasons.

Run the OCRCONFIG command with the -replace option as described on page E-7.

OCR has failed and before you can fix it, the node must be rebooted with only one OCR.

Run the ocrconfig with the -repair option to remove the bad OCR file. Oracle Clusterware cannot start if it cannot find all OCRs defined.

Troubleshooting and Diagnostic Output

Managing the Oracle Cluster Registry E-13

Using the OCRCHECK Utility

The OCRCHECK utility displays the version of the OCR’s block format, total space available and used space, OCRID, and the OCR locations that you have configured. OCRCHECK performs a block-by-block checksum operation for all of the blocks in all of the OCRs that you have configured. It also returns an individual status for each file and a result for the overall OCR integrity check.

You can run the ocrcheck -help command to display usage information about this utility.

The following example shows a sample of the OCRCHECK utility output:

# ocrcheck

Status of Oracle Cluster Registry is as follows : Version : 3 Total space (kbytes) : 262120 Used space (kbytes) : 752 Available space (kbytes) : 261368 ID : 2098980155 Device/File Name : +dg1 Device/File integrity check succeeded Device/File Name : +dg2 Device/File integrity check succeeded Device/File not configured Device/File not configured Device/File not configured Cluster registry integrity check succeeded Logical corruption check succeeded

To display only configured OCR locations:

$ ocrcheck -config

Oracle Cluster Registry configuration is : Device/File Name : Grid_home/oracle/has_work/data.ocr Device/File Name : Grid_home/oracle/has_work/mirror.ocr

Run the ocrcheck -local -config command to obtain OLR information.

$ ocrcheck -local -config

Oracle Local Registry configuration is : Device/File Name : Grid_home/oracle/has_work/data.olr.stact23OCRCHECK creates a log file in the Grid_home/log/host_name/client directory. To change the log level, edit the Grid_home/srvm/admin/ocrlog.ini file.

Note: The logical corruption check is only performed if you run the ocrcheck command as root.

Using the OCRDUMP Utility to View Oracle Cluster Registry Content

E-14 Oracle Clusterware Administration and Deployment Guide

Using the OCRDUMP Utility to View Oracle Cluster Registry Content

This section explains how to use the OCRDUMP utility to view OCR and Oracle Local Registry (OLR) content for troubleshooting. The OCRDUMP utility enables you to view the OCR and OLR contents by writing the content to a file or stdout in a readable format.

You can use several options for OCRDUMP. For example, you can limit the output to a key and its descendents. You can also write the contents to an XML file that you can view using a browser. OCRDUMP writes the OCR keys as ASCII strings and values in a data type format. OCRDUMP retrieves header information based on a best effort basis.

OCRDUMP also creates a log file in Grid_home/log/host_name/client. To change the log level, edit the Grid_home/srvm/admin/ocrlog.ini file.

To change the logging component, edit the entry containing the comploglvl= entry. For example, to change the log level of the OCRAPI component to 3 and to change the log level of the OCRRAW component to 5, make the following entry in the ocrlog.ini file:

comploglvl="OCRAPI:3;OCRRAW:5"

OCRDUMP Utility Syntax and OptionsThis section describes the OCRDUMP utility command syntax and usage. Run the ocrdump command with the following syntax where file_name is the name of a target file to which you want Oracle Database to write the Oracle Cluster Registry output and where key_name is the name of a key from which you want Oracle Database to write Oracle Cluster Registry subtree content:

ocrdump [file_name | -stdout] [-local] [-backupfile backup_file_name] [-keyname key_name] [-xml] [-noheader]

Table E–2 describes the OCRDUMP utility options and option descriptions.

Note: Make sure that you have file creation privileges in the Grid_home directory before using the OCRDUMP utility.

Table E–2 OCRDUMP Options and Option Descriptions

Options Description

file_name The name of a file to which you want OCRDUMP to write output.

By default, OCRDUMP writes output to a predefined output file named OCRDUMPFILE. The file_name option redirects OCRDUMP output to a file that you specify.

-stdout Use this option to redirect the OCRDUMP output to the text terminal that initiated the program.

If you do not redirect the output, OCRDUMP writes output to a predefined output file named OCRDUMPFILE.

-local Use this option to dump the contents of OLR.

-backupfile Use this option to view the contents of an OCR backup file. Use the -local option with this option to view the contents of an OLR backup file.

backup_file_name

The name of the backup file with the content you want to view. You can query the backups using the ocrconfig -showbackup command.

Troubleshooting and Diagnostic Output

Managing the Oracle Cluster Registry E-15

OCRDUMP Utility Examples The following ocrdump utility examples extract various types of OCR information and write it to various targets:

ocrdump

Writes the OCR content to a file called OCRDUMPFILE in the current directory.

ocrdump MYFILE

Writes the OCR content to a file called MYFILE in the current directory.

ocrdump -stdout -keyname SYSTEM

Displays the OCR content from the subtree of the key SYSTEM in the terminal window.

ocrdump -stdout -xml

Displays the OCR content in the terminal window in XML format.

ocrdump -stdout -backupfile $ORA_CRS_HOME/cdata/cluster_name/file_name

Writes the entire OCR and OLR content to a backup file located in the $ORA_CRS_HOME/cdata/cluster_name directory. You must run this command as root to be able to view all of the keys. Be sure to name the file appropriately so that it can be recognized by anyone as an OCR backup file, such as BACKUPOO.ocr.

Sample OCRDUMP Utility OutputThe following OCRDUMP examples show the KEYNAME, VALUE TYPE, VALUE, permission set (user, group, world) and access rights for two sample runs of the ocrdump command. The following shows the output for the SYSTEM.language key that has a text value of AMERICAN_AMERICA.WE8ASCII37.

[SYSTEM.language]ORATEXT : AMERICAN_AMERICA.WE8ASCII37SECURITY : {USER_PERMISSION : PROCR_ALL_ACCESS, GROUP_PERMISSION : PROCR_READ, OTHER_PERMISSION : PROCR_READ, USER_NAME : user, GROUP_NAME : group} The following shows the output for the SYSTEM.version key that has integer value of 3:

[SYSTEM.version]UB4 (10) : 3SECURITY : {USER_PERMISSION : PROCR_ALL_ACCESS, GROUP_PERMISSION : PROCR_READ, OTHER_PERMISSION : PROCR_READ, USER_NAME : user, GROUP_NAME : group}

-keyname key_name

The name of an Oracle Cluster Registry key whose subtree is to be dumped.

-xml Use this option to write the output in XML format.

-noheader Does not print the time at which you ran the command and when the Oracle Cluster Registry configuration occurred.

Table E–2 (Cont.) OCRDUMP Options and Option Descriptions

Options Description

Diagnostic Output for OCRCONFIG

E-16 Oracle Clusterware Administration and Deployment Guide

Diagnostic Output for OCRCONFIG

The OCRCONFIG utility creates a log file in Grid_home/log/host_name/client.

To change the amount of logging, edit the path in the Grid_home/srvm/admin/ocrlog.ini file.

Troubleshooting Oracle Clusterware F-1

FTroubleshooting Oracle Clusterware

This appendix introduces monitoring the Oracle Clusterware environment and explains how you can enable dynamic debugging to troubleshoot Oracle Clusterware processing, and enable debugging and tracing for specific components and specific Oracle Clusterware resources to focus your troubleshooting efforts.

This appendix contains the following topics:

■ Monitoring Oracle Clusterware

■ Dynamic Debugging

■ Component Level Debugging

■ Oracle Clusterware Shutdown and Startup

■ Enabling and Disabling Oracle Clusterware Daemons

■ Determining the Active Versions and Software Versions

■ Diagnostics Collection Script

■ Oracle Clusterware Alerts

■ Resource Debugging

■ Checking the Health of the Clusterware

■ Clusterware Log Files and the Unified Log Directory Structure

■ Troubleshooting the Oracle Cluster Registry

■ Enabling Additional Tracing for Oracle Clusterware High Availability

Monitoring Oracle ClusterwareYou can use Oracle Enterprise Manager to monitor the Oracle Clusterware environment. When you log in to Oracle Enterprise Manager using a client browser, the Cluster Database Home page appears where you can monitor the status of both Oracle Clusterware environments. Monitoring can include such things as:

■ Notification if there are any VIP relocations

■ Status of the Oracle Clusterware on each node of the cluster using information obtained through the Cluster Verification Utility (cluvfy)

■ Notification if node applications (nodeapps) start or stop

■ Notification of issues in the Oracle Clusterware alert log for the OCR, voting disk issues (if any), and node evictions

Monitoring Oracle Clusterware

F-2 Oracle Clusterware Administration and Deployment Guide

The Cluster Database Home page is similar to a single-instance Database Home page. However, on the Cluster Database Home page, Oracle Enterprise Manager displays the system state and availability. This includes a summary about alert messages and job activity, as well as links to all the database and Automatic Storage Management (ASM) instances. For example, you can track problems with services on the cluster including when a service is not running on all of the preferred instances or when a service response time threshold is not being met.

You can use the Oracle Enterprise Manager Interconnects page to monitor the Oracle Clusterware environment. The Interconnects page shows the public and private interfaces on the cluster, the overall throughput on the private interconnect, individual throughput on each of the network interfaces, error rates (if any) and the load contributed by database instances on the interconnect, including:

■ Overall throughput across the private interconnect

■ Notification if a database instance is using public interface due to misconfiguration

■ Throughput and errors (if any) on the interconnect

■ Throughput contributed by individual instances on the interconnect

All of this information also is available as collections that have a historic view. This is useful in conjunction with cluster cache coherency, such as when diagnosing problems related to cluster wait events. You can access the Interconnects page by clicking the Interconnect tab on the Cluster Database home page.

Also, the Oracle Enterprise Manager Cluster Database Performance page provides a quick glimpse of the performance statistics for a database. Statistics are rolled up across all the instances in the cluster database in charts. Using the links next to the charts, you can get more specific information and perform any of the following tasks:

■ Identify the causes of performance issues.

■ Decide whether resources need to be added or redistributed.

■ Tune your SQL plan and schema for better optimization.

■ Resolve performance issues

The charts on the Cluster Database Performance page include the following:

■ Chart for Cluster Host Load Average—The Cluster Host Load Average chart in the Cluster Database Performance page shows potential problems that are outside the database. The chart shows maximum, average, and minimum load values for available nodes in the cluster for the previous hour.

■ Chart for Global Cache Block Access Latency—Each cluster database instance has its own buffer cache in its System Global Area (SGA). Using Cache Fusion, Oracle RAC environments logically combine each instance's buffer cache to enable the database instances to process data as if the data resided on a logically combined, single cache.

■ Chart for Average Active Sessions—The Average Active Sessions chart in the Cluster Database Performance page shows potential problems inside the database. Categories, called wait classes, show how much of the database is using a resource, such as CPU or disk I/O. Comparing CPU time to wait time helps to determine how much of the response time is consumed with useful work rather than waiting for resources that are potentially held by other processes.

■ Chart for Database Throughput—The Database Throughput charts summarize any resource contention that appears in the Average Active Sessions chart, and

Component Level Debugging

Troubleshooting Oracle Clusterware F-3

also show how much work the database is performing on behalf of the users or applications. The Per Second view shows the number of transactions compared to the number of logons, and the amount of physical reads compared to the redo size for each second. The Per Transaction view shows the amount of physical reads compared to the redo size for each transaction. Logons is the number of users that are logged on to the database.

In addition, the Top Activity drilldown menu on the Cluster Database Performance page enables you to see the activity by wait events, services, and instances. Plus, you can see the details about SQL/sessions by going to a prior point in time by moving the slider on the chart.

Dynamic DebuggingYou can use crsctl commands as the root user to enable dynamic debugging for Oracle Clusterware, the Event Manager (EVM), and the clusterware subcomponents. You can dynamically change debugging levels using crsctl commands. Debugging information remains in the Oracle Cluster Registry (OCR) for use during the next startup. You can also enable debugging for resources.

The crsctl syntax to enable debugging for Oracle Clusterware is:

crsctl debug log crs "CRSRTI:1,CRSCOMM:2"

The crsctl syntax to enable debugging for EVM is:

crsctl debug log evm "EVMCOMM:1"

The crsctl syntax to enable debugging for resources is:

crsctl debug log res "resname:1"

Component Level DebuggingYou can use crsctl commands as the root user to enable dynamic debugging for the Oracle Clusterware Cluster Ready Services (CRS), Oracle Cluster Registry (OCR), Cluster Synchronization Services (CSS), and the Event Manager (EVM).

This section contains the following topics:

■ Enabling Debugging for CRS, OCR, CSS, and EVM Modules

■ Creating an Initialization File to Contain the Debugging Level

Enabling Debugging for CRS, OCR, CSS, and EVM ModulesYou can enable debugging for the CRS, OCR, CSS, and EVM modules and their components by setting environment variables or by issuing crsctl debug commands using the following syntax:

crsctl debug log module_name component:debugging_level

You must issue the crsctl debug command as the root user, and supply the following information:

■ module_name—The name of the module: CRS, EVM, or CSS.

See Also: Oracle Database 2 Day + Real Application Clusters Guide

Component Level Debugging

F-4 Oracle Clusterware Administration and Deployment Guide

■ component—The name of a component for the CRS, OCR, EVM, or CSS module. See Table F–1 for a list of all of the components.

■ debugging_level—A number from 1 to 5 to indicate the level of detail you want the debug command to return, where 1 is the least amount of debugging output and 5 provides the most detailed debugging output.

You can dynamically change the debugging level in the crsctl command, or you can configure an init file for changing the debugging level as described in "Creating an Initialization File to Contain the Debugging Level" on page F-5.

The following commands show examples of how to enable debugging for the various modules:

■ To enable debugging for Oracle Clusterware:

crsctl debug log crs "CRSRTI:1,CRSCOMM:2"

■ To enable debugging for OCR:

crsctl debug log crs "CRSRTI:1,CRSCOMM:2,OCRSRV:4"

■ To enable debugging for EVM:

crsctl debug log evm "EVMCOMM:1"

■ To enable debugging for resources

crsctl debug log res "resname:1"

To list the components that can be used for debugging, issue the crsctl lsmodules command using the following syntax and supply crs, evm, or css for the module_name parameter:

crsctl lsmodules module_name

Table F–1 shows the components for the CRS, OCR, EVM, and CSS modules, respectively. Note that some of the component names are common between the CRS, EVM, and CSS daemons and may be enabled on that specific daemon. For example COMMNS is the NS layer and because each daemon uses the NS layer, you can enable this specific module component on any of the daemons to get specific debugging information.

Note: You do not have to be the root user to run the crsctl command with the lsmodules option.

Oracle Clusterware Shutdown and Startup

Troubleshooting Oracle Clusterware F-5

Creating an Initialization File to Contain the Debugging LevelThis section describes how to specify the debugging level in an initialization file. This debugging information is stored for use during the next startup.

For each process that you want to debug, you can create an initialization file that contains the debugging level.

The initialization file name includes the name of the process that you are debugging (process_name.ini). The file is located in the |Oracle_home/log/hostname/admin/| directory.

For example, ORACLE_HOME/log/hostA/admin/clscfg.ini is the name for the CLSCFG debugging initialization file on hostA.

Oracle Clusterware Shutdown and StartupYou can start or stop Oracle Clusterware by issuing crsctl start and stop commands.

Example 1 Stopping Oracle ClusterwareTo stop Oracle Clusterware and its related resources on a specific node, issue the following command:

crsctl stop crs

Table F–1 Components for the CRS, OCR, EVM, and CSS Modules

CRS Modules1

1 List the CRS component modules using the crsctl lsmodules crs command.

OCR Modules2

2 You cannot list the OCR modules using the crsctl lsmodules command.

EVM Modules3

3 List the EVM component modules using the crsctl lsmodules evm command.

CSS Modules4

4 List the CSS component modules using the crsctl lsmodules css command.

CRSUICRSCOMMCRSRTICRSMAINCRSPLACECRSAPPCRSRESCRSCOMMCRSOCRCRSTIMERCRSEVTCRSDCLUCLSCSSCLNTCOMMCRSCOMMNS

OCRAPIOCRCLIOCRSRVOCRMASOCRMSGOCRCACOCRRAWOCRUTLOCROSD

OCR Tools Modules

OCRCONFOCRDUMPOCRCHECK

EVMDEVMDMAINEVMCOMMEVMEVTEVMAPPEVMAGENTCRSOCRCLUCLSCSSCLNTCOMMCRSCOMMNS

CSSDCOMMCRSCOMMNS

See Also: "Enabling Debugging for CRS, OCR, CSS, and EVM Modules" on page F-3 for information about dynamically changing debugging levels by specifying the level number (from 1 to 5) on the crsctl command

Enabling and Disabling Oracle Clusterware Daemons

F-6 Oracle Clusterware Administration and Deployment Guide

Example 2 Starting Oracle ClusterwareTo start Oracle Clusterware and its related resources on a specific node, issue the following command:

crsctl start crs

Enabling and Disabling Oracle Clusterware DaemonsWhen the Oracle Clusterware daemons are enabled, they start automatically at the time the node is started. To prevent the daemons from starting, you can disable them using crsctl commands. You can use crsctl commands as follows to enable and disable the startup of the Oracle Clusterware daemons.

Run the following command to enable startup for all of the Oracle Clusterware daemons:

crsctl enable crs

Run the following command to disable the startup of all of the Oracle Clusterware daemons:

crsctl disable crs

Determining the Active Versions and Software VersionsYou can determine the active version or the software version running on the local node cluster by issuing crsctl activeversion and softwarewareversion commands.

■ The software version is the binary version of the software on a particular cluster node.

■ The active version is the lowest software version running in a cluster.

These versions are used while upgrading a cluster.

Example 1 Determining the Active VersionTo determine the active version on the local node, issue the following command:

crsctl query crs activeversion

Example 2 Determining the Software VersionTo determine the software version on the local node, issue the following command:

crsctl query crs softwareversion

Note: You must run these crsctl commands as the root user.

Notes:

■ You must run these crsctl commands as the root user.

■ Neiter of these commands is supported on Windows systems

Checking the Health of the Clusterware

Troubleshooting Oracle Clusterware F-7

Diagnostics Collection ScriptEvery time an Oracle Clusterware error occurs, you should use run the diagcollection.pl script to collect diagnostic information from Oracle Clusterware in trace files. The diagnostics provide additional information so Oracle Support can resolve problems. Run this script from the following location:

CRS_home/bin/diagcollection.pl

Oracle Clusterware AlertsOracle Clusterware posts alert messages when important events occur. The following is an example of an alert from the CRSD process:

2007-09-03 10:05:35.463[cssd(3073)]CRS-1605:CSSD voting file is online: /dev/sdm2. Details in /scratch/crs/log/stnsp012/cssd/ocssd.log.2007-09-03 10:05:35.484[cssd(3073)]CRS-1605:CSSD voting file is online: /dev/sdl3. Details in /scratch/crs/log/stnsp012/cssd/ocssd.log.[cssd(3073)]CRS-1601:CSSD Reconfiguration complete. Active nodes are stnsp011 stnsp012 stnsp013 stnsp014 .2007-09-03 10:05:36.949[evmd(2218)]CRS-1401:EVMD started on node stnsp012.2007-09-03 10:05:36.999[crsd(2232)]CRS-1012:The OCR service started on node stnsp012.2007-09-03 10:05:38.770[crsd(2232)]CRS-1201:CRSD started on node stnsp012.

The location of this alert log on Linux, UNIX, and Windows systems is in the following directory path, where CRS_home is the name of the location of Oracle Clusterware: CRS_home/log/hostname/alerthostname.log.

The following example shows an EVMD alert:

[NORMAL] CLSD-1401: EVMD started on node %s [ERROR] CLSD-1402: EVMD aborted on node %s. Error [%s]. Details in %s.

Resource DebuggingYou can use crsctl command to enable resource debugging using the following syntax:

crsctl debug log res "ora.node1.vip:1"

This has the effect of setting the environment variable USER_ORA_DEBUG, to 1, before running the start, stop, or check action scripts for the ora.node1.vip resource.

Checking the Health of the ClusterwareUse the crsctl check command to determine the health of your clusterware as in the following example:

crsctl check crs

Note: You must run this script as the root user.

Note: You must run this crsctl command as the root user.

Clusterware Log Files and the Unified Log Directory Structure

F-8 Oracle Clusterware Administration and Deployment Guide

Run the following command to determine the health of individual daemons where daemon is crsd, cssd or evmd:

crsctl check daemon

Clusterware Log Files and the Unified Log Directory StructureOracle Database uses a unified log directory structure to consolidate the Oracle Clusterware component log files. This consolidated structure simplifies diagnostic information collection and assists during data retrieval and problem analysis.

Oracle Clusterware retains one current log file and five older log files that are 50 MB in size (300 MB of storage) for the cssd process, and one current log file and 10 older log files that are 10 MB in size (110 MB of storage) for the crsd process. In addition, Oracle Clusterware overwrites the oldest retained log file for any log file group when the current log file gets stored. Alert files are stored in the directory structures shown in Table F–2.

Note: You do not have to be the root user to perform health checks.

Table F–2 Locations of Oracle Clusterware Component Log Files

Component Log File Location1

1 The directory structure is the same for Linux, UNIX, and Windows systems.

Cluster Ready Services Daemon (crsd) Log Files

CRS home/log/hostname/crsd

Oracle Cluster Registry (OCR) records l

For the OCR tools (OCRDUMP, OCRCHECK, OCRCONFIG) record log information in the following location:2

CRS_Home/log/hostname/client

The OCR server records log information in the following location:3

CRS_home/log/hostname/crsd

2 To change the amount of logging, edit the path in the CRS_home/srvm/admin/ocrlog.ini file.

Oracle Process Monitor Daemon (OPROCD)

The following path is specific to Linux4:

/etc/oracle/hostname.oprocd.log

Cluster Synchronization Services (CSS)

CRS_home/log/hostname/cssd

Event Manager (EVM) information generated by evmd

CRS_home/log/hostname/evmd

Oracle RAC RACG The Oracle RAC high availability trace files are located in the following two locations:

CRS_home/log/hostname/racg

and

$ORACLE_HOME/log/hostname/racg

Core files are in subdirectories of the log directory. Each RACG executable has a subdirectory assigned exclusively for that executable. The name of the RACG executable subdirectory is the same as the name of the executable.

Troubleshooting the Oracle Cluster Registry

Troubleshooting Oracle Clusterware F-9

Troubleshooting the Oracle Cluster RegistryThis following topics in this section explain how to troubleshoot the OCR:

■ Using the OCRDUMP Utility to View Oracle Cluster Registry Content

■ Using the OCRCHECK Utility

■ Oracle Cluster Registry Troubleshooting

Using the OCRDUMP Utility to View Oracle Cluster Registry ContentThis section explains how to use the OCRDUMP utility to view OCR content for troubleshooting. The OCRDUMP utility enables you to view the OCR contents by writing OCR content to a file or stdout in a readable format.

You can use a number of options for OCRDUMP. For example, you can limit the output to a key and its descendents. You can also write the contents to an XML file that you can view using a browser. OCRDUMP writes the OCR keys as ASCII strings and values in a datatype format. OCRDUMP retrieves header information based on a best effort basis.

OCRDUMP also creates a log file in CRS_home/log/hostname/client. To change the amount of logging, edit the file CRS_Home/srvm/admin/ocrlog.ini.

To change the logging component, edit the entry containing the comploglvl= entry. For example, to change the logging of the ORCAPI component to 3 and to change the logging of the OCRRAW component to 5, make the following entry in the ocrlog.ini file:

comploglvl="OCRAPI:3;OCRRAW:5"

OCRDUMP Utility Syntax and Options This section describes the OCRDUMP utility command syntax and usage. Run the ocrdump command with the following syntax where filename is the name of a target file to which you want Oracle Database to write the OCR output and where keyname is the name of a key from which you want Oracle Database to write OCR subtree content:

ocrdump [file_name|-stdout] [-backupfile backup_file_name] [-keyname keyname] [-xml] [-noheader]

Table F–3 describes the OCRDUMP utility options and option descriptions.

3 To change the amount of logging, edit the path in the CRS_home/log/hostname/crsd/crsd.ini file.4 This path is dependent upon the installed Linux or UNIX platform.

Note: Make sure that you have file creation privileges in the CRS_home directory before using the OCRDUMP utility.

Table F–3 OCRDUMP Options and Option Descriptions

Options Description

file_name The name of a file to which you want OCRDUMP to write output.

By default, output from the OCRDUMP utility is written to the predefined output file named OCRDUMPFILE. The file_name option redirects OCRDUMP output to the file that you specify.

Troubleshooting the Oracle Cluster Registry

F-10 Oracle Clusterware Administration and Deployment Guide

OCRDUMP Utility Examples The following ocrdump utility examples extract various types of OCR information and write it to various targets:

ocrdump

Writes the OCR content to a file called OCRDUMPFILE in the current directory.

ocrdump MYFILE

Writes the OCR content to a file called MYFILE in the current directory.

ocrdump -stdout -keyname SYSTEM

Writes the OCR content from the subtree of the key SYSTEM to stdout.

ocrdump -stdout -xml

Writes the OCR content to stdout in XML format.

Sample OCRDUMP Utility OutputThe following OCRDUMP examples show the KEYNAME, VALUE TYPE, VALUE, permission set (user, group, world) and access rights for two sample runs of the ocrdump command. The following shows the output for the SYSTEM.language key that has a text value of AMERICAN_AMERICA.WE8ASCII37.

[SYSTEM.language]ORATEXT : AMERICAN_AMERICA.WE8ASCII37SECURITY : {USER_PERMISSION : PROCR_ALL_ACCESS, GROUP_PERMISSION : PROCR_READ, OTHER_PERMISSION : PROCR_READ, USER_NAME : user, GROUP_NAME : group} The following shows the output for the SYSTEM.version key that has integer value of 3:

[SYSTEM.version]UB4 (10) : 3SECURITY : {USER_PERMISSION : PROCR_ALL_ACCESS, GROUP_PERMISSION : PROCR_READ, OTHER_PERMISSION : PROCR_READ, USER_NAME : user, GROUP_NAME : group}

-stdout Use this option to redirect the OCRDUMP output to the text terminal that initiated the program.

If you do not redirect the output, output from the OCRDUMP utility is written to the predefined output file named OCRDUMPFILE by default.

-keyname The name of an OCR key whose subtree is to be dumped.

-xml Writes the output in XML format.

-noheader Does not print the time at which you ran the command and when the OCR configuration occurred.

-backupfile Option to identify a backup file.

backup_file_name

The name of the backup file with the content you want to view. You can query the backups using the ocrconfig -showbackup command.

Table F–3 (Cont.) OCRDUMP Options and Option Descriptions

Options Description

Enabling Additional Tracing for Oracle Clusterware High Availability

Troubleshooting Oracle Clusterware F-11

Using the OCRCHECK UtilityThe OCRCHECK utility displays the version of the OCR’s block format, total space available and used space, OCRID, and the OCR locations that you have configured. OCRCHECK performs a block-by-block checksum operation for all of the blocks in all of the OCRs that you have configured. It also returns an individual status for each file as well as a result for the overall OCR integrity check.

The following example shows a sample of the OCRCHECK utility output:

Status of Oracle Cluster Registry is as follows : Version : 2 Total space (kbytes) : 262144 Used space (kbytes) : 16256 Available space (kbytes) : 245888 ID : 1918913332 Device/File Name : /dev/raw/raw1 Device/File integrity check succeeded Device/File Name : /dev/raw/raw2 Device/File integrity check succeeded Cluster registry integrity check succeeded

OCRCHECK creates a log file in the directory CRS_home/log/hostname/client. To change amount of logging, edit the file CRS_home/srvm/admin/ocrlog.ini.

Oracle Cluster Registry TroubleshootingTable F–4 describes common OCR problems with corresponding resolution suggestions.

Enabling Additional Tracing for Oracle Clusterware High AvailabilityOracle Support may ask you to enable tracing to capture additional information. Because the procedures described in this section may affect performance, only perform these activities with the assistance of Oracle Support. This section includes the following topics:

■ Generating Additional Trace Information for a Running Resource

■ Verifying Event Manager Daemon Communications

Table F–4 Common OCR Problems and Solutions

Problem Solution

Not currently using OCR mirroring and would like to enable it.

Run the ocrconfig command with the -replace option as described on page 2-4.

An OCR failed and you need to replace it. Error messages in Oracle Enterprise Manager or OCR log file.

Run the ocrconfig command with the -replace option as described on page 2-5.

An OCR has a misconfiguration. Run the ocrconfig command with the -repair option as described on page 2-5.

You are experiencing a severe performance effect from OCR processing or you want to remove an OCR for other reasons.

Run the ocrconfig command with the -replace option as described on page 2-4.

An OCR has failed and before you can fix it, the node need to be rebooted with only one OCR.

Run the ocrconfig -repair command to remove the bad ocr file. Oracle Clusterware will not start if it cannot find all OCRs defined.

Enabling Additional Tracing for Oracle Clusterware High Availability

F-12 Oracle Clusterware Administration and Deployment Guide

Generating Additional Trace Information for a Running ResourceTo generate additional trace information for a running resource, Oracle recommends that you use CRSCTL commands. For example, issue the following command to turn on debugging for resources:

$ crsctl debug log res "resource_name:level"

For example, to set the value of the USR_ORA_DEBUG initialization parameter to 1 for the VIP resource, issue the following command:

$ crsctl debug log res ora.cwclu011.vip:1

Verifying Event Manager Daemon CommunicationsThe event manager daemons (evmd) running on separate nodes communicate through specific ports. To determine whether the evmd for a node can send and receive messages, perform the test described in this section while running session 1 in the background.On node 1, session 1 enter:

$ evmwatch –A –t "@timestamp @@"

On node 2, session 2 enter:

$ evmpost -u "hello" [-h nodename]

Session 1 should show output similar to the following:

$ 21-Jul-2007 08:04:26 hello

Ensure that each node can both send and receive messages by executing this test in several permutations.

Oracle Clusterware API Messages G-1

GOracle Clusterware API Messages

This appendix describes the Oracle Clusterware API messages. The Oracle Clusterware commands and APIs are the primary sources of these error messages.

CRS-0184: Cannot communicate with the CRS daemon.Cause: The CRS daemon on the local node is either not running or there was an internal communication error with the CRS daemon.

Action: Check if the CRS daemon process is running on the local node.

CRS-0210: Could not find resource '%s'.Cause: An attempt was made to operate on a resource that is not registered.

Action: Check if the resource is registered using crs_stat.

CRS-0211: Resource '%s' has already been registered.Cause: An attempt was made to register a resource that is already registered.

Action: Check if the resource is registered using crs_stat.

CRS-0213: Could not register resource '%s'.Cause: There was an internal error while registering the resource.

Action: Check the CRS daemon log file.

CRS-0214: Could not unregister resource '%s'.Cause: There was an internal error while unregistering the resource.

Action: Check the CRS daemon log file.

CRS-0215: Could not start resource '%s'.Cause: There was an internal error while starting the resource.

Action: Check the CRS daemon log file.

CRS-0216: Could not stop resource '%s'.Cause: There was an internal error while stopping the resource.

See Also: Oracle Database Platform Guide for Microsoft Windows for Windows messages and for all other messages refer search online at

http://tahiti.oracle.com

Note: The Oracle Clusterware API demo is not supported for Windows.

G-2 Oracle Clusterware Administration and Deployment Guide

Action: Check the CRS daemon log file.

CRS-0217: Could not relocate resource '%s'.Cause: There was an internal error while relocating the resource.

Action: Check the CRS daemon log file.

CRS-0218: Could not restart the resource '%s' on the original node.Cause: There was an internal error while restarting the resource.

Action: Check the CRS daemon log file.

CRS-0219: Could not update resource '%s'.Cause: There was an internal error while updating the resource.

Action: Check the CRS daemon log file.

CRS-0220: Resource '%s' has invalid resource profile. Cause: Invalid attributes in the resource profile.

Action: Run crs_profile -validate to identify the invalid attributes.

CRS-0221: Resource '%s''s action script cannot be found.Cause: The action script has been deleted from the file system.

Action: Run crs_stat -p to determine the action script location and to check for its existence.

CRS-0223: Resource '%s' has placement error.Cause: There was no host available on which to failover/start the resource based on the Placement Policy for the resource.

Action: Check the target host for the resource and restart the resource using the crs_start command.

CRS-0230: Member '%s' is not in the cluster.Cause: The hostname was not found in the cluster.

Action: Check the hostnames in the cluster.

CRS-0232: Cluster member is down. Cannot perform operation.Cause: The node on which CRS is attempting to start the resource is down.

Action: Start the node and retry the operation.

CRS-0233: Resource or relatives are currently involved with another operation.Cause: Another CRS daemon was operating on the same resource.

Action: Wait for a minute and try the command or operation again.

CRS-0253: CRS configuration error, the CRS default directory is not set in OCR. Cause: The OCR key which contains the user default CRS key is not initialized.

Action: Check the CRS configuration. If necessary reinstall CRS.

CRS-0254: Authorization failure.Cause: The user permissions were insufficient to operate on the resource.

Action: Check the permissions associated with the resource using crs_getperm.

CRS-0255: CRSD is not running in privileged mode. Insufficient permissions to run this command. Cause: The CRS daemon was not running as the privileged user.

Oracle Clusterware API Messages G-3

Action: Check if the CRS daemon is running as root (UNIX or Linux) or Administrator (Windows).

CRS-0256: Username conflicts with the owner of the resource.Cause: An attempt was made to give separate user level permissions for the owner of the resource.

Action: Check the owner of the resource and the user being given permissions.

CRS-0257: Groupname conflicts with the primary group of the resource.Cause: An attempt was made to give separate group level permissions for the primary group of the resource.

Action: Check the primary group of the resource and the group being given permissions.

CRS-0258: Invalid ACL string format.Cause: CRS-258: Invalid ACL string format.

Action: Check the syntax of the permission string (ACL).

CRS-0259: Owner of the resource does not belong to the group.Cause: The owner of the resource does not belong to the expected group.

Action: If this resource is owned by the root user, check if the root user belongs to the DBA group.

CRS-0402: Could not make safe dir('%s').Cause: Unable to create safe directory('%s').

Action: Please check if you have proper permissions and sufficient space on the disk to create the directory.

CRS-0403: Could not chdir to safe dir('%s').Cause: Unable to change directory to safe dir('%s').

Action: Please check if safe dir exists and if you have proper permissions.

CRS-0406: Could not create lock dir ('%s').Cause: Unable to create lock directory ('%s')

Action: Please check if you have proper permissions and sufficient space on the disk to create the directory.

CRS-0407: Another CRSD may be running, could not obtain lock file '%s'.Cause: Unable to obtain lock file as another CRSD may be running

Action: Please stop the existing CRSD before attempting to start CRSD again.

CRS-0413: Could not initialize the CSS context.Cause: Unable to communicate with the cluster services.

Action: Verify that the CSS Daemon is properly configured and is running.

CRS-0414: Could not establish EVM connection.Cause: Unable to communicate with EVM daemon.

Action: Run the 'crsctl check evmd' command to determine whether EVM daemon is properly configured and is running.

CRS-0451: CRS configuration error, unable to initialize OCR.

G-4 Oracle Clusterware Administration and Deployment Guide

Cause: The OCR that contains information about the CRS configuration is unavailable.

Action: Check the CRS configuration. If necessary reinstall CRS.

CRS-0452: CRS configuration error, unable to find CRSD Connection Information in OCR. Cause: The OCR key which contains the user default CRSD connection is not initialized.

Action: Check the CRS configuration. If necessary reinstall CRS.

CRS-0453: CRS configuration error, unable to find Instance Information in OCR. Cause: The OCR key which contains the Instance's information is not initialized.

Action: Add the instance using srvctl.

CRS-0471: Node number is not found.Cause: Cluster Services is unable to retrieve the node name.

Action: Verify your cluster installation, including any vendor cluster ware. If necessary reinstall the cluster.

CRS-0472: Node name is not found.Cause: Cluster Services is unable to retrieve the node name.

Action: Verify your cluster installation, including any vendor cluster ware. If necessary reinstall the cluster.

CRS-1005: Failed to get required resources.Cause: There was an internal error while evaluating the required resources for the subject resource.

Action: Check if the status of any resource is UNKONOWN using crs_stat -t.

CRS-1006: No more members to consider.Cause: There was no host found on which to start the resource based on the placement policy.

Action: Check the placement policy and the required resources for the subject resource.

CRS-1007: Failed after successful dependency consideration.Cause: There was no host found on which to start the resource based on the placement policy.

Action: Check the placement policy and the required resources for the subject resource.

CRS-1009: Resource '%s' is already running on member '%s'.Cause: An attempt was made to start a resource on a host while it is already running on that host.

Action: This is an insignificant error. Check the operation being performed. 1011, 0, Trying to relocate to a dead member.

CRS-2001: User does not have permission to start CRSD.Cause: Unable to start CRSD due to insufficient permissions.

Action: Start CRSD as a privileged user.

CRS-2007: Could not communicate with EVM.

Oracle Clusterware API Messages G-5

Cause: Unable to communicate with EVM daemon.

Action: Run the 'crsctl check evmd' command to determine whether EVM daemon is properly configured and is running.

G-6 Oracle Clusterware Administration and Deployment Guide

Oracle Clusterware Alert Messages H-1

HOracle Clusterware Alert Messages

This appendix describes the Oracle Clusterware messages that are returned in the Oracle Clusterware alert log.

CRS--0600: [%s] Error [%s]. Details in %s.Cause: None

Action: None

CRS-1001 The OCR was formatted using version %d.Cause: Successfully formatted the OCR location(s).

Action: None

CRS-1002 The OCR was restored from %s.Cause: The OCR was successfully restored from a backup file as requested by the user.

Action: None

CRS-1003 The OCR format was downgraded to version %d.Cause: The OCR was successfully downgraded to an earlier block format as requested by the user.

Action: None

CRS-1004 The OCR was imported from %s.Cause: Successfully imported the OCR contents from a file as requested by the user.

Action: None

CRS-1005 The OCR upgrade was completed. Version has changed from %d to %d. Details in %s.Cause: The OCR was successfully upgraded to a newer block format.

Action: None

CRS-1006 The OCR location %s is inaccessible. Details in %s.Cause: An error occurred while accessing the OCR.

Action: Use the ocrcheck command to validate the accessibility of the device and its block integrity. Check that the OCR location in question has the correct

See Also: Oracle Database Platform Guide for Microsoft Windows for Windows messages and for all other messages search online at

http://tahiti.oracle.com

H-2 Oracle Clusterware Administration and Deployment Guide

permissions. Determine whether this behavior is limited to one node or whether it occurs across all of the nodes in the cluster. Use the ocrconfig command with the -replace option to replace the OCR location.

CRS-1007 The OCR/OCR mirror location was replaced by %s.Cause: The OCR location was successfully replaced as requested by the user.

Action: None

CRS-1008 Node %s is not responding to OCR requests. Details in %s.Cause: Error in communicating to the OCR server on a peer node. This OCR did not receive a notification regarding its peer's death within the specified time.

Action: Contact Oracle Support.

CRS-1009 The OCR configuration is invalid. Details in %s.Cause: The OCR configuration on this node does not match the OCR configuration on the other nodes in the cluster.

Action: Determine the OCR configuration on the other nodes in the cluster on which Oracle Clusterware is running by using the ocrcheck command. Run the ocrconfig command with the -repair option to correct the OCR configuration on this node.

CRS-1010 The OCR mirror location %s was removed.Cause: The OCR location was successfully removed as requested by the user.

Action: None

CRS-1011 OCR cannot determine that the OCR content contains the latest updates. Details in %s.Cause: The OCR could not be started. The OCR location configured on this node does not have the necessary votes and might not have the latest updates.

Action: Ensure that the other nodes in the cluster have the same OCR location configured. If the configuration on the other nodes in the cluster does not match, then run the ocrconfig command with the -repair option to correct the configuration on this node.

If the configurations on all of the nodes match, use the ocrdump command to ensure that the existing OCR location has the latest updates. Run the ocrconfig command with the -overwrite option to correct the problem. If the se procedures do not correct the problem, then contact Oracle Support Services.

CRS-1012 The OCR service started on node %s.Cause: The OCR was successfully started.

Action: None

CRS-1013 The OCR at %s was successfully formatted using version %d. Ignore earlier CRS-1006 messages if any.Cause: Successfully formatted the OCR location(s).

Action: Ignore earlier CRS-1006 messages if any.

CRS-1201 CRSD started on node %s.Cause: CRSD has started, possibly due to a CRS start, or a node reboot or a CRSD restart.

Action: None Required. You can run the command 'crsctl check crsd' to validate the health of the CRSD

Oracle Clusterware Alert Messages H-3

CRS-1202 CRSD aborted on node %s. Error [%s]. Details in %s.Cause: Fatal Internal Error. Check the CRSD log file to determine the cause.

Action: Determine whether the CRSD gets auto-started.

CRS-1203 Failover failed for the CRS resource %s. Details in %s.Cause: Failover failed due to an internal error. Examine the contents of the CRSD log file to determine the cause.

Action: None

CRS-1204 Recovering CRS resources for node %s.Cause: CRS resources are being recovered, possibly because the cluster node is starting up online.

Action: Check the status of the resources using the crs_stat command.

CRS-1205 Auto-start failed for the CRS resource %s. Details in %s.Cause: This message comes up when the auto-start for the resource has failed during a reboot of the cluster node.

Action: Start the resources using the crs_start command.

CRS-1206 Resource %s went into an UNKNOWN state. Force stop the resource using the crs_stop -f command and restart %s.Cause: Resource went into an UNKNOWN state because the check or the stop action on the resource failed.

Action: Force stop the resource using the crs_stop -f command and restart the resource

CRS-1207 There are no more restart attempts left for resource %s. Restart the resource manually using the crs_start command.Cause: The Oracle Clusterware is no longer attempting to restart the resource because the resource has failed and the Oracle Clusterware has exhausted the maximum number of restart attempts.

Action: Use the crs_start command to restart the resource manually.

CRS-1401 EVMD started on node %s.Cause: EVMD has started either because of a CRS start, a node reboot, or an EVMD restart.

Action: None required. You can run the 'crsctl check evmd' command to validate the health of EVMD.

CRS-1402 EVMD aborted on node %s. Error [%s]. Details in %s.Cause: EVMD has aborted due to an internal error. Check the EVMD log file to determine the cause.

Action: Determine whether the EVMD is auto-started.

CRS-1601 CSSD Reconfiguration complete. Active nodes are %s.Cause: A node joined or left the cluster

Action: None

CRS-1602 CSSD aborted on node %s. Error [%s]. Details in %s.Cause: The CSS daemon aborted on the listed node with the listed return code

Action: Collect the CSS daemon logs from all nodes and any CSS daemon core files and contact Oracle Support

H-4 Oracle Clusterware Administration and Deployment Guide

CRS-1603 CSSD on node %s shutdown by user.Cause: The CSS daemon on the listed node was terminated by a user

Action: None

CRS-1604 CSSD voting file is offline: %s. Details in %s.Cause: The listed voting file became unusable on the local node

Action: Verify that the file system containing the listed voting file is available on the local node

CRS-1605 CSSD voting file is online: %s. Details in %s.Cause: The CSS daemon has detected a valid configured voting file

Action: None

CRS-1606 CSSD Insufficient voting files available [%s of %s]. Details in %s.Cause: The number of voting files has decreased to a number of files that is insufficient.

Action: Locate previous 1604 messages and take action as indicated for message 1604

CRS-1607 CSSD evicting node %s. Details in %s.Cause: The local node is evicting the indicated node

Action: Collect the CSS daemon logs from all nodes and any CSS daemon core files and contact Oracle Support

CRS-1608 CSSD Evicted by node %s. Details in %s.Cause: The local node was evicted by the indicated node

Action: Collect the CSS daemon logs from all nodes and any CSS daemon core files and contact Oracle Support

CRS-1609 CSSD detected a network split. Details in %s.Cause: Heartbeat messages between one or more nodes were not received and one or more nodes were evicted from the cluster to preserve data integrity.

Action: Verify all network connections between cluster nodes and repair any problematic connections. If there do not appear to be any network problems:

1. Collect the CSS daemon logs, system messages and any CSS daemon core files from all nodes

2. Contact Oracle Support

CRS-1610 node %s (%d) at 90%% heartbeat fatal, eviction in %d.%03d secondsCause: Did not receive heartbeat messages from the node. This could be due network problems or failure of listed node.

Action: Check if the private interconnect network used by cluster is functioning properly, including all the cables, network cards, switches/routers and so on between this node and listed node. Correct any problems discovered.

CRS-1611 node %s (%d) at 75%% heartbeat fatal, eviction in %d.%03d secondsCause: Did not receive heartbeat messages from the node. This could be due to network problems or failure of the listed node.

Action: Check if the private interconnect network used by cluster is functioning properly, including all the cables, network cards, switches/routers and so on between this node and listed node. Correct any problems discovered.

Oracle Clusterware Alert Messages H-5

CRS-1612 node %s (%d) at 50%% heartbeat fatal, eviction in %d.%03d secondsCause: Did not receive heartbeat messages from the node. This could be due to network problems or failure of listed node.

Action: Check if the private interconnect network used by cluster is functioning properly, including all the cables, network cards, switches/routers and so on between this node and listed node. Correct any problems discovered.

CRS-1613 voting device hang at 90%% fatal, termination in %u ms, disk (%d/%s)Cause: Voting device I/O has not completed for a long time. This could be due some error with the device voting file is on or in some element in the path of the I/O to the device.

Action: Verify if the device is working properly including all the I/O paths. Voting file listed will be considered inactive in the number of seconds specified. Failure of a majority of devices would result in node reboot.

CRS-1614 voting device hang at 75%% fatal, termination in %u ms, disk (%d/%s)Cause: Voting device I/O has not completed for a long time. This could be due some error with the device voting file is on or in some element in the path of the I/O to the device.

Action: Verify if the device is working properly including all the I/O paths. Voting file listed will be considered inactive in the number of seconds specified. Failure of a majority of devices would result in node reboot.

CRS-1615 voting device hang at 50%% fatal, termination in %u ms, disk (%d/%s)Cause: Voting device I/O has not completed for a long time. This could be due some error with the device voting file is on or in some element in the path of the I/O to the device.

Action: Verify if the device is working properly including all the I/O paths. Voting file listed will be considered inactive in the number of seconds specified. Failure of a majority of devices would result in node reboot.

CRS-1801 Cluster %s configured with nodes %s.Cause: None

Action: None

CRS-1802 Node %s added to cluster.Cause: None

Action: None

CRS-1803 Node %s deleted from cluster.Cause: None

Action: None

CRS-1804 Node %s upgraded to version %s.Cause: None

Action: None

CRS-1901 CRS service setting (%s) is changed from [%s] to [%s].Cause: None

Action: None

H-6 Oracle Clusterware Administration and Deployment Guide

CRS-2001 memory allocation error when initiating the connection failed to allocate memory for the connection with the target process Cause: Failed to allocate memory for the connection with the target process.

Action: None.

CRS-2002 connection by user %s to %s refusedCause: User command cannot connect to the target process.

Action: The user may not have sufficient privilege to connect.

CRS-2003 error %d encountered when connecting to %sCause: Connection to the target process failed.

Action: Examine whether the connection is made properly. Retry again at a later time if necessary.

CRS-2004 error %d encountered when sending messages to %sCause: User command cannot communicate with the target process properly.

Action: Retry again at a later time.

CRS-2005 timed out when waiting for response from %dCause: The target process does not return acknowledgment in time.

Action: Retry again at a later time.

CRS-2006 failed to get response from %dCause: The target process failed to return acknowledgement.

Action: Retry again at a later time.

CRS-2007 invalid component key name <%s> usedCause: The given component key name could not be recognized.

Action: re-run the command with a valid component key name.

CRS-2008 invalid message type <%d> usedCause: An unrecognized message type was sent.

Action: Retry with a valid command again.

CRS-2009 unable to get authentication for user %sCause: Current user was not authenticated for connection.

Action: Log in as another user and try again.

Index-1

Index

Aaction program

defined, 5-2action scripts

xclock example, 5-9action_script, 5-8ACTION_SCRIPT attribute, 5-5ACTIVE_PLACEMENT attribute, 5-5administrative tools

overview and concepts, 1-10alert messages

CRSD, F-7Event Manager (EVM), F-7log file location, F-7

application placement policies, 5-11application profile attributes

overview, 5-1summary of attributes, 5-5

application profilescheck_interval attribute, 5-5

applicationsaccess and usage privileges, 5-2action scripts timeout, 5-7adding to OCR, 5-14attributes described for, 5-2command syntax in profiles, D-1defining a virtual IP address, 1-12highly available, 1-12, 5-1managing with CRS commands, D-1naming, 5-6resource dependency, 5-2resource entities, 5-2responding to status changes, 5-1scripts, 5-8storing profiles in OCR, 5-2template files for profiles, 5-2TYPE attribute, 5-7

architecture, 1-1

Bbackground processes, 1-5, 1-6backups

voting disks, 2-1balanced placement

application profiles, 5-6benefits

of cloning Oracle Clusterware, 3-2block devices

support for the OCR, 2-3block-by-block checksum operation, E-13, F-11booting

disabling Oracle Clusterware daemons, F-6

CCache Fusion

communication, C-3changing VIP addresses, 2-12check action

script timeout, 5-7check entry point

resource action program, 5-5check_interval attribute

crs_profile example, 5-8, 5-19, D-4description, 5-5, 5-8-o flag for crs_profile, D-6

checksum operationblock-by-block, E-13, F-11

client-side diagosability infrastructure (CRSD)alerts, F-7

clone.pl scriptenvironment variables, 3-9

clone.pl script variables, 3-8, 3-12cloning, 1-11, 3-1

benefits, 3-2clone.pl script variables, 3-8, 3-12creating Oracle Clusterware environments on

Linux and UNIX, 3-3CRS home, 3-5CRS home to more nodes, 3-10log files, 3-15Oracle Clusterware, 3-1preparation phase, 3-3running orainstRoot.sh script, 3-10running the CRS_home/root.sh script, 3-10setting CRS home, 3-9setting ORACLE_BASE, 3-9setting ORACLE_HOME, 3-9setting ORACLE_HOME_NAME, 3-9

clscrs_reslist resource list, D-35

Index-2

CLSD-1009 messageresolving, 2-7

CLSD-1011 messageresolving, 2-7

cluster interconnectCache Fusion communication, C-3changing private network addresses, 2-14

Cluster Manager (CSS)log files, F-8

cluster node membershipmanaging with ocssd, 1-5

Cluster Ready Services (CRS)debugging, F-3resources managed, 1-2, 1-6 See Also Oracle Clusterware

Cluster Ready Services Control (CRSCTL)see crsctl

Cluster Ready Services Daemonlog files, F-8

cluster resourceslist of, 1-2, 1-6

cluster restartupon ocssd failure, 1-5

cluster storagerecording with OCR, 1-4

Cluster Synchronization Services (CSS), 1-6debugging, F-3purpose, 1-7

Cluster Verification Utility (CVU)commands, A-1concepts, 1-9installation requirements, A-2known issues, A-9nodelist shortcuts, A-3online Help system, A-3performing tests, A-5use during cloning, 3-6using, A-1verbose mode, A-3verifications

cluster integrity, A-8connectivity, A-6installation, A-8node comparisons and, A-7Oracle Clusterware component, A-8storage, A-5system requirements, A-5user and permissions, A-7

CLUSTER_INTERCONNECT interfacespecifying with OIFCFG, C-3

clusterscreating through cloning, 3-5extending Oracle Clusterware to more

nodes, 3-10cluvfy comp nodecon -n all, 2-14compatibility

clusterware, ASM, and database, 1-8component parameter

supplying to crsctl commands, F-4components

initialization file for debugging purposes, F-5configurations

reinitializing the OCR, 2-10storage in OCR, 5-2

configuringnodes, 1-4voting disks, 2-1

creating a new clusterusing cloning, 3-5

CRS commands, D-2crs_getperm command, D-3crs_profile command, D-4crs_register command, D-7

permission required, D-2crs_relocate command, D-9

permission required, D-2crs_setperm command, D-12crs_start command, D-15

permission required, D-2crs_stat command, D-12

security and permissions, D-2crs_stop command, D-16

permission required, D-2crs_unregister command, D-17

permission required, D-2overview, 5-2, D-1permissions required, D-2syntax, D-1using, 5-14 to 5-19write permissions required, D-2

CRS home, 3-9cloning to another node in the cluster, 3-10cloning to create a new cluster, 3-5

CRS_home, 1-9CRS_home/root.sh script

cloning, 3-10crs_postman script, 5-8

check_interval attribute, 5-8optional_resources parameter, 5-8required_resources parameter, 5-8

crsctlchecking the Oracle Clusterware status, 2-5

crsctl add resource commandadd applications to OCR, 5-14

crsctl commandscomponent parameter, F-4debug log, F-3debugging_level parameter, F-4disabling Oracle Clusterware daemons, F-6enabling Oracle Clusterware, F-6lsmodules, F-4module_name parameter, F-3overview, 1-10startup and shutdown, F-5

crsd background processalert messages, F-7log files, F-8purpose, 1-5

cruser programWindows systems, 5-14

Index-3

CSSlog files, F-8

cssd log fileslocation, F-8

CVUSee also Cluster Verification Utility (CVU)

Ddaemons

oprocd, 1-7starting Oracle Clusterware, F-6

dd commandbacking up voting disks, 2-1restoring voting disks, 2-2

debuggingCRS, CSS, and EVM modules, F-3

debugging_level parametersupplying to crsctl commands, F-4

defining network interfacesOIFCFG command-line interface, C-1

deleting Oracle Clusterware home, 4-7delif command

OIFCFG command-line interface, C-5dependencies

amongst resources, 5-2DESCRIPTION attribute

application placement, 5-5devices

OCR support for block, 2-3diagcollection.pl script, F-7diagnostics collection script, F-7diagnostics data

collecting with the diagcollection.pl script, F-7disabling Oracle Clusterware, F-6Domain Name Service, 1-5

Eenabling debugging for the CRS, CSS, and EVM

modules, F-3enabling Oracle Clusterware daemons

starting Oracle Clusterware, F-6environment variables

passed to the clone.pl script, 3-9error messages

for management tools, G-1, H-1Event Manager (EVM)

alert messages, F-7daemon, 1-5debugging, F-3evmd background process, 1-5log files, F-8overview, 1-6

EVMSee Also Event Manager (EVM)

evmd background process, 1-5

FFAILOVER_DELAY attribute, 5-5

application placement, 5-5failure threshold, 5-5FAILURE_INTERVAL attribute, 5-5

application placement, 5-5FAILURE_THRESHOLD attribute, 5-5

application placement, 5-5failures

detecting, 5-5tracking, 5-5

FAN server calloutsmanaging, 1-5

Fast Application Notificationsee FAN

favored placement, 5-6features, new, xv

Ggetif command

OIFCFG command-line interface, C-4global interface

network interface stored as, C-2Global Services Daemon (GSD)

set up during cloning, 3-10Grid control

cloning Oracle Clusterware, 1-11grid environment

cloning Oracle Clusterware in, 3-1

HH.A.R.D.

implementing for the OCR, 2-11hardware requirements, 1-3high availability

and Oracle Clusterware, 1-6application programming interface, 1-12applications in Oracle Clusterware

framework, 5-1framework, 1-12

HOSTING_MEMBERS attribute, 5-6application placement, 5-6

Iiflist command

OIFCFG command-line interface, C-4importing

Oracle Cluster Registry (OCR), 2-10initialization files

creating for the component level debugging, F-5installation

introduction, 1-7Oracle Clusterware, 1-9

installationsconfiguring voting disks, 2-1

Interconnects pagemonitoring clusterware with Oracle Enterprise

Manager, F-1INVENTORY_LOCATION parameter

in clone.pl script, 3-12

Index-4

IP addressinstallation requirements, 1-9

IP addresseschanging private, 2-16

LLinux systems

cloning to create Oracle Clusterware environments, 3-3

Oracle Clusterware processes, 1-5listeners

cluster resource, 1-2, 1-6in the OCR, 1-4

log filesfor CSS, F-8for EVM, F-8for OCR, F-8

lsmodules parameteron the crsctl command, F-4

Mmanaging applications

CRS commands, D-1managing Oracle Clusterware

with CRSCTL, 1-10mass deployment

cloning, 1-11, 3-1membership

managing cluster nodes, 1-5messages

errors for management tools, G-1, H-1mirroring

Oracle Cluster Registry, 2-3module_name parameter

supplying to crsctl commands, F-3modules

CRSdebugging, F-3

CSSdebugging, F-3

EVMdebugging, F-3

OCRdebugging, F-3

monitorsoprocd process, 1-5

multiplexed OCR, 1-4multiplexing

Oracle Cluster Registry, 1-4

NNAME attribute, 5-6

application placement, 5-6Network Attached (NAS) storage, 1-4network interface

global, C-2node-specific, C-2OIFCFG syntax, C-3

network interfacesdefining with OIFCFG, C-1types, C-3updating subnet classification, C-1

new features, xvnode application, 1-5nodeapp, 1-5nodes

connected to a private network, 1-4deleting Oracle Clusterware home, 4-7placement of application resources, 5-6placement of resources, 5-5restarting resources, 5-6virtual IP addresses, 1-5

nodes that can host a resource, 5-6node-specific interface

network interface stored as, C-2

OOCR

See Oracle Cluster Registry (OCR)troubleshooting, F-11

OCR (Oracle Cluster Registry)OCRCHECK utility, E-13OCRDUMP utility command examples, E-15troubleshooting, E-12viewing content with OCRDUMP, E-14

OCR configuration toolsee OCRCONFIG command

OCR recordslog file location, F-8

OCRCHECKdiagnosing OCR problems with, 2-9

OCRCHECK utilitychanging the amount of logging, E-13, F-11log files, E-13, F-11overview, E-13, F-11sample output, E-13

OCRCONFIG commandoverview and concepts, 1-11

OCRCONFIG utilitycommands

-add, E-3-backuploc, E-3-delete, E-4-downgrade, E-5-export, E-5-import, E-5-manualbackup, E-6-overwrite, E-6-repair, E-6-replace, E-7-restore, E-8-showbackup, E-9-upgrade, E-10

log files, E-2, E-16syntax, E-2

OCRDUMP utilitychanging the amount of logging, E-14, F-9

Index-5

command examples, E-15, F-10commands, E-15

backup, E-15diagnosing OCR problems with, 2-9, E-14, F-9log files, E-14, F-9sample output, E-15, F-10syntax and options, E-14, F-9SYSTEM.language key output, E-15, F-10SYSTEM.version key output, E-15, F-10

ocr.loc file, 2-4ocrlog.ini file

editing, E-13, E-14, F-9, F-11ocssd

purpose, 1-5ocssd background process, 1-5

failure, 1-5OIFCFG command-line interface

commands, C-2 to C-5interface types, C-3invoking, C-1overview and concepts, 1-11, C-1syntax, C-2

OLR (Oracle Local Registry)viewing content with OCRDUMP, E-14

OLSNODES commandreference, B-1

operating systemsrequirements for Oracle Clusterware, 1-3

OPMD, Oracle Process Manager Daemon, 1-6OPROCD

location of log files, F-8oprocd process

on Linux and UNIX systems, 1-5See Also Oracle Process Monitor Daemon

(OPROCD)OPTIONAL_RESOURCES attribute, 5-6optional_resources parameter, 5-8ora prefix

naming Oracle resources, 5-3Oracle Cluster Registry

contents, 2-3sample output, F-11troubleshooting, 2-9

Oracle Cluster Registry (OCR)adding, 2-3, 2-4adding applications, 5-14application profiles, 5-2backup and recovery, 2-7common problems and solutions, F-11debugging, F-3diagnosing problems with OCRDUMP, 2-9downgrading, 2-11exporting, 2-10implementing H.A.R.D., 2-11importing

on Linux and UNIX systems, 2-10on Windows systems, 2-11

installation, 1-9log file location, F-8log files, F-8

managing, 2-3node applications set up during cloning, 3-10OCRCHECK utility, F-11ocrdump command examples, F-10ocr.loc file, 2-4overriding data loss protection mechanism, 2-6recording cluster configuration information, 1-3recording cluster storage, 1-4redundancy, 1-9removing, 2-3, 2-6repairing, 2-3, 2-5replacing, 2-3, 2-5restoring

on Linux or UNIX systems, 2-8on Windows systems, 2-9using automatically generated OCR

backups, 2-8sample output, F-10troubleshooting, F-9upgrading, 2-11viewing content with OCRDUMP, F-9

Oracle Clusterwareadding a home to a new node, 4-3Cluster Ready Services (CRS) background

process, 1-6starting and stopping, F-5

Oracle Clusterware commandsSee CRS commands

Oracle Clusterware homeadding, silent mode, 4-6deleting from a failed node, 4-9deleting manually, 4-7

Oracle Enterprise ManagerCluster Database page, 9-xvioverview and concepts, 1-10using the Interconnects page to monitor Oracle

Clusterware, F-1Oracle Interface Configuration tool

see OIFCFGOracle Notification Service (ONS)

purpose, 1-7Oracle Notification Services (ONS)

set up during cloning, 3-10Oracle Process Manager Daemon (OPMD), 1-6Oracle Process Monitor Daemon (OPROCD)

log files, F-8processes and services, 1-7purpose, 1-7See Also oprocd process

Oracle Real Application Clustersoverview of administration, 1-1

Oracle resourcescreating with the SRVCTL utility, 5-8

Oracle Universal InstallerOracle Clusterware installation, 1-7

ORACLE_BASE environment variable, 3-9ORACLE_HOME environment variable, 3-9ORACLE_HOME_NAME environment variable, 3-9OracleCRService

purpose, 1-6

Index-6

OracleCRSToken_user servicecreating, 5-14

OracleCSServicepurpose, 1-6

OracleEVMServicepurpose, 1-6

OraFenceServicepurpose, 1-6

orainstRoot.sh script, 3-10

Ppermissions

CRS commands, D-1listing for a resource with crs_getperm, D-3required for CRS commands, D-2

phasescloning preparation, 3-3

PLACEMENT attribute, 5-6placement of a resource, 5-5private interconnects

connecting nodes, 1-4private network address

changing, 2-14private network addresses

changing the network interface for, 2-17private network IP address

change when a new subnet is used, 2-16on one node

IP addresseschanging in the same subnet, 2-15

privilegesfor applications, 5-2

PRKO-2117 error message, 2-14process manager daemon, OPMD, 1-6process monitor

oprocd process, 1-5profiles

templates for applications, 5-2public interface

specifying with OIFCFG, C-3

RRACG

purpose, 1-7racgevt process, 1-5recording cluster configuration information, 1-3recording node membership information

voting disk, 1-3redundancy

OCR, 1-9voting disk, 1-4, 1-9

REQUIRED_RESOURCES attribute, 5-6required_resources parameter, 5-8resource file

application profile, 5-2resource profiles

attributes, 5-4resources

application profile, 5-2attempts to restart, 5-6check entry point, 5-5defined, 5-2delaying failover, 5-5dependencies, 5-2describing, 5-5list of host nodes, 5-6list of optional, 5-6list of required resource names, 5-6list operations, D-35marking as unavailable, 5-5naming, 5-3operations, D-34placement policy, 5-6placement when restarting a cluster node, 5-5required, 5-6script to start and stop, 5-5times restarted, 5-6UPTIME_THRESHOLD attribute, 5-7

RESTART_ATTEMPTS attribute, 5-6RESTART_COUNT attribute, 5-6restoring

OCR, 2-10restricted placement, 5-6

Ss_OcrVdskMirror1RetVal parameter

in clone.pl script, 3-13s_VdskMirror2RetVal parameter

in clone.pl script, 3-13s_votingdisklocation parameter

in clone.pl script, 3-13scalability

adding nodes and instances, quick-start format, 4-3, 4-10

SCRIPT_TIMEOUT attribute, 5-7scripts

action_script, 5-8ACTION_SCRIPT attribute, 5-5CRS_home/root.sh, 3-10crs_postman, 5-8orainstRoot.sh, 3-10

securityCRS commands, D-1

Server Control (SRVCTL) utilityusage on Oracle resources, 5-8

Server Control Utility (SRVCTL), 1-4overview and concepts, 1-10

serversOracle Clusterware requirements, 1-3

servicesOracleCSService on Windows systems, 1-6OracleEVMService on Windows systems, 1-6OraFenceService on Windows systems, 1-6

setif commandOIFCFG command-line interface, C-4

shutdownusing crsctl commands, F-5

Index-7

software requirements, 1-4SRVCTL

see Server Control Utility (SRVCTL)srvctl stop nodeapps command, 2-13start action

timeout, 5-7START_TIMEOUT attribute, 5-7starting the OIFCFG interface, C-1startup

using crsctl commands, F-5status

application responsiveness to changes, 5-1stop action

timeout, 5-7STOP_TIMEOUT attribute, 5-7storage

application profiles, 5-2Storage Area Network (SAN) storage, 1-4subnet

configuring for virtual IP address, 1-5syntax

CRS commands, D-1OCRDUMP utility, E-14, F-9

SYSTEM.language keyoutput, E-15, F-10

SYSTEM.version keyoutput, E-15, F-10

Ttemplate files

for application profiles, 5-2timeout

action programs, 5-7start action, 5-7

tracingenabling for Oracle RAC high availability, F-11

troubleshootingOCR, E-12, F-11

TYPE attribute, 5-7

UUNIX systems

cloning to create Oracle Clusterware environments, 3-3

Oracle Clusterware processes, 1-5UPTIME_THRESHOLD attribute, 5-7USR_ORA user-defined attribute names, 5-14

Vversions

compatibility for clusterware, ASM, and Oracle Database software, 1-8

VIP addresses, changing, 2-12VIPCA utility, 1-5Virtual Internet Protocol (VIP) address

changing, 2-12Virtual IP (VIP)

set up during cloning, 3-10

virtual IP address, 1-9defining for applications, 1-12requirements, 1-5

voting disks, 1-4administering, 2-1backing up, 2-1installing, 1-9maximum number of, 1-4recovering, 2-2

WWindows crsuser program, 5-14Windows systems

services for clusterware, 1-6write permissions

required for CRS commands, D-2

Xxclock action script

example, 5-9

Index-8