Use this MAP to resolve SCSI RAID adapter, cache, or drive problems.
- Step 0270-1
- If the system displayed a FRU part number on the screen, use that part
number. If there is no FRU part number displayed on the screen, refer to the
SRN listing. Record the SRN source code and the failing function codes in
the order listed.
- Find the failing function codes in the FFC listing, and record the FRU
part number and description of each FRU.
Go to Step 0270-2.
- Step 0270-2
Is the FRU a RAID drive?
- NO
- Go to Step 0270-6.
- YES
- Go to Step 0270-3.
- Step 0270-3
If the RAID drive you want to replace is not already in the failed state,
ask the customer to run the PCI SCSI Disk Array Manager using smit to
fail the drive that you want to replace. An example of this procedure is:
- Log in as root user.
- Type smit pdam.
- Select Fail a Drive in a PCI SCSI Disk Array.
- Select the appropriate disk array by placing the cursor over that array
and press Enter.
- Select the appropriate drive to fail based on the Channel and ID called
out in diagnostics. The Fail a Drive screen will appear.
- Verify that you are failing the correct drive by looking at the Channel
ID row. Press Enter when verified correct. Press Enter again.
- Press F10 to exit, then type smit pdam.
- Select .
- Select the drive that just failed.
Go to Step 0270-4.
- Step 0270-4
Replace the RAID drive using the RAID HOT PLUG DEVICES service aid:
Note: The drive you want to replace must be either a SPARE or FAILED drive.
Otherwise, the drive is not listed as an IDENTIFY AND REMOVE RESOURCES
selection within the RAID HOT PLUG DEVICES screen. In that case, ask the
customer to put the drive into the FAILED state. For information on putting
the drive in a FAILED state, refer the customer to the
SAS RAID Controller Reference Guide for AIX®.
- Select the option RAID HOT PLUG DEVICES within
the HOT PLUG TASK under DIAGNOSTIC SERVICE AIDS.
- Select the RAID adapter that is connected to the RAID array containing
the RAID drive you want to remove, then select COMMIT.
- Choose the option IDENTIFY in the IDENTIFY AND
REMOVE RESOURCES menu.
- Select the physical disk which you want to remove from the RAID array
and press Enter. The disk will go into the IDENTIFY state, indicated by a
flashing light on the drive.
- Verify that it is the physical drive you want to remove, then press Enter.
- At the IDENTIFY AND REMOVE RESOURCES menu, choose the option REMOVE and
press Enter. A list of the physical disks in the system that may be removed
will be displayed.
- If the physical disk you want to remove is listed, select it and press
Enter. The physical disk will go into the REMOVE state, as indicated by the
LED on the drive. If the physical disk you want to remove is not listed, it
is not a SPARE or FAILED drive. Ask the customer to put the drive in the FAILED
state; you cannot remove it until this is done. For information on putting the
drive in a FAILED state, refer the customer to the SAS RAID Controller Reference Guide for AIX.
- Refer to the service guide for the system unit or enclosure that contains
the physical drive for removal and replacement procedures for the following
substeps:
- Remove the old hot-swap RAID drive.
- Install the new hot-swap RAID drive. After the hot-swap drive is in place,
press Enter. The drive will exit the REMOVE state, and will go to the NORMAL
state after you exit diagnostics.
Note: There are no elective tests to run
on a RAID drive itself under diagnostics (the drives are tested by the RAID
adapter).
Go to Step 0270-5.
- Step 0270-5
If the RAID array did not begin reconstructing automatically, perform the
following steps.
Adding a Disk to the RAID array and Reconstructing:
Ask the customer to run the PCI SCSI Disk Array Manager using smit. An
example of this procedure is:
- Log in as root user.
- Type smit pdam.
- Select Change/Show PCI SCSI RAID Drive Status.
- Select Add a Spare Drive.
- Select the appropriate adapter.
- Select the channel and ID of the drive that was replaced.
- Press Enter when verified.
- Press F3 until you return to the Change/Show PCI SCSI RAID
Drive Status screen.
- Select Add a Hot Spare.
- Select the drive you just added as a spare.
If there was no hot spare previously installed in the array, the array
will begin reconstructing immediately. Reconstruction time will vary based
on the size of the RAID array. Allow 1-2 hours for completion.
To check the progress of the reconstruction:
- Log in as root user.
- Type smit pdam.
- Select List PCI SCSI RAID Arrays.
- Choose the array containing the drive you replaced.
If the state of the RAID array is reconstructing, the reconstruction is
still in progress. If the state is optimal, the reconstruction has
completed.
- Press F10 to exit.
Go to Step 0270-17.
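The state check described above can be sketched as a small helper. This is only an illustrative sketch: the function name reconstruction_status is hypothetical, and it assumes the array state has already been read from the List PCI SCSI RAID Arrays screen and typed in as text. Only the two states named in this step are interpreted; any other value is reported as unknown.

```python
def reconstruction_status(array_state: str) -> str:
    """Interpret the array state shown by List PCI SCSI RAID Arrays.

    Hypothetical helper: only the two states named in Step 0270-5
    ("reconstructing" and "optimal") are recognized.
    """
    state = array_state.strip().lower()
    if state == "reconstructing":
        return "in progress"   # reconstruction is still running
    if state == "optimal":
        return "complete"      # reconstruction has finished
    return "unknown"           # state not covered by this step


if __name__ == "__main__":
    for shown in ("Reconstructing", "optimal"):
        print(shown, "->", reconstruction_status(shown))
```

An "unknown" result means this step does not describe that state, so the helper makes no claim about it.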
- Step 0270-6
Is the FRU a RAID adapter base card, RAID adapter cache card, or RAID adapter
battery?
- NO
- Go to Step 0270-15.
- YES
- Go to Step 0270-7.
- Step 0270-7
Do you want to change the FRU using a hot-swap operation?
- NO
- Power off the system, and remove the RAID adapter. Go to Step 0270-8.
- YES
- Remove the RAID adapter. Go to Step 0270-8.
- Step 0270-8
Is the FRU you want to replace a RAID adapter cache card or RAID adapter
battery?
- NO
- Go to Step 0270-10.
- YES
- Go to Step 0270-9.
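The branching in Steps 0270-2, 0270-6, and 0270-8 can be summarized as a short routing function. This is a sketch for orientation only: the FruKind enumeration and the route function are hypothetical names, and the step numbers are taken directly from the YES/NO branches above.

```python
from enum import Enum


class FruKind(Enum):
    # Hypothetical labels for the FRU categories this MAP distinguishes.
    RAID_DRIVE = "RAID drive"
    BASE_CARD = "RAID adapter base card"
    CACHE_CARD = "RAID adapter cache card"
    BATTERY = "RAID adapter battery"
    OTHER = "other RAID FRU"


def route(kind: FruKind) -> list:
    """Return the steps visited, ending at the first replacement step."""
    if kind is FruKind.RAID_DRIVE:
        return ["0270-2", "0270-3"]            # Step 0270-2: YES
    if kind is FruKind.OTHER:
        return ["0270-2", "0270-6", "0270-15"]  # Step 0270-6: NO
    # Adapter FRUs pass through Step 0270-7 (hot-swap choice) to 0270-8.
    path = ["0270-2", "0270-6", "0270-7", "0270-8"]
    if kind is FruKind.BASE_CARD:
        path.append("0270-10")                  # Step 0270-8: NO
    else:
        path.append("0270-9")                   # Step 0270-8: YES
    return path


if __name__ == "__main__":
    for kind in FruKind:
        print(kind.value, "->", " -> ".join(route(kind)))
```

The hot-swap answer in Step 0270-7 does not change the route, only whether the system is powered off first, so it is not a parameter here.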
- Step 0270-9
Install the replacement FRU onto the existing base card.
Go to Step 0270-11.
- Step 0270-10
After physically removing the base card from the system, remove any other
good FRUs (RAID cache card or cache battery) from the RAID base card adapter.
Plug these FRUs onto the replacement RAID base card adapter FRU.
Go to Step 0270-11.
- Step 0270-11
Did you change the FRU using a hot-swap operation?
- NO
- Install the RAID adapter assembly into the system. Power on the system
and log in to AIX. Go to Step 0270-12.
- YES
- Install the RAID adapter assembly into the system. Go to Step 0270-12.
- Step 0270-12
- Step 0270-13
Attention: Before cabling the SCSI RAID adapter to the subsystem, check for
preexisting configurations on the replacement SCSI RAID base card. If the
replacement base card already has a configuration written to it, it can
overwrite your system's configuration data. Check it before cabling the SCSI
RAID subsystem array.
Ask the customer to check for a preexisting configuration on the SCSI RAID
base card. Below is an example of this procedure:
- Log in as root (if not already root).
- Type smit pdam.
- Select List PCI SCSI RAID Arrays.
- If no RAID arrays are listed, then there are no preexisting configurations
on the base card.
- Press F10 key to exit.
If a preexisting configuration exists on the base card, ask the customer
to run the PCI SCSI Disk Array Manager using smitty.
- Log in as root (if not already root).
- Type smit pdam from the AIX command prompt (if not already in the
RAID manager).
- Select Recovery Options.
- Select Clear PCI SCSI RAID Adapter Configuration.
Select the adapter that you just installed. Press Enter to confirm.
- Return to the Recovery Options menu (if not already
there). Select Resolve PCI SCSI RAID Adapter Configuration.
Select Accept Configuration on Drives. Select the
adapter that you just installed. Press Enter to confirm. The configuration
on the new adapter should now match the configuration that exists on the drives.
- Press F10 to exit.
You may now proceed to cable the RAID system array.
Go to Step 0270-16.
- Step 0270-14
Ask the customer
to resynchronize the RAID array configuration:
- Log in as root (if not already root).
- Type smit pdam.
- Select Recovery Options.
- Select Resolve PCI SCSI RAID Adapter Configuration.
- Select Retry Current Configuration.
- Select the appropriate scraid (SCSI RAID) adapter. A message will be displayed
as to the success of the operation.
- Press F10 to exit.
Go to Step 0270-16.
- Step 0270-15
Other RAID FRUs require
that the system be shut down prior to replacement.
- If the operating system is running, perform the operating system shutdown
procedure (get help if needed).
- Turn off the system power.
- Replace the FRU indicated by the FFC.
Go to Step 0270-16.
- Step 0270-16
Run the diagnostics
in system verification mode on the RAID subsystem.
- Step 0270-17
- Use the option Log Repair Action in the TASK SELECTION
menu to update the AIX error log. Select scraidX (where X is
the RAID adapter number of the RAID subsystem you have been working on).
Note: On systems with fault indicator LED, this changes the fault indicator
LED from the Fault state to the Normal state.
- While in diagnostics, go to the FUNCTION SELECTION menu. Select the option Advanced
Diagnostics Routines.
- When the DIAGNOSTIC MODE SELECTION menu displays, select the option System
Verification. Run the diagnostic test on scraidX (where X is
the RAID adapter number).
Did the diagnostics run with no trouble found?
- NO
- Go to Step 0270-18.
- YES
- If you changed the service processor or network settings, restore the
settings to the value they had prior to servicing the system.
This completes the repair; return the system to the
customer. Go to Closing a service call.
- Step 0270-18
Have you exchanged all the FRUs that correspond to the failing function
codes?
- NO
- Go to Step 0270-19.
- YES
- The SRN did not identify the failing FRU. Schedule a time to run diagnostics
in service mode. If the same SRN is reported in service mode, go to MAP 0030: Additional problem determination.
- Step 0270-19
Note: Before proceeding, remove the FRU you just replaced and install the
original FRU in its place.
Use the next FRU on the list and go to Step 0270-2.
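The exchange loop formed by Steps 0270-17 through 0270-19 can be sketched as follows. This is a hedged illustration, not part of the service procedure: exchange_frus and both callbacks are hypothetical. install_and_verify stands for replacing one FRU and running system verification (True means no trouble found), and restore_original stands for reinstalling the original part as described in Step 0270-19.

```python
from typing import Callable, Optional, Sequence


def exchange_frus(
    frus: Sequence[str],
    install_and_verify: Callable[[str], bool],
    restore_original: Callable[[str], None],
) -> Optional[str]:
    """Try each FRU in the recorded order; return the one that fixed the fault.

    Returns None when every listed FRU has been exchanged without success,
    which corresponds to the YES branch of Step 0270-18.
    """
    for index, fru in enumerate(frus):
        if install_and_verify(fru):
            return fru                      # Step 0270-17: no trouble found
        if index < len(frus) - 1:
            restore_original(fru)           # Step 0270-19, before the next FRU
    return None                             # SRN did not identify the FRU


if __name__ == "__main__":
    # Placeholder FRU names; pretend the second exchange clears the fault.
    result = exchange_frus(
        ["first FRU", "second FRU"],
        install_and_verify=lambda fru: fru == "second FRU",
        restore_original=lambda fru: print("reinstall original for", fru),
    )
    print("repaired by:", result)
```

A None result maps to the instruction in Step 0270-18 to schedule diagnostics in service mode.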