MAP 0050: SCSI bus problems
Use this MAP to analyze problems with a SCSI bus.
Purpose of this MAP
Use this MAP to analyze problems
with a SCSI bus.
Considerations
- To prevent hardware damage or erroneous diagnostic results from a system
with its power turned on, use "PCI hot-plug manager" subtask "replace/remove
PCI hot plug adapter" before connecting or disconnecting cables or devices.
- Also, use this MAP for SCSI adapters that are built into system backplanes
or I/O backplanes. If this procedure calls for replacing a SCSI adapter and
the SCSI adapter is built into the system backplane or I/O backplane, replace
the system backplane or I/O backplane as appropriate.
- If the failure is a terminator power failure (SRNs xxx-226, xxx-240, xxx-800),
always allow five minutes for the PTC to cool.
- The differential version of the adapter has socket-type terminators to
support high-availability. If this is the adapter's configuration, the terminators
would have been removed from the adapter. MAP steps requiring the removal
of the cable from the adapter are inapplicable, since an adapter that is not
terminated always fails diagnostics. Proper SCSI diagnostics require proper
termination. If the configuration involves a Y-cable, leave it, with the appropriate
terminator, attached to the adapter. Or, place an external differential terminator
on the external port.
- If the system uses shared disk-drive hardware or a high-availability configuration,
ensure that the other system that is sharing the devices is not using the
devices. For additional information concerning high-availability configurations,
see SCSI service hints.
- For intermittent problems that cannot be resolved with this MAP, refer
to SCSI service hints.
- If the SCSI bus is attached to a RAID subsystem, refer to the RAID subsystem
documentation for any problem determination. If the RAID adapter is a PCI-X
RAID adapter, refer to the PCI-X SCSI RAID Controller Reference Guide
for AIX.
Follow the steps in this MAP to isolate a SCSI bus problem.
Note:
This
procedure steps you through a process to systematically remove devices and
components from a SCSI bus until a problem or a symptom or problem is eliminated.
If you go through the whole procedure you will remove all components of a
SCSI bus in the following order:
- Hot-swap devices
- Devices that are not hot-swap
- SCSI Enclosure Services (SES) device or enclosures
- SCSI cables
- SCSI adapter
Do the following:
-
Step 0050-1
Have changes been
made recently to the SCSI configuration?
- NO
- Go to Step 0050-2.
- YES
- Go to Step 0050-5.
-
Step 0050-2
Are there any hot-swap
devices (SCSI disk drives or media devices) controlled by the adapter?
- NO
- Go to Step 0050-3.
- YES
- Go to Step 0050-11.
-
Step 0050-3
Are there any devices
other than hot-swappable devices controlled by the adapter?
- NO
- Go to Step 0050-4.
- YES
- Go to Step 0050-13.
-
Step 0050-4
Is an enclosure or
drawer that supports hot-swap devices controlled by the adapter?
- NO
- Go to Step 0050-22.
- YES
- Go to Step 0050-15.
-
Step 0050-5
This step handles cases
where recent changes have been made to the SCSI configuration.
Using
the first three digits of the SRN, refer to the FFC listing and determine
if the adapter is single-ended or differential.
Is the adapter a
single-ended adapter?
- NO
- Go to Step 0050-6.
- YES
- Go to Step 0050-7.
-
Step 0050-6
The adapter's termination
jumper settings may be incorrect. Power off the system, and inspect jumper
J7.
Are the jumpers correct?
- NO
- Go to Step 0050-8.
- YES
- Go to Step 0050-9.
-
Step 0050-7
If the adapter is
not being used in a high-availability configuration, be sure sockets RN1,
RN2, and RN3 are populated.
If the adapter is being used in a
high-availability configuration, be sure sockets RN1, RN2, and RN3 are
not populated.
Go to Step 0050-9.
-
Step 0050-8
- Correct the jumper settings and reinstall the adapter and all cables.
- Power on the system, and run diagnostics in system verification mode on
the adapter.
Did the diagnostic pass?
- NO
- Go to Step 0050-9.
- YES
- Go to Step 0050-10.
-
Step 0050-9
Check for the following
problems:
- Address conflicts between devices.
- Cabling problems such as, configurations that exceed the maximum cable
lengths, missing termination, or excessive termination.
Did you find a problem?
- NO
- Go to Step 0050-2.
- YES
- Go to Step 0050-10.
-
Step 0050-10
- Correct the problem.
- Power on the system, and run diagnostics in system verification mode on
the adapter.
Did a failure occur?
- NO
- Go to Verifying a repair.
- YES
- Go to Step 0050-2.
-
Step 0050-11
This step determines
if a hot-swap device is causing the failure.
- Go to Preparing for hot-plug SCSI device or cable deconfiguration.
- Disconnect all hot-swap devices attached to the adapter.
- Go to After hot-plug SCSI device or cable deconfiguration.
- If the Missing Options menu displays, select the The resource
has been turned off, but should remain in the system configuration option
for all the devices that were disconnected.
- Run the diagnostics in system verification mode on the adapter.
Did a failure occur?
- NO
- Go to Step 0050-12.
- YES
- Go to Step 0050-3.
-
Step 0050-12
Go to Preparing for hot-plug SCSI device or cable deconfiguration.
Reconnect the hot-plug devices one at a time. After reconnecting each device,
do the follow:
- Go to After hot-plug SCSI device or cable deconfiguration.
- Rerun the diagnostics on the adapter.
- If the adapter fails, the problem may be with the last device reconnected.
Perform these substeps:
- Follow repair procedures for that last device.
- Rerun diagnostics on the adapter.
- If diagnostics fail, replace the SES backplane corresponding to the slot
for the device.
- Rerun diagnostics.
- If diagnostics fail, replace the last device.
- Rerun diagnostics on the adapter.
- If diagnostics pass, go to Verifying a repair.
Otherwise, contact your support center.
Note:
A device problem can cause other
devices attached to the same SCSI adapter to go into the defined state. Ask
the system administrator to make sure that all devices attached to the same
SCSI adapter as the device that you replaced are in the available state.
- If no errors occur, the problem could be intermittent. Make a record of
the problem. Running the diagnostics for each device on the bus may provide
additional information.
-
Step 0050-13
This step determines
if a device other than a hot-swappable device is causing the failure. Follow
these steps:
- Go to Preparing for hot-plug SCSI device or cable deconfiguration.
- Disconnect all devices attached to the adapter (except for the device
from which you boot to run diagnostics; you may want to temporarily move this
device to another SCSI port while you are trying to find the problem).
- Go to After hot-plug SCSI device or cable deconfiguration.
- If the Missing Options menu displays, select the The resource
has been turned off, but should remain in the system configuration option
for all the devices that were disconnected.
- Run the diagnostics in system verification mode on the adapter.
Did a failure occur?
- NO
- Go to Step 0050-14.
- YES
- Go to Step 0050-4.
-
Step 0050-14
Reconnect the devices
one at time. After reconnecting each device, follow this procedure:
- Rerun the diagnostics in system verification mode on the adapter.
- If there is a failure, the problem should be with the last device reconnected.
Follow the repair procedures for that device, then go to Verifying a repair.
- If no errors occur, the problem could be intermittent. Make a record of
the problem. Running the diagnostics for each device on the bus may provide
additional information.
-
Step 0050-15
This step determines
whether the SCSI Enclosure Services (SES) controller, which provides hot-plug
capability for SCSI drives in the server, is causing the problem.
Note:
In
most cases the SES controller is integrated on the backplane that is used
to connect SCSI devices, for example a disk drive backplane. If your system
has hot-plug capability and the SES controller is separate from the SCSI drive
backplane, there will be an intermediate card on the SCSI bus between the
SCSI adapter and the device or SCSI backplane. You will have to make a visual
check to see if there are any intermediate cards on the SCSI bus that is displaying
a problem.
Does a separate SES controller plug into the SCSI
device backplane?
- NO
- Go to Step 0050-18.
- YES
- Go to Step 0050-16.
-
Step 0050-16
Follow these steps:
- Power off the system.
- Remove the intermediate SES controller card. Locate the SES controller
part number under Part Information.
- Power on the system.
- If the Missing Options menu displays, select the The resource
has been turned off, but should remain in the system configuration option
for all the devices that were disconnected.
- Run the diagnostics in system verification mode on the adapter.
Did a failure occur?
- NO
- Go to Step 0050-17.
- YES
- Go to Step 0050-18.
-
Step 0050-17
Follow these steps:
- Power off the system.
- Replace the intermediate SES controller card.
- Go to Verifying a repair.
-
Step 0050-18
Follow these steps:
- Go to Preparing for hot-plug SCSI device or cable deconfiguration.
- Disconnect all cables attached to the SCSI adapter. For SCSI differential
adapters in a high-availability configuration, see Considerations.
- Go to After hot-plug SCSI device or cable deconfiguration.
- If the Missing Options menu displays, select the The resource
has been turned off, but should remain in the system configuration option
for all the devices that were disconnected.
- Run the diagnostics in system verification mode on the adapter.
Did a failure occur?
- NO
- Go to Step 0050-19.
- YES
- Replace the adapter, then go to Verifying a repair.
-
Step 0050-19
Follow these steps:
- Go to Preparing for hot-plug SCSI device or cable deconfiguration.
- Reconnect the cables to the adapter.
Does the SES controller (an intermediated SES controller) plug
into the backplane?
- NO
- Go to Step 0050-20.
- YES
- Go to Step 0050-21.
-
Step 0050-20
Follow these steps:
- Replace the SES controller. Locate the intermediate SES controller part
number under Part Information.
- Power on the system.
- If the Missing Options menu displays, select the The resource
has been turned off, but should remain in the system configuration option
for all the devices that were disconnected.
- Run the diagnostics in system verification mode on the adapter.
Did a failure occur?
- NO
- Go to Verifying a repair.
- YES
- Go to Step 0050-21.
-
Step 0050-21
One of the cables
remaining in the system is defective. Refer to Part Information for
the cable part numbers. Replace the parts one at a time in the order listed.
Follow these steps for each FRU replaced:
- Rerun the diagnostics for the adapter.
- If there is any failure, continue with the next FRU.
- If there is no failure, go to Verifying a repair.
-
Step 0050-22
Follow these steps:
- Go to Preparing for hot-plug SCSI device or cable deconfiguration.
- Disconnect all cables attached to the adapter (except for the cable to
the device from which you boot to run diagnostics; you may want to temporarily
move this device to another SCSI port while you are trying to find the problem).
- Go to After hot-plug SCSI device or cable deconfiguration.
- If the Missing Options menu displays, select the The resource
has been turned off, but should remain in the system configuration option
for all the devices that were disconnected.
- Run the diagnostics on the adapter.
Did a failure occur?
- NO
- Go to Step 0050-23.
- YES
- Replace the adapter, then go to Verifying a repair.
-
Step 0050-23
One of the cables
remaining in the system is defective. Refer to Part Information for
the cable part numbers. Replace the parts one at time in the order listed.
Follow these steps for each FRU replaced:
- Rerun the diagnostics for the adapter.
- If there is any failure, continue with the next FRU.
- If there is no failure, go to Verifying a repair.
Preparing for hot-plug SCSI device or cable deconfiguration
Use this procedure when you are preparing to deconfigure a hot-plug
SCSI device or cable. This procedure will help determine if a SCSI device
or SCSI device cable is causing your system problem.
Purpose of this procedure
Use this procedure to determine if a SCSI device or SCSI device
cable is causing the symptom.
Disconnect all cables attached to the
adapter, (except for the cable to the device from which you boot to run diagnostics;
you may want to temporarily move this device to another SCSI port while you
are trying to find the problem).
Note:
Move this device to another SCSI port
while you are trying to find the problem.
- Go to Running the online and eServer stand-alone diagnostics and
perform the prerequisite tasks described in the "Before you begin" topic.
- Determine which SCSI adapter you plan to remove the cables or devices
from.
- Adapter slots are numbered on the rear of the system unit, record
the slot number and location of each adapter being removed.
- Ensure that any processes or applications that might use the adapter
are stopped.
- Enter the system diagnostics by logging in as "root" user or as
the "celogin" user. Type the diag command
on the AIX(R) command
line.
- When the "DIAGNOSTIC OPERATING INSTRUCTIONS" menu displays, press
Enter. The "FUNCTION SELECTION menu appears.
- From the "FUNCTION SELECTION" menu, select "Task Selection", then
press Enter.
- From the "Task Selection" list, select "PCI Hot Plug Manager".
- From the PCI Hot Plug Manager menu, select "Unconfigure a Device",
then press Enter.
- Press F4 or ESC 4 to display the "Device Names" menu.
- Select the adapter from which you are removing the cables or devices
in the "Device Names" menu.
- In the "Keep Definition" field, use the tab key to answer "Yes".
- In the "Unconfigure Child Devices" field, use the Tab key to answer
"Yes", then press Enter.
- The "ARE YOU SURE" screen displays. Press Enter to verify
the information. A successful deconfiguration is indicated by the "OK" message
displayed next to the Command field at the top of the screen.
- Press F4 or ESC 4 twice to return to the "Hot Plug Manager" menu.
- Select "replace/remove PCI Hot Plug adapter".
- Select the slot that has the adapter you want to remove the cables
or devices from in the system.
- Select "remove".
Note:
A fast blinking amber LED located
at the back of the machine near the adapter indicates that the slot has been
identified.
- Press Enter. This places the adapter in the "action"
state, meaning it is ready to be removed from the system. (Don't need to remove
the adapter, unless it makes removing the cables attached to it easier).
After hot-plug SCSI device or cable deconfiguration
Use this procedure after you deconfigure or hot-plug a SCSI device
to ensure that the replaced component was successfully installed.
- Press Enter, then continue to follow the screen instructions until
you receive a message that the replacement is successful. A successful
replacement is indicated by the OK message displayed next
to the command field at the top of the screen.
- Press the F3 or ESC 3 key to return to the PCI Hot-Plug Manager
menu.
- Press the F3 or ESC 3 key to return to the TASK selection list.
- Select "Log Repair Action".
- Select the adapter you just removed the cables or devices from,
then press Enter.
- Press Commit (F7 or ESC 7), then press Enter.
- Press the F10 or the ESC 0 key to exit diagnostics.
- Type the diag -a command on the command
line.
SCSI service hints
Use one or more of the following procedures when servicing SCSI
adapter or devices.
General SCSI Configuration Checks
The following steps apply to all types of SCSI
problems:
- Verify that all SCSI devices on the SCSI bus have a unique address.
- Verify that all cables are connected securely and that there is
proper termination at both ends of the SCSI bus.
- Verify that the cabling configuration does not exceed the maximum
cable length for the adapter in use.
- Verify that the adapters and devices that you are working with
are at the appropriate microcode levels for the customer situation. If you
need assistance with microcode issues, contact your service support structure.
- If there are multiple SCSI adapters on the SCSI bus, verify that
the customer is using the appropriate software (such as HACMP(TM) or HANFS)
to support such an arrangement. If the correct software is not in use, some
SCSI errors should be expected when multiple adapters attempt to access the
same SCSI device. Also, each adapter should have a unique address.
High Availability or Multiple SCSI System Checks
If you have a high-availability configuration,
or if more than one system is attached to the same SCSI bus, do the following:
- Verify that the adapters and devices have unique SCSI addresses. The default SCSI adapter address is always 7. If you have more than
one adapter on the bus, change the address of at least one adapter. This can
be done by using SMIT (SMIT Devices -> SCSI
Adapter -> Change/Show characteristics of an adapter). You must make the changes to the database only, then reboot
the system in order for the change to take effect.
Note:
Diagnostics defaults
to using ID 7 (it is recommended that this ID not be used in high availability
configurations).
- If RAID devices such as the 7135 or 7137 are attached, be sure
to run the proper diagnostics for the device. If problems occur,
contact your service support structure for assistance. If the diagnostics
are run incorrectly on these devices, misleading SRNs can result.
- Diagnostics cannot be run against OEM devices; doing so results
in misleading SRNs.
- Verify that all cables are connected securely and that both ends
of the SCSI bus is terminated correctly.
- Verify that the cabling configuration does not exceed the maximum
cable length for the adapter in use. Refer to the SCSI Cabling
section in the RS/6000(R) eServer(TM) pSeries(R) Adapters, Devices, and Cable
Information for Multiple Bus Systems for more details on SCSI cabling
issues.
- Verify that adapter and devices are at the appropriate microcode
levels for the customer situation. If you need assistance with
microcode issues, contact your service support structure.
SCSI-2 Single-Ended Adapter PTC Failure Isolation Procedure
Before replacing a SCSI-2 single-ended adapter, use these procedures
to determine if a short-circuit condition exists on the SCSI bus. The same
positive temperature coefficient (PTC) resistor is used for both the internal
and external buses. The PTC protects the SCSI bus from high currents due to
shorts on the cable, terminator, or device. It is unlikely that the PTC can
be tripped by a defective adapter. Unless instructed to do so by these procedures,
do not replace the adapter because of a tripped PTC resistor.
A fault
(short-circuit) causes an increase in PTC resistance and temperature. The
increase in resistance causes the PTC to halt current flow. The PTC returns
to a low resistive and low temperature state when the fault is removed from
the SCSI bus or when the system is turned off. Wait 5 minutes for the PTC
resistor to fully cool, then retest.
These procedures determine if the
PTC resistor is still tripped and then determine if there is a short somewhere
on the SCSI bus.
Determining Where to Start
Use the following to determine the adapter configuration and select
the proper procedure:
- If there are external cables attached to the adapter, start with the "External
Bus PTC Isolation Procedure" for your type adapter. The procedures are found
in this chapter.
- If there are no external cables attached, start with the Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure.
- If there is a combination of external and internal cables start with the
"External Bus PTC Isolation Procedure" for your type adapter. The procedures
are found in this chapter. If this procedure does not resolve the problem,
continue with the "Internal Bus PTC Isolation Procedure" for your type adapter.
The procedures are found in this chapter.
External SCSI-2 Single-Ended Bus PTC Isolation Procedure
Isolate the external SCSI bus PTC fault with the following procedure:
Note:
The
external bus is of single-ended design.
- Ensure the system power and all externally attached device power
is turned off. All testing is accomplished with the power off.
- Disconnect any internal and external cables from the adapter and
remove the adapter from the system.
- Verify with a digital Ohmmeter that the internal PTC resistor,
labeled Z1, (refer to the illustration after Internal SCSI-2 Single-Ended
Bus PTC Isolation Procedure, step 3) is
cool and in a low resistance state, typically less than 1/2 Ohm. Measuring
across, be sure to probe both sides of the PTC where the solder joints and
board come together. The polarity of the test leads is not important. If necessary,
allow the PTC resistor to cool and measure again.
- This step determines if there is a short on the adapter. Locate Capacitor C1 and measure the resistance across it by using the
following procedure:
- Connect the positive lead to the side of the capacitor where
the + is indicated on the board near C1. Be sure to probe at the solder joint
where the capacitor and board come together.
- Connect the negative lead to the opposite side of the capacitor
marked "GND." Be sure to probe at the solder joint where the capacitor
and board come together.
- If there is no short present, then the resistance reading is
high, typically hundreds of Ohms.
Note:
Because this is a measurement across unpowered silicon devices,
the reading is a function of the Ohmmeter used.
- If there is a fault, the resistance reading is low, typically below 10
Ohms. Because there are no cables attached, the fault is on the adapter. Replace
the adapter.
Note:
Some multi-function meters label the leads specifically
for voltage measurements. When using this type of meter to measure resistance,
the plus lead and negative lead my not be labeled correctly. If you are not
sure that your meter leads accurately reflect the polarity for measuring resistance,
repeat this step with the leads reversed. If the short circuit is not indicated
with the leads reversed, the SCSI bus is not faulted (shorted).
- If the resistance measured was high, proceed to the next step.
- Reattach the external cable to the adapter, then do the following:
- Measure across C1 as previously described.
- If the resistance is still high, in this case above 10 Ohms,
then there is no apparent cause for a PTC failure from this bus. If
there are internal cables attached continue to the Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure.
- If the resistance is less than 10 Ohms, there is a possibility
of a fault on the external SCSI bus. Troubleshoot the external
SCSI bus by disconnecting devices and terminators. Measure across C1 to determine
if the fault has been removed. Replace the failing component. Go to Verifying a repair.
External SCSI-2 Single-Ended Bus Probable Tripped PTC Causes
The following list provides some suggestions of things to check
when the PTC is tripped:
- A shorted terminator or cable. Check for bent pins on each connector and
removable terminator.
- Intermittent PTC failures can be caused by improperly seated cable connectors.
Reseat the connector and flex the cable in an attempt to duplicate the fault
condition across C1.
- Plugging or unplugging a cable or terminator while the system is turned
on (hot plugging).
- A shorted device.
- Differential devices or terminators are attached to the single-ended SCSI
bus.
Note:
The SCSI-2 Fast/Wide and Ultra PCI Adapters use an onboard electronic
terminator on the external SCSI bus. When power is removed from the adapter,
as in the case of this procedure, the terminator goes to a high impedance
state and the resistance measured cannot be verified, other than it is high.
Some external terminators use an electronic terminator, which also goes to
a high impedance state when power is removed. Therefore, this procedure is
designed to find a short or low resistance fault as opposed to the presence
of a terminator or a missing terminator.
Internal SCSI-2 Single-Ended Bus PTC Isolation Procedure
Isolate the internal SCSI bus PTC resistor fault with the following
procedure:
Note:
The internal bus is single-ended.
- Ensure that system power and all externally attached device power
is turned off.
- Disconnect any internal and external cables from the adapter then
remove the adapter from the system.
-
Verify with a digital Ohmmeter, that the internal
PTC resistor, labeled Z1, is cool and in a low resistance state, typically
less than 1/2 Ohm. Measuring across, be sure to probe both sides
of the PTC where the solder joints and board come together. The polarity of
the test leads is not important. If necessary, allow the PTC to cool and measure
again. Refer to the following illustration.
Note:
Only the probe tips are touching the solder joints. Do not allow
the probes to touch any other part of the component.
- This step determines if there is a short on the adapter. Locate capacitor C1 and measure the resistance across it using the following
procedure:
- Connect the positive lead to the side of the capacitor where
the + is indicated. Be sure to probe at the solder joint where
the capacitor and board come together.
- Connect the negative lead to the opposite side of the capacitor. Be sure to probe at the solder joint where the capacitor and board come
together.
- If there is no short present, the resistance reading is high,
typically hundreds of Ohms.
Note:
Because this is a measurement across unpowered silicon devices,
the reading is a function of the Ohmmeter used.
- If there is a fault, the resistance reading is low, typically below 10
Ohms. Because there are no cables attached, the fault is on the adapter. Replace
the adapter.
Note:
Some multi-function meters label the leads specifically
for voltage measurements. When using this type of meter to measure resistance,
the plus lead and negative lead my not be labeled correctly. If you are not
sure that your meter leads accurately reflect the polarity for measuring resistance,
repeat this step with the leads reversed. Polarity is important in this measurement
to prevent forward-biasing diodes which lead to a false low resistance reading.
If the short circuit is not indicated with the leads reversed, the SCSI bus
is not faulted (shorted).
- If the resistance is high and there is no internal cable to reattach,
there is no apparent cause for the PTC resistor diagnostic failure.
- If the resistance is high and there is an internal cable to reattach,
proceed to the next step.
- Reattach the internal cable to the adapter, then do the following:
- Measure across C1 as described above.
- If the resistance is still high, above 25 Ohms, there is no
apparent cause for a PTC failure.
- If the resistance is less than 10 Ohms, a fault on the internal
SCSI bus is possible. Troubleshoot the internal SCSI bus by disconnecting
devices and terminators. Measure across C1 to determine if the fault has been
removed.
Note:
Some internal cables have nonremovable terminators.
Internal SCSI-2 Single-Ended Bus Probable Tripped PTC Resistor Causes
The following list provides some suggestions of things to check
when the PTC is tripped:
- A shorted terminator or cable. Check for bent pins on each connector and
removable terminator.
- Intermittent PTC failures can be caused by incorrectly seated cable connectors.
Reseat the connector and flex the cable in an attempt to duplicate the fault
condition across C1.
- A shorted device.
- On some systems, the terminator is fixed to the internal cable and cannot
be removed. If all devices are removed from the cable and the resistance is
still low, then the cable should be replaced.
Note:
The SCSI-2 Fast/Wide and
Ultra PCI adapters use an onboard electronic terminator on the internal SCSI
bus. When power is removed from the adapter, as in the case of this procedure,
the terminator goes to a high impedance state and the resistance measured
cannot be verified, other than it is high. Some internal terminators use an
electronic terminator, which also goes to a high impedance state when power
is removed. Therefore, this procedure is designed to find a short or low resistance
fault as opposed to the presence of a terminator or a missing terminator.
SCSI-2 Differential Adapter PTC Failure Isolation Procedure
Use this procedure when SRN xxx-240 or xxx-800 has
been indicated.
The differential adapter can be identified by the 4-B
or 4-L on the external bracket plate.
Before replacing a SCSI-2 differential
adapter, use these procedures to determine if a short-circuit condition exists
on the SCSI Bus. The PTC protects the SCSI bus from high currents due to shorts
on the cable, terminator, or device. It is unlikely that the PTC can be tripped
by a defective adapter. Unless instructed to do so by these procedures, do
not replace the adapter because of a tripped PTC resistor.
A fault (short-circuit)
causes an increase in PTC resistance and temperature. The increase in resistance
causes the PTC to halt current flow. The PTC returns to a low resistive and
low temperature state when the fault is removed from the SCSI bus or when
the system is turned off. Wait 5 minutes for the PTC resistor to fully cool,
then retest.
These procedures determine if the PTC resistor is still
tripped and then determine if there is a short somewhere on the SCSI bus.
External SCSI-2 Differential Adapter Bus PTC Isolation Procedure
Isolate the external SCSI bus PTC fault with the following procedure:
Notes:
- Only the probe tips are touching the solder joints. Do not allow the probes
to touch any other part of the component.
- The external bus is differential.
- Ensure that system power and all externally attached device power
is turned off.
- Check to ensure all devices are marked SCSI Differential and that
the terminator on the end of the SCSI bus is also marked differential. If not, you may have a single-ended SCSI device or terminator on the
differential SCSI bus. Single-ended devices do not work on a differential
SCSI bus and may cause a PTC type error to be reported. The entire SCSI bus
may appear to be intermittent. After ensuring the system is completely differential,
continue.
- Disconnect the external cables from the adapter and remove the
adapter from the system.
- Verify with a digital Ohmmeter that the internal PTC resistor,
labeled Z1, (refer to the illustration on page External SCSI-2 Differential Adapter Bus PTC Isolation Procedure)
is cool and in a low resistance state, typically less than 1/2 Ohm. Measuring across, be sure to probe both sides of the PTC resistor where
the solder joints and board come together. The polarity of the test leads
is not important. If necessary, allow the PTC resistor to cool and measure
again.
- This step determines if there is a short on the adapter. Locate capacitor C1 and measure the resistance across it using the following
procedure:
- Connect the negative lead to the side of the capacitor marked
"GND". Be sure to probe at the solder joint where the capacitor
and board come together.
- Connect the positive lead to the side of the capacitor marked
"Cathode D1" on the board near C1. Be sure to probe at the solder
joint where the capacitor and board come together.
- If there is no fault present, then the resistance reading is 25 to 35
Ohms. The adapter is not faulty. Continue to the next step.
- If the resistance measured is higher than 35 Ohms, check to see if RN1,
RN2, and RN3 are plugged into their sockets. If these sockets are empty, you
are working with a Multi-Initiators or High-Availability system. With these
sockets empty, a resistive reading across C1 cannot be verified other than
it measures a high resistance (not a short). If the resistance measurement
is not low enough to be suspected as a fault (lower than 10 Ohms), continue
to the next step.
- If the resistance is high and there is no external cable to reattach,
there is no apparent cause for the PTC diagnostic failure.
- If the resistance reading is low, typically below 10 Ohms, there is a
fault. Because there are no cables attached, the fault is on the adapter.
Replace the adapter.
- If the resistance measured was high and there is an external cable to
reattach, proceed to the next step.
- Reattach the external cable to the adapter.
- Measure across C1 as previously described.
- If the resistance is between 10 to 20 Ohms, there is no apparent
cause for a PTC resistor failure.
- If the resistance is less than 10 Ohms, there is a possibility
of a fault on the external SCSI bus. Troubleshoot the external
SCSI bus by disconnecting devices and terminators. Measure across C1 to determine
if the fault has been removed.
SCSI-2 Differential Adapter Probable Tripped PTC Causes
The following list provides some suggestions of things to check
when the PTC is tripped:
- A shorted terminator or cable. Check for bent pins on each connector and
removable terminator.
- Intermittent PTC failures can be caused by incorrectly seated cable connectors.
Reseat the connector and flex the cable in an attempt to duplicate the fault
condition across C1.
- Plugging or unplugging a cable or terminator while the system is turned
on (hot-plugging).
- A shorted device.
- Single-ended devices are attached to the differential SCSI bus.
Dual-Channel Ultra SCSI Adapter PTC Failure Isolation Procedure
Use the following procedures if diagnostics testing indicates
a potential positive temperature coefficient (PTC) resistor fault or the TERMPWR
Shorted LED is lit.
This procedure is used for SRNs 637-240 and 637-800
on the Dual-Channel Ultra SCSI Adapter. If the TERMPWR Shorted LED is lit,
use this procedure to help isolate the source of the problem on the failing
channel.
- Identify the adapter by its label of 4-R on the external bracket. Then, determine if the failure is on channel A or channel B.
- The same PTC is used for both the internal and external buses. The PTC protects the SCSI bus from high currents due to shorts on the
cable, terminator, or device. It is unlikely that the PTC can be tripped by
a defective adapter. A fault (short-circuit) causes an increase in PTC resistance
and temperature. The increase in resistance causes the PTC to halt current
flow. The PTC returns to a low resistive and low temperature state when the
fault is removed from the SCSI bus or when the system is turned off.
Wait
5 minutes for the PTC resistor to fully cool, then retest.
- If this same error persists, or the TERMPWR Shorted LED is lit,
replace the components of the failing channel in the following order (wait
five minutes between steps):
- If the failure is on the external cable, replace the following:
- Cable
- Device
- Attached subsystem
- Adapter
- If the failure is on the internal cable, replace the following:
- Cable
- Device
- Backplane
- Adapter
- If the failure persists, verify that the parts exchanged are
in the correct channel (internal or external, A or B).
If the
errors are still occurring, continue isolating the problem by going to MAP 0050: SCSI bus problems.
64-bit PCI-X Dual Channel SCSI Adapter PTC Failure Isolation Procedure
Use the following procedures if diagnostics testing indicates a potential
self-resetting thermal fuse problem. This procedure is used for SRN 2524-702
on the integrated dual-channel SCSI adapter in a 7039/651 system.
- Identify the adapter as the one embedded in the system board. Then, determine if the failure is on channel 0 or channel 1.
- The thermal fuse protects the SCSI bus from high currents due to
shorts on the terminator, cable, or device. It is unlikely that
the thermal fuse can be tripped by a defective adapter. A fault (short-circuit)
causes an increase in resistance and temperature of the thermal fuse. The
increase in temperature causes the thermal fuse to halt current flow. The
thermal fuse returns to a low resistive and low temperature state when the
fault is removed from the SCSI bus or when the system is turned off.
Wait
10 seconds for the thermal fuse to reset itself and recover, then retest.
- If the same error persists, replace the components of the failing
channel in the following order. Wait 10 seconds for the thermal
fuse to reset itself between steps.
- Cable
- Device
- DASD backplane (if present)
- System board (adapter)
- If the failure persists, verify that the parts exchanged are in
the correct channel (0 or 1). If the errors are still occurring, continue
isolating the problem by going to MAP 0050: SCSI bus problems.