LICIP13

A disk unit seems to have stopped communicating with the system.

The system has stopped normal operation until the cause of the disk unit failure is found and corrected. Ensure you have read the Danger notices in Licensed internal code (LIC) isolation procedures before continuing with this procedure.

If the disk unit that stopped communicating with the system has mirrored protection active, normal operation of the system stops for one to two minutes. Then the system suspends mirrored protection for that disk unit and continues normal operation.

Note:
Do not power off the system or partition using the white button, function 08, ASMI, or HMC immediate power-off when performing this procedure. If this procedure or other isolation procedures referenced by this procedure direct you to IPL or power off the system,

  1. If the system has logical partitions, perform this procedure from the logical partition that reported the problem. To determine if the system has logical partitions, go to Determining if the system has logical partitions before continuing with this procedure.
  2. Was a problem summary form completed for this problem?
  3. Fill out a Problem Reporting Form ... completely with the instructions provided.
  4. Recovery from a device command time-out may have caused the communications loss condition (indicated by an SRC on the control panel or in the HMC). This communications loss condition has the following symptoms:

    Does the communication loss condition have the above symptoms?

  5. Verify that all Licensed Internal Code PTFs have been applied to the system. Apply any Licensed Internal Code PTFs that have not been applied to the system. Does the intermittent condition continue?
  6. A manual reset of the IOP may clear the attention reference code. Perform the following:

    If you are working from the control panel:

    1. Select Manual mode on the control panel.
    2. Select Function 25 and press Enter.
    3. Select Function 26 and press Enter.
    4. Select Function 67 and press Enter to reset the IOP.
    5. Wait 10 minutes.
    6. Select Function 25 and press Enter to disable the service functions on the control panel.

    If you are working from the HMC:

    1. These need to be updated for the new HMC UI....
    2. In the Navigation Area, open the Service Applications folder.
    3. Select Service Focal Point.
    4. In the contents area, select Service Utilities.
    5. In the Service Utilities window, select the system you are working on.
    6. Select Selected -> Operator Panel Service Functions.
    7. Select the logical partition, and then select Partition Functions.
    8. Select Disk Unit IOP Reset/Reload (67).
    9. Wait 10 minutes.

    Did the reset successfully clear the control panel SRC or HMC panel value and can commands be entered on the partition console?

  7. Is the SRC the same reference code that sent you here?
  8. Powering off and powering on the affected IOP domain may clear the attention reference code. Perform the following:

    If you are working from the control panel:

    1. Select Manual mode on the control panel.
    2. Select Function 25 and press Enter.
    3. Select Function 26 and press Enter.
    4. Select Function 68 and press Enter to power off the domain.
    5. After the domain has been powered off or 10 minutes have passed, select Function 69 and press Enter to power on the domain.
    6. Wait 10 minutes.
    7. Select Function 25 and press Enter to disable the service functions on the control panel.

    If you are working from the HMC:

    1. In the Navigation Area, open the Service Applications folder.
    2. Select Service Focal Point.
    3. In the contents area, select Service Utilities.
    4. In the Service Utilities window, select the system you are working on.
    5. Select Selected -> Operator Panel Service Functions.
    6. Select the logical partition, and then select Partition Functions.
    7. Select Power off domain (68).
    8. After the domain has been powered off or 10 minutes have passed, select Power on domain (69).
    9. Wait 10 minutes.

    Did this successfully clear the control panel SRC or HMC panel value, and can commands be entered on the partition console?

  9. Is the SRC the same reference code that sent you here?
  10. Perform a main storage dump, then perform an IPL by performing the following:

    If you are working from the control panel:

    1. Select Manual mode on the control panel.
    2. Select Function 22 and press Enter to dump the main storage to the load-source disk unit.
    3. Wait for SRC A100 300x to occur, indicating that the dump is complete.
    4. Then perform an IPL to DST (see Performing an IPL to DST).

    If you are working from the HMC:

    1. In the Navigation Area, open Server and Partition.
    2. Select Server Management.
    3. In the contents area, open the server on which the logical partition is located.
    4. Select Partitions.
    5. Right-click the logical partition profile and select Restart Partition.
    6. In the Restart Partition window, select the Dump restart option.

      Does a different SRC occur, or does a display appear on the console showing reference codes?

      • No: Continue with the next step.
      • Yes: Perform problem analysis to correct the new problem. This ends the procedure.
  11. Does the same reference code occur?
  12. Are characters 7-8 of the top 16 character line of function 12 (2 rightmost characters of word 2) equal to 13 or 17?
  13. Use the word 1 through 9 information recorded on the Problem summary form to determine the disk unit that stopped communicating with the system:
  14. Is the disk unit reference code 0000?
  15. Are characters 7-8 of the top 16 character line of function 12 (the two rightmost characters of word 2) equal to 27?
  16. Use the word 1 through 9 information recorded on the Problem summary form to determine the disk unit that stopped communicating with the system:
  17. Is the disk unit reference code 0000?
  18. Are characters 9-16 of the bottom 16 character line of function 13 (word 9) B6xx 51xx?
  19. Are the 2 rightmost characters of word 2 on the Problem summary form equal to 62?
  20. Are characters 9-16 of the top 16 character line of function 12 (word 3) equal to 00010004?
  21. Are characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) equal to 0000?
  22. Note the following:

    Find the table for the disk unit type (characters 1-4 of the bottom 16 character line of function 13 - 4 leftmost characters of word 8), and use characters 13-16 of the bottom 16 character line of function 12 (4 rightmost characters of word 5) as the unit reference code. This ends the procedure.

  23. Are characters 9-16 of the top 16 character line of function 12 (word 3) equal to 0002000D?
  24. Note the following:

    Find the table for the disk unit type (characters 1-4 of the bottom 16 character line of function 13 (4 leftmost characters of word 8) and use 3002 as the unit reference code. Exchange the FRUs for URC 3002 one at a time. This ends the procedure.