Manual Ibm X3650 PDF
Manual Ibm X3650 PDF
Manual Ibm X3650 PDF
System x3650 M3
Types 4255, 7945, and 7949
Problem Determination and Service Guide
System x3650 M3
Types 4255, 7945, and 7949
Problem Determination and Service Guide
Note
Note: Before using this information and the product it supports, read the general information in
Appendix B, “Notices,” on page 521; and read the IBM Safety Information and the IBM Systems
Environmental Notices and User Guide on the IBM Documentation CD.
iv System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Appendix B. Notices . . . . . . . . 521 Germany Class A statement . . . . . . . 526
Trademarks . . . . . . . . . . . . . . 522 Japan VCCI Class A statement. . . . . . . 527
Important notes . . . . . . . . . . . . 522 Japan Electronics and Information Technology
Particulate contamination . . . . . . . . . 523 Industries Association (JEITA) statement . . . 528
Documentation format . . . . . . . . . . 524 Korea Communications Commission (KCC)
Telecommunication regulatory statement . . . . 524 statement. . . . . . . . . . . . . . 528
Electronic emission notices . . . . . . . . . 525 Russia Electromagnetic Interference (EMI) Class
Federal Communications Commission (FCC) A statement . . . . . . . . . . . . . 528
statement. . . . . . . . . . . . . . 525 People's Republic of China Class A electronic
Industry Canada Class A emission compliance emission statement . . . . . . . . . . 528
statement. . . . . . . . . . . . . . 525 Taiwan Class A compliance statement . . . . 529
Avis de conformité à la réglementation
d'Industrie Canada . . . . . . . . . . 525 Index . . . . . . . . . . . . . . . 531
Australia and New Zealand Class A statement 525
European Union EMC Directive conformance
statement. . . . . . . . . . . . . . 526
Contents v
vi System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Safety
Before installing this product, read the Safety Information.
Ennen kuin asennat tämän tuotteen, lue turvaohjeet kohdasta Safety Information.
viii System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Guidelines for trained service technicians
This section contains information for trained service technicians.
Each IBM product, as it was designed and manufactured, has required safety items
to protect users and service technicians from injury. The information in this section
addresses only those items. Use good judgment to identify potential unsafe
conditions that might be caused by non-IBM alterations or attachment of non-IBM
features or optional devices that are not addressed in this section. If you identify
an unsafe condition, you must determine how serious the hazard is and whether
you must correct the problem before you work on the product.
Consider the following conditions and the safety hazards that they present:
v Electrical hazards, especially primary power. Primary voltage on the frame can
cause serious or fatal electrical shock.
v Explosive hazards, such as a damaged CRT face or a bulging capacitor.
v Mechanical hazards, such as loose or missing hardware.
To inspect the product for potential unsafe conditions, complete the following
steps:
1. Make sure that the power is off and the power cords are disconnected.
2. Make sure that the exterior cover is not damaged, loose, or broken, and observe
any sharp edges.
3. Check the power cords:
v Make sure that the third-wire ground connector is in good condition. Use a
meter to measure third-wire ground continuity for 0.1 ohm or less between
the external ground pin and the frame ground.
v Make sure that the power cords are the correct type.
v Make sure that the insulation is not frayed or worn.
4. Remove the cover.
5. Check for any obvious non-IBM alterations. Use good judgment as to the safety
of any non-IBM alterations.
6. Check inside the system for any obvious unsafe conditions, such as metal
filings, contamination, water or other liquid, or signs of fire or smoke damage.
7. Check for worn, frayed, or pinched cables.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not
been removed or tampered with.
Safety ix
Guidelines for servicing electrical equipment
Observe these guidelines when you service electrical equipment.
v Check the area for electrical hazards such as moist floors, nongrounded power
extension cords, and missing safety grounds.
v Use only approved tools and test equipment. Some hand tools have handles that
are covered with a soft material that does not provide insulation from live
electrical current.
v Regularly inspect and maintain your electrical hand tools for safe operational
condition. Do not use worn or broken tools or testers.
v Do not touch the reflective surface of a dental mirror to a live electrical circuit.
The surface is conductive and can cause personal injury or equipment damage if
it touches a live electrical circuit.
v Some rubber floor mats contain small conductive fibers to decrease electrostatic
discharge. Do not use this type of mat to protect yourself from electrical shock.
v Do not work alone under hazardous conditions or near equipment that has
hazardous voltages.
v Locate the emergency power-off (EPO) switch, disconnecting switch, or electrical
outlet so that you can turn off the power quickly in the event of an electrical
accident.
v Disconnect all power before you perform a mechanical inspection, work near
power supplies, or remove or install main units.
v Before you work on the equipment, disconnect the power cord. If you cannot
disconnect the power cord, have the customer power-off the wall box that
supplies power to the equipment and lock the wall box in the off position.
v Never assume that power has been disconnected from a circuit. Check it to
make sure that it has been disconnected.
v If you have to work on equipment that has exposed electrical circuits, observe
the following precautions:
– Make sure that another person who is familiar with the power-off controls is
near you and is available to turn off the power if necessary.
– When you work with powered-on electrical equipment, use only one hand.
Keep the other hand in your pocket or behind your back to avoid creating a
complete circuit that could cause an electrical shock.
– When you use a tester, set the controls correctly and use the approved probe
leads and accessories for that tester.
– Stand on a suitable rubber mat to insulate you from grounds such as metal
floor strips and equipment frames.
v Use extreme care when you measure high voltages.
v To ensure proper grounding of components such as power supplies, pumps,
blowers, fans, and motor generators, do not service these components outside of
their normal operating locations.
v If an electrical accident occurs, use caution, turn off the power, and send another
person to get medical aid.
x System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Safety statements
These statements provide the caution and danger information that is used in this
documentation.
Important:
Be sure to read all caution and danger statements in this documentation before you
perform the procedures. Read any additional safety information that comes with
your system or optional device before you install the device.
Statement 1
DANGER
To Connect: To Disconnect:
Safety xi
Statement 2
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has a
module containing a lithium battery, replace it only with the same module type
made by the same manufacturer. The battery contains lithium and can explode if
not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
Statement 3
CAUTION:
When laser products (such as CD-ROMs, DVD drives, fiber optic devices, or
transmitters) are installed, note the following:
v Do not remove the covers. Removing the covers of the laser product could
result in exposure to hazardous laser radiation. There are no serviceable parts
inside the device.
v Use of controls or adjustments or performance of procedures other than those
specified herein might result in hazardous radiation exposure.
xii System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
DANGER
Some laser products contain an embedded Class 3A or Class 3B laser diode.
Note the following.
Laser radiation when open. Do not stare into the beam, do not view directly
with optical instruments, and avoid direct exposure to the beam.
Statement 4
CAUTION:
Use safe practices when lifting.
Statement 5
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current from
the device, ensure that all power cords are disconnected from the power source.
2
1
Safety xiii
Statement 6
CAUTION:
If you install a strain-relief bracket option over the end of the power cord that is
connected to the device, you must connect the other end of the power cord to an
easily accessible power source.
Statement 8
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any component
that has this label attached. There are no serviceable parts inside these
components. If you suspect a problem with one of these parts, contact a service
technician.
Statement 12
CAUTION:
The following label indicates a hot surface nearby.
Statement 26
xiv System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
CAUTION:
Do not place any object on top of rack-mounted devices.
DANGER
Safety xv
xvi System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 1. Start here
You can solve many problems without outside assistance by following the
troubleshooting procedures in this documentation and on the World Wide Web.
This document describes the diagnostic tests that you can perform, troubleshooting
procedures, and explanations of error messages and error codes. The
documentation that comes with your operating system and software also contains
troubleshooting information.
Diagnosing a problem
Before you contact IBM or an approved warranty service provider, follow these
procedures in the order in which they are presented to diagnose a problem with
your server.
Procedure
1. Return the server to the condition it was in before the problem occurred. If
any hardware, software, or firmware was changed before the problem occurred,
if possible, reverse those changes. This might include any of the following
items:
v Hardware components
v Device drivers and firmware
v System software
v UEFI firmware
v System input power or network connections
2. View the light path diagnostics LEDs and event logs. The server is designed
for ease of diagnosis of hardware and software problems.
v Light path diagnostics LEDs: See “Light path diagnostics” on page 360 for
information about using light path diagnostics LEDs.
v Event logs: See “System event messages log” on page 31 for information
about notification events and diagnosis.
v Software or operating-system error codes: See the documentation for the
software or operating system for information about a specific error code. See
the manufacturer's website for documentation.
3. Run IBM Dynamic System Analysis (DSA) and collect system data. Run
Dynamic System Analysis (DSA) to collect information about the hardware,
firmware, software, and operating system. Have this information available
when you contact IBM or an approved warranty service provider. For
instructions for running DSA, see the Dynamic System Analysis Installation and
User's Guide.
To download the latest version of DSA code and the Dynamic System Analysis
Installation and User's Guide, go to http://www.ibm.com/support/entry/portal/
docdisplay?lndocid=SERV-DSA.
4. Check for and apply code updates. Fixes or workarounds for many problems
might be available in updated UEFI firmware, device firmware, or device
drivers. To display a list of available updates for the server, go to
http://www.ibm.com/support/fixcentral/.
2 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
For information about configuring the server, see Chapter 6, “Configuration
information and instructions,” on page 491.
6. See controller and management software documentation. If the problem is
associated with a specific function (for example, if a RAID hard disk drive is
marked offline in the RAID array), see the documentation for the associated
controller and management or controlling software to verify that the controller
is correctly configured.
Problem determination information is available for many devices such as RAID
and network adapters.
For problems with operating systems or IBM software or devices, go to
http://www.ibm.com/supportportal.
7. Check for troubleshooting procedures and RETAIN tips. Troubleshooting
procedures and RETAIN tips document known problems and suggested
solutions. To search for troubleshooting procedures and RETAIN tips, go to
http://www.ibm.com/supportportal.
8. Use the troubleshooting tables. See “Troubleshooting tables” on page 340 to
find a solution to a problem that has identifiable symptoms.
A single problem might cause multiple symptoms. Follow the troubleshooting
procedure for the most obvious symptom. If that procedure does not diagnose
the problem, use the procedure for another symptom, if possible.
If the problem remains, contact IBM or an approved warranty service provider
for assistance with additional problem determination and possible hardware
replacement. To open an online service request, go to http://www.ibm.com/
support/entry/portal/Open_service_request. Be prepared to provide
information about any error codes and collected data.
Undocumented problems
If you have completed the diagnostic procedure and the problem remains, the
problem might not have been previously identified by IBM. After you have
verified that all code is at the latest level, all hardware and software configurations
are valid, and no light path diagnostics LEDs or log entries indicate a hardware
component failure, contact IBM or an approved warranty service provider for
assistance.
For information about the terms of the warranty and getting service and assistance,
see the Warranty and Support Information document on the IBM Documentation CD.
Related documentation
This Installation and User’s Guide contains general information about the server,
including how to set up the server, how to install supported optional devices, and
how to configure the server.
The System x and xSeries Tools Center is an online information center that contains
information about tools for updating, managing, and deploying firmware, device
drivers, and operating systems. The System x and xSeries Tools Center is at
http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
The server might have features that are not described in the documentation that
comes with the server. The documentation might be updated occasionally to
include information about those features, or technical updates might be available
to provide additional information that is not included in the server documentation.
These updates are available from the IBM Web site. To check for updated
documentation and technical updates, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Publications lookup.
4. From the Product family menu, select System x3650 M3 and click Continue.
6 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Features and specifications
The following information is a summary of the features and specifications of the
server. Depending on the model, some features might not be available, or some
specifications might not apply.
Racks are marked in vertical increments of 4.45 cm (1.75 inches). Each increment is
referred to as a unit, or “U.” A 1-U-high device is 1.75 inches tall.
Note:
1. Power consumption and heat output vary depending on the number and type
of optional features that are installed and the power-management optional
features that are in use.
2. The sound levels were measured in controlled acoustical environments
according to the procedures specified by the American National Standards
Institute (ANSI) S12.10 and ISO 7779 and are reported in accordance with ISO
9296. Actual sound-pressure levels in a given location might exceed the average
values stated because of room reflections and other nearby noise sources. The
declared sound-power levels indicate an upper limit, below which a large
number of computers will operate.
Chapter 2. Introduction 7
Table 1. Features and specifications
Microprocessor: Size (2U):
v Supports up to two Intel Xeon™ v Height: 85.2 mm (3.346 in.) Video controller (integrated into IMM):
multi-core microprocessors (one v Depth: EIA flange to rear - 698 mm v Matrox G200eV (two analog ports - one
installed) (27.480 in.), Overall - 729 mm (28.701 front and one rear that can be connected
v Level-3 cache in.) at the same time)
v QuickPath Interconnect (QPI) links v Width: With top cover - 443.6 mm Note: The maximum video resolution is
speed up to 6.4 GT per second (17.465 in.), With front bezel - 482.0 1600 x 1200 at 75 Hz.
mm (18.976 in.) – SVGA compatible video controller
Note: v Weight: approximately 21.09 kg (46.5 – DDR2 250 MHz SDRAM video
v Do not install an Intel Xeon™ 5500 series lb) to 25 kg (55 lb) depending upon memory controller
microprocessor and an Xeon™ 5600 configuration – Avocent Digital Video Compression
series microprocessor in the same server. – 16 MB of video memory (not
v Use the Setup utility to determine the Integrated functions: expandable)
type and speed of the microprocessors. v Integrated management module
(IMM), which provides service ServeRAID controller (depending on the
v For a list of supported microprocessors, processor control and monitoring model):
see http://www.ibm.com/systems/ functions, video controller, and (when v A ServeRAID-BR10il v2 SAS/SATA
info/x86servers/serverproven/compat/
the optional virtual media key is adapter that provides RAID levels 0, 1,
us/. installed) remote keyboard, video, and 1E (comes standard on some
mouse, and remote hard disk drive hot-swap models).
Memory:
capabilities
v Minimum: 2 GB v An optional ServeRAID-BR10il
v Dedicated or shared management
v Maximum: 288 GB SAS/SATA adapter that provides RAID
network connections
– 48 GB using unbuffered DIMMs levels 0, 1, and 1E can be ordered.
v Serial over LAN (SOL) and serial
(UDIMMs) v An optional ServeRAID-MR10i
redirection over Telnet or Secure Shell
– 288 GB using registered DIMMs SAS/SATA adapter that provides RAID
(SSH)
(RDIMMs) levels 0, 1, 5, 6, 10, 50, and 60 can be
v One systems-management RJ-45 for
v Type: PC3-10600R-999, 800, 1067, and ordered.
connection to a dedicated
1333 MHz, ECC, DDR3 registered or
systems-management network v An optional ServeRAID-M1015
unbuffered SDRAM DIMMs
v Support for remote management SAS/SATA adapter that provides RAID
v Slots: 18 dual inline
presence through an optional virtual levels 0, 1, and 10 with optional RAID
v Supports (depending on the model):
media key 5/50 and SED (Self Encrypting Drive)
– 2 GB and 4 GB unbuffered DIMMs
v Broadcom BCM5709 Gb Ethernet upgrade.
– 2 GB, 4 GB, 8 GB, and 16 GB
controller with TCP/IP Offload Engine v An optional ServeRAID-M5014
registered DIMMs
(TOE) and Wake on LAN support SAS/SATA adapter that provides RAID
v Four Ethernet ports (two on system levels 0, 1, 5, 10 and 50 with optional
SATA optical drives (optional):
board and two additional ports when
v DVD-ROM battery and RAID 6/60 and SED
the optional IBM Dual-Port 1 Gb
v Multi-burner upgrade.
Ethernet Daughter Card is installed)
v One serial port, shared with the v An optional ServeRAID-M5015
Hard disk drive expansion bays SAS/SATA adapter with battery that
(depending on the model): integrated management module (IMM)
v Four Universal Serial Bus (USB) ports provides RAID levels 0, 1, 5, 10, and 50
v Eight 2.5-inch SAS hot-swap bays for (two on front, two on rear of server), with optional RAID 6/60 and SED
hard disk drive bays with option to add v2.0 supporting v1.1, plus one or more upgrade
eight more 2.5-inch SAS hot-swap hard dedicated internal USB ports on the
disk drive bays Note:
SAS riser-card
v Four 2.5-inch simple-swap, solid state 1. RAID is supported in hot-swap models
v Two video ports (one on front and one
SATA hard disk drive bays only.
on rear of server)
v One SATA tape connector, one USB 2. The ServeRAID controllers are installed
PCI expansion slots: tape connector, and one tape power in a PCI Express x8 mechanical slot (x4
v Two PCI Express riser cards with two connector on SAS riser-card (some electrical); however, the controllers run
PCI Express x8 slots (x8 lanes) each, models) at x4 bandwidth.
standard v Support for hypervisor function
v Support for the following optional riser through an optional USB flash device
cards: on the SAS riser-card (not available on
– Two 133 MHz/64-bit PCI-X 1.0a slots simple-swap models)
– One PCI Express x16 slot (x16 lanes)
Note: In messages and documentation,
the term service processor refers to the
integrated management module (IMM).
8 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 1. Features and specifications (continued)
Electrical input with hot-swap ac power Environment: Hot-swap fans: Three - provide redundant
supplies: v Air temperature: cooling.
v Sine-wave input (47 - 63 Hz) required – Server on: 10°C to 35°C (50.0°F to
v Input voltage range automatically 95.0°F); altitude: 0 to 914.4 m (3000 Power supply:
selected ft). Decrease system temperature by v Up to two hot-swap power supplies for
v Input voltage low range: 1°C for every 1000-foot increase in redundancy support
– Minimum: 100 V ac altitude.
– 460-watt ac
– Maximum: 127 V ac – Server off: 5°C to 45°C (41.0°F to
v Input voltage high range: 113.0°F); maximum altitude: 3048 m – 675-watt ac
– Minimum: 200 V ac (10000 ft) – 675-watt high-efficiency ac
– Maximum: 240 V ac – Shipment: -40°C to +60°C (-40°F to – 675-watt dc
v Input kilovolt-amperes (kVA) 140°F); maximum altitude: 3048 m
approximately: (10000 ft) Note: You cannot mix 460-watt and
– Minimum: 0.090 kVA v Humidity: 675-watt power supplies, or high-efficiency
– Maximum: 0.700 kVA – Server on: 20% to 80%; maximum and non-high-efficiency power supplies, or
dew point: 21°C; maximum rate of ac and dc power supplies in the server.
change 5°C per hour.
– Server off: 8% to 80%; maximum Acoustical noise emissions:
dew point: 27°C v Declared sound power, idle: 6.3 bel
– Shipment: 5% to 100% v Declared sound power, operating: 6.5 bel
v Particulate contamination:
Heat output:Approximate heat output:
Attention: Airborne particulates and
v Minimum configuration: 662 Btu per
reactive gases acting alone or in
hour (194 watts)
combination with other environmental
v Maximum configuration: 2302 Btu per
factors such as humidity or
hour (675 watts)
temperature might pose a risk to the
server. For information about the limits
for particulates and gases, see
“Particulate contamination” on page
523.
Chapter 2. Introduction 9
Table 2. Power supply efficiency (PSU 2) (continued)
Input Output Efficiency
IRMS A PF ITHD (%) Load (%) Watts Watts %
3.28 0.976 17.5% 100% 736.58 670.10 90.97%
PSU 3:
Table 3. Power supply efficiency (PSU 3)
Input Output Efficiency
IRMS A PF ITHD (%) Load (%) Watts Watts %
0.50 0.694 19.02% 10% 79.4 68.2 85.9%
0.74 0.867 15.1% 20% 148.4 134.8 90.8%
1.63 0.968 4.85% 50% 362.1 336.1 93.0%
3.24 0.988 5.3% 100% 737.3 672.3 91.2%
10 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Server controls, LEDs, and power
This section describes the controls and light-emitting diodes (LEDs) and how to
turn the server on and off.
Front view
The following illustration shows the controls, connectors, and hard disk drive bays
on the front of the server.
Rack Rack
release release
latch latch
Figure 1. Controls, connectors, and hard disk drive bays front view
Hard disk drive activity LED: Each hard disk drive has an activity LED. When
this LED is flashing, it indicates that the drive is in use.
Hard disk drive status LED: Each hard disk drive has a status LED. When this
LED is lit, it indicates that the drive has failed. When this LED is flashing slowly
(one flash per second), it indicates that the drive is being rebuilt as part of a RAID
configuration. When the LED is flashing rapidly (three flashes per second), it
indicates that the controller is identifying the drive.
Video connector: Connect a monitor to this connector. The video connectors on the
front and rear of the server can be used simultaneously.
USB connectors: Connect a USB device, such as USB mouse, keyboard, or other
USB device, to either of these connectors.
(LEDs), and connectors. For information about the controls and LEDs on the
operator information panel, see “Operator information panel” on page 12.
Rack release latches: Press these latches to release the server from the rack.
Optional CD/DVD drive activity LED: When this LED is lit, it indicates that the
CD-RW/DVD drive is in use.
Chapter 2. Introduction 11
Operator information panel
The following illustration shows the controls and LEDs on the operator
information panel.
The following controls and LEDs are on the operator information panel:
v Power-control button and power-on LED: Press this button to turn the server
on and off manually or to wake the server from a reduced-power state. The
states of the power-on LED are as follows:
– Off: AC power is not present, or the power supply or the LED itself has
failed.
– Flashing rapidly (4 times per second): The server is turned off and is not
ready to be turned on. The power-control button is disabled. This will last
approximately 20 to 40 seconds.
12 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Light path diagnostics panel
The light path diagnostics panel is on the top of the operator information panel.
To access the light path diagnostics panel, slide the blue release button on the
operator information panel to the left. Pull forward on the operator information
panel until the hinge of the panel is free of the server chassis. Then pull down on
the operator information panel, so that you can view the light path diagnostics
panel information.
Operator information
panel
Light path
diagnostics LEDs
Release latch
The following illustration shows the controls and LEDs on the light path
diagnostics panel.
Note:
1. Do not run the server for an extended period of time while the light path
diagnostics panel is pulled out of the server.
2. Light path diagnostics LEDs remain lit only while the server is connected to
power.
Checkpoint
code display
v Remind button: This button places the system-error LED on the front panel into
Remind mode. In Remind mode, the system-error LED flashes once every 2
seconds until the problem is corrected, the server is restarted, or a new problem
occurs.
By placing the system-error LED indicator in Remind mode, you acknowledge
that you are aware of the last failure but will not take immediate action to
correct the problem. The remind function is controlled by the IMM.
Chapter 2. Introduction 13
v NMI button: Press this button to force a nonmaskable interrupt to the
microprocessor, if directed to do so by IBM service and support.
v Reset button: Press this button to reset the server and run the power-on self-test
(POST). You might have to use a pen or the end of a straightened paper clip to
press the button. The reset button is in the lower-right corner of the light path
diagnostics panel.
For more information about light path diagnostics, see the Problem Determination
and Service Guide on the IBM Documentation CD.
Rear view
The following illustration shows the connectors on the rear of the server.
USB connectors: Connect a USB device, such as USB mouse, keyboard, or other
USB device, to any of these connectors.
Serial connector: Connect a 9-pin serial device to this connector. The serial port is
shared with the integrated management module (IMM). The IMM can take control
of the shared serial port to perform text console redirection and to redirect serial
traffic, using Serial over LAN (SOL).
Video connector: Connect a monitor to this connector. The video connectors on the
front and rear of the server can be used simultaneously.
The following illustration shows the LEDs on the rear of the server.
14 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Ethernet Ethernet
activity LED link LED AC power
LED (green)
DC power
LED (green)
Power-supply
error LED (amber)
Power-on System-error
LED (green) LED (amber)
Locator LED (blue)
Ethernet activity LEDs: When these LEDs are lit, they indicate that the server is
transmitting to or receiving signals from the Ethernet LAN that is connected to the
Ethernet port.
Ethernet link LEDs: When these LEDs are lit, they indicate that there is an active
link connection on the 10BASE-T, 100BASE-TX, or 1000BASE-TX interface for the
Ethernet port.
AC power LED: Each hot-swap power supply has an ac power LED and a dc
power LED. When the ac power LED is lit, it indicates that sufficient power is
coming into the power supply through the power cord. During typical operation,
both the ac and dc power LEDs are lit. For any other combination of LEDs, see the
Problem Determination and Service Guide on the IBM Documentation CD.
DC power LED: Each hot-swap power supply has a dc power LED and an ac
power LED. When the dc power LED is lit, it indicates that the power supply is
supplying adequate dc power to the system. During typical operation, both the ac
and dc power LEDs are lit. For any other combination of LEDs, see the Problem
Determination and Service Guide on the IBM Documentation CD.
OUT OK power LED: Each hot-swap dc power supply has an IN OK power LED
and an OUT OK power LED. When the OUT OK power LED is lit, it indicates that
the power supply is supplying adequate dc power to the system. During typical
operation, both the IN OK and OUT OK power LEDs are lit.
Power-supply error LED: When the power-supply error LED is lit, it indicates that
the power supply has failed.
System-error LED: When this LED is lit, it indicates that a system error has
occurred. An LED on the light path diagnostics panel is also lit to help isolate the
error. This LED is the same as the system-error LED on the front of the server.
Locator LED: Use this LED to visually locate the server among other servers. You
can use IBM Systems Director to light this LED remotely. This LED is the same as
the system-locator LED on the front of the server.
Chapter 2. Introduction 15
Power-on LED: Press this button to turn the server on and off manually or to
wake the server from a reduced-power state. The states of the power-on LED are
as follows:
v Off: AC power is not present, or the power supply or the LED itself has failed.
v Flashing rapidly (4 times per second): The server is turned off and is not ready
to be turned on. The power-control button is disabled. This will last
approximately 20 to 40 seconds.
16 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
System-board internal connectors
The following illustration shows the internal connectors on the system board.
Chapter 2. Introduction 17
Figure 8. System-board external connectors
Note: If there is a clear protective sticker on the top of the switch blocks, you must
remove and discard it to access the switches.
The default positions for the UEFI and the IMM recovery jumpers are pins 1 and 2.
18 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
IMM recovery jumper
(J147)
1
2
UEFI boot recovery 3 3
jumper (J29) 2
1
Chapter 2. Introduction 19
Table 4. System board jumpers (continued)
Jumper Jumper
number name Jumper setting
Note:
1. If no jumper is present, the server responds as if the pins are set to 1 and 2.
2. Changing the position of the UEFI boot recovery jumper from pins 1 and 2 to pins 2
and 3 before the server is turned on alters which flash ROM page is loaded. Do not
change the jumper pin position after the server is turned on. This can cause an
unpredictable problem.
The following illustration shows the jumper settings for switch blocks SW3 and
SW4 on the system board.
12 34 12 34
Table 5 on page 21 and Table 6 on page 21 describe the function of each switch on
SW3 and SW4 switch blocks on the system board.
20 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 5. System board switch block 3, switches 1 - 4
Switch
number Default value Switch description
1 Off Clear CMOS memory. When this switch is toggled to On, it clears the data in
CMOS memory.
2 Off Trusted Platform Module (TPM) physical presence. Turning this switch to the on
position indicates a physical presence to the TPM.
3 Off Reserved.
4 Off Reserved.
Changing the position of this switch does not affect the administrator password
check if an administrator password is set.
See “Passwords” on page 497 for additional information about the power-on
password.
2 Off Power-on override. When this switch is toggled to On and then to Off, you force a
power-on which overrides the power-on and power-off button on the server and
they become nonfunctional.
3 Off Forced power permission overrides the IMM power-on checking process. (Trained
service technician only)
4 Off Reserved.
Important:
1. Before you change any switch settings or move any jumpers, turn off the
server; then, disconnect all power cords and external cables. (Review the
information in “Safety” on page vii, “Installation guidelines” on page 395, and
“Handling static-sensitive devices” on page 398.)
2. Any system-board switch or jumper blocks that are not shown in the
illustrations in this document are reserved.
Chapter 2. Introduction 21
System-board LEDs
The following illustration shows the light-emitting diodes (LEDs) on the system
board.
Note: Error LEDs remain lit only while the server is connected to power.
22 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
System pulse LEDs
The following LEDs are on the system board and monitor the system power-on
and power-off sequencing and boot progress (see “System-board LEDs” on page 22
for the location of these LEDs).
Table 7. System-pulse LEDs
LED Description Action
Enclosure manager heartbeat
Indicates the status of (Trained service technician
power-on and power-off only) If the server is
sequencing. connected to power and the
LED is not flashing, replace
When the server is connected the system board.
to power, this LED flashes
slowly to indicate that the
enclosure manager is
working correctly.
IMM heartbeat
Indicates the status of the If the LED does not begin
boot process of the IMM. flashing within 30 seconds of
when the server is connected
When the server is connected to power, complete the
to power this LED flashes following steps:
quickly to indicate that the
1. (Trained service
IMM code is loading. When
technician only) Use the
the loading is complete, the
IMM recovery jumper to
LED stops flashing briefly
recover the firmware (see
and then flashes slowly to
Table 4 on page 19).
indicate that the IMM if fully
operational and you can 2. (Trained service
press the power-control technician only) Replace
button to start the server. the system board.
Chapter 2. Introduction 23
Figure 12. System-board optional device connectors
Note: Error LEDs remain lit only while the server is connected to power.
A 16-drive-capable model server contains the riser card that is shown in the
following illustration.
A tape-enabled model server contains the riser card that is shown in the following
illustration.
24 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 14. The riser card
Adapter
PCI
riser-card
assembly
Adapter
connectors
Chapter 2. Introduction 25
PCI riser-card assembly LEDs
The following illustration shows the light-emitting diodes (LEDs) on the PCI
riser-card assembly.
Note: Error LEDs remain lit only while the server is connected to power.
26 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 3. Diagnostics
This chapter describes the diagnostic tools that are available to help you solve
problems that might occur in the server.
If you cannot locate and correct a problem by using the information in this chapter,
see Appendix A, “Getting help and technical assistance,” on page 517 for more
information.
Diagnostic tools
The following tools are available to help you diagnose and solve hardware-related
problems.
v Troubleshooting tables
These tables list problem symptoms and actions to correct the problems. See
“Troubleshooting tables” on page 340.
v Light path diagnostics
Use the light path diagnostics to diagnose system errors quickly. See “Light path
diagnostics” on page 360 for more information.
v Dynamic System Analysis Preboot (DSA) diagnostic programs
The DSA Preboot diagnostic programs provide problem isolation, configuration
analysis, and error log collection. The diagnostic programs are the primary
method of testing the major components of the server and are stored in
integrated USB memory. The diagnostic programs collect the following
information about the server:
– System configuration
– Network interfaces and settings
– Installed hardware
– Light path diagnostics status
– Service processor status and configuration
– Vital product data, firmware, and IBM's implementation of UEFI
configuration
– Hard disk drive health
– RAID controller configuration
– ServeRAID controller and service processor event logs, including the
following information:
- System-event logs
- Temperature, voltage, and fan speed information
- Tape drive presence and read/write test results
- Systems management analysis and reporting technology (SMART) data
- USB information
- Monitor configuration information
- PCI slot information
The diagnostic programs create a merged log that includes events from all
collected logs. The information is collected into a file that you can send to IBM
service and support. Additionally, you can view the server information locally
through a generated text report file. You can also copy the log to removable
media and view the log from a Web browser. See “Running the diagnostic
programs” on page 287 for more information.
v IBM Electronic Service Agent
Event logs
Error codes and messages displayed in POST event log, system-event log,
integrated management module (IMM) event log, and DSA event log.
v POST event log: This log contains the three most recent error codes and
messages that were generated during POST. You can view the POST event log
through the Setup utility.
v System-event log: This log contains all BMC, POST, and system management
interrupt (SMI) events. You can view the system-event log from the Setup utility
and through the Dynamic System Analysis (DSA) program (as the IPMI event
log).
The system-event log is limited in size. When it is full, new entries will not
overwrite existing entries; therefore, you must periodically save and then clear
the system-event log through the Setup utility when the IMM logs an event that
indicates that the log is more than 75% full. When you are troubleshooting, you
might have to save and then clear the system-event log to make the most recent
events available for analysis.
Messages are listed on the left side of the screen, and details about the selected
message are displayed on the right side of the screen. To move from one entry
to the next, use the Up Arrow (↑) and Down Arrow (↓) keys.
Some IMM sensors cause assertion events to be logged when their setpoints are
reached. When a setpoint condition no longer exists, a corresponding deassertion
event is logged. However, not all events are assertion-type events.
v Integrated management module (IMM) event log: This log contains a filtered
subset of all IMM, POST, and system management interrupt (SMI) events. You
can view the IMM event log through the IMM Web interface and through the
Dynamic System Analysis (DSA) program (as the ASM event log).
v DSA log: This log is generated by the Dynamic System Analysis (DSA) program,
and it is a chronologically ordered merge of the system-event log (as the IPMI
event log), the IMM event log (as the ASM event log), and the operating-system
event logs. You can view the DSA log through the DSA program.
28 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Viewing event logs from the Setup utility
Use this information to view event logs from the Setup utility.
To view the POST event log or system-event log, complete the following steps:
1. Turn on the server.
2. When the prompt <F1> Setup is displayed, press F1. If you have set both a
power-on password and an administrator password, you must type the
administrator password to view the event logs.
3. Select System Event Logs and use one of the following procedures:
v To view the POST event log, select POST Event Viewer.
v To view the system-event log, select System Event Log.
If you have installed Portable or Installable Dynamic System Analysis (DSA), you
can use it to view the system-event log (as the IPMI event log), the IMM event log
(as the ASM event log), the operating-system event logs, or the merged DAA log.
You can also use DSA Preboot to view these logs, although you must restart the
server to use DSA Preboot. To install Portable DSA, Installable DSA, or DSA
Preboot or to download a DSA Preboot CD image, go to http://www.ibm.com/
support/entry/portal/docdisplay?lndocid=SERV-DSA or complete the following
steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
Procedure
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Software and device drivers.
4. Under Related downloads, click Dynamic System Analysis (DSA) to display
the matrix of downloadable DSA files.
Results
If IPMItool is installed in the server, you can use it to view the system-event log.
Most recent versions of the Linux operating system come with a current version of
IPMItool. Complete the following steps to obtain more information about IPMItool.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
2. In the navigation pane, click IBM System x and BladeCenter Tools Center.
3. Expand Tools reference, expand Configuration tools, expand IPMI tools, and
click IPMItool.
Chapter 3. Diagnostics 29
For an overview of IPMI, complete the following steps:
1. Go to http://publib.boulder.ibm.com/infocenter/toolsctr/v1r0/index.jsp.
2. In the navigation pane, click IBM Systems Information Center.
3. Expand Operating systems, expand Linux information, expand Blueprints for
Linux on IBM systems, and click Using Intelligent Platform Management
Interface (IPMI) on IBM Linux platforms.
You can view the IMM event log through the Event Log link in the integrated
management module (IMM) Web interface.
The following table describes the methods that you can use to view the event logs,
depending on the condition of the server. The first two conditions generally do not
require that you restart the server.
Table 8. Methods for viewing event logs
Condition Action
The server is not hung and is connected to a
network. Use any of the following methods:
v Run Portable or Installable DSA to view
the event logs or create an output file that
you can send to IBM service and support.
v Type the IP address of the IMM and go to
the Event Log page.
v Use IPMItool to view the system-event
log.
The server is not hung and is not connected Use IPMItool locally to view the
to a network. system-event log.
The server is hung. v If DSA Preboot is installed, restart the
server and press F2 to start DSA Preboot
and view the event logs.
v If DSA Preboot is not installed, insert the
DSA Preboot CD and restart the server to
start DSA Preboot and view the event
logs.
v Alternatively, you can restart the server
and press F1 to start the Setup utility and
view the POST event log or system-event
log. For more information, see “Viewing
event logs from the Setup utility” on page
29.
30 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Error messages
This section provides the list of error codes and messages for UEFI/POST, IMM,
and DSA that are generated when a problem is detected.
See “POST error codes” on page 274, “Integrated management module error
messages,” and “Diagnostic messages” on page 289 for more information.
Each message contains date and time information, and it indicates the source of the
message (POST or the IMM).
Chapter 3. Diagnostics 31
Table 9. Events that automatically notify Support (continued)
Automatically
Notify
Event ID Message String Support
806f0021-2582xxxx Fault in slot Yes
[PhysicalConnectorSystemElementName] on
system [ComputerSystemElementName].
(All PCI Error)
806f0021-3001xxxx Fault in slot Yes
[PhysicalConnectorSystemElementName] on
system [ComputerSystemElementName].
(PCI 1)
806f0108-0a01xxxx [PowerSupplyElementName] has Failed. Yes
(Power Supply 1)
806f0108-0a02xxxx [PowerSupplyElementName] has Failed. Yes
(Power Supply 2)
806f010c-2001xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
1)
806f010c-2002xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
2)
806f010c-2003xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
3)
806f010c-2004xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
4)
806f010c-2005xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
5)
806f010c-2006xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
6)
806f010c-2007xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
7)
806f010c-2008xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
8)
806f010c-2009xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
9)
32 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 9. Events that automatically notify Support (continued)
Automatically
Notify
Event ID Message String Support
806f010c-200axxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
10)
806f010c-200bxxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
11)
806f010c-200cxxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
12)
806f010c-200dxxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
13)
806f010c-200exxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
14)
806f010c-200fxxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
15)
806f010c-2010xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
16)
806f010c-2011xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
17)
806f010c-2012xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
18)
806f010c-2581xxxx Uncorrectable error detected for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (All
DIMMS)
806f010d-0400xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 0)
806f010d-0401xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 1)
806f010d-0402xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 2)
Chapter 3. Diagnostics 33
Table 9. Events that automatically notify Support (continued)
Automatically
Notify
Event ID Message String Support
806f010d-0403xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 3)
806f010d-0404xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 4)
806f010d-0405xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 5)
806f010d-0406xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 6)
806f010d-0407xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 7)
806f010d-0408xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 8)
806f010d-0409xxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 9)
806f010d-040axxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 10)
806f010d-040bxxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 11)
806f010d-040cxxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 12)
806f010d-040dxxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 13)
806f010d-040exxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 14)
806f010d-040fxxxx The Drive [StorageVolumeElementName] Yes
has been disabled due to a detected fault.
(Drive 15)
806f011b-0701xxxx The connector Yes
[PhysicalConnectorElementName] has
encountered a configuration error. (Video
USB)
806f0207-0301xxxx [ProcessorElementName] has Failed with Yes
FRB1/BIST condition. (CPU 1)
806f0207-0302xxxx [ProcessorElementName] has Failed with Yes
FRB1/BIST condition. (CPU 2)
34 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 9. Events that automatically notify Support (continued)
Automatically
Notify
Event ID Message String Support
806f020d-0400xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 0)
806f020d-0401xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 1)
806f020d-0402xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 2)
806f020d-0403xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 3)
806f020d-0404xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 4)
806f020d-0405xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 5)
806f020d-0406xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 6)
806f020d-0407xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 7)
806f020d-0408xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 8)
806f020d-0409xxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 9)
806f020d-040axxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 10)
806f020d-040bxxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 11)
806f020d-040cxxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 12)
806f020d-040dxxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 13)
806f020d-040exxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 14)
806f020d-040fxxxx Failure Predicted on drive Yes
[StorageVolumeElementName] for array
[ComputerSystemElementName]. (Drive 15)
Chapter 3. Diagnostics 35
Table 9. Events that automatically notify Support (continued)
Automatically
Notify
Event ID Message String Support
806f0212-2584xxxx The System Yes
[ComputerSystemElementName] has
encountered an unknown system hardware
fault. (CPU Fault Reboot)
806f050c-2001xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
1)
806f050c-2002xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
2)
806f050c-2003xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
3)
806f050c-2004xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
4)
806f050c-2005xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
5)
806f050c-2006xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
6)
806f050c-2007xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
7)
806f050c-2008xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
8)
806f050c-2009xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
9)
806f050c-200axxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
10)
806f050c-200bxxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
11)
36 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 9. Events that automatically notify Support (continued)
Automatically
Notify
Event ID Message String Support
806f050c-200cxxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
12)
806f050c-200dxxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
13)
806f050c-200exxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
14)
806f050c-200fxxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
15)
806f050c-2010xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
16)
806f050c-2011xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
17)
806f050c-2012xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (DIMM
18)
806f050c-2581xxxx Memory Logging Limit Reached for Yes
[PhysicalMemoryElementName] on
Subsystem [MemoryElementName]. (All
DIMMS)
806f060d-0400xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 0)
806f060d-0401xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 1)
806f060d-0402xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 2)
806f060d-0403xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 3)
806f060d-0404xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 4)
806f060d-0405xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 5)
806f060d-0406xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 6)
806f060d-0407xxxx Array [ComputerSystemElementName] has Yes
failed. (Drive 7)
Chapter 3. Diagnostics 37
40000001-00000000
38 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
40000002-00000000 • 40000004-00000000
40000003-00000000 Ethernet Data Rate modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the Ethernet Port data rate.
May also be shown as 4000000300000000 or 0x4000000300000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0003
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
40000004-00000000 Ethernet Duplex setting modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the Ethernet Port duplex setting.
May also be shown as 4000000400000000 or 0x4000000400000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0004
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
Chapter 3. Diagnostics 39
40000005-00000000 • 40000007-00000000
40000005-00000000 Ethernet MTU setting modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the Ethernet Port MTU setting.
May also be shown as 4000000500000000 or 0x4000000500000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0005
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
40000006-00000000 Ethernet locally administered MAC address modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the Ethernet Port MAC address setting.
May also be shown as 4000000600000000 or 0x4000000600000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0006
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
40 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
40000008-00000000 • 4000000a-00000000
40000009-00000000 IP address of network interface modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the IP address of a Management Controller.
May also be shown as 4000000900000000 or 0x4000000900000000
Severity: Info
Alert Category: System - IMM Network event
Serviceable: No
CIM Information: Prefix: IMM and ID: 0009
SNMP Trap ID: 37
Automatically notify Support: No
User response: Information only; no action is required.
4000000a-00000000 IP subnet mask of network interface modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the IP subnet mask of a Management
Controller.
May also be shown as 4000000a00000000 or 0x4000000a00000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0010
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
Chapter 3. Diagnostics 41
4000000b-00000000 • 4000000d-00000000
4000000b-00000000 IP address of default gateway modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the default gateway IP address of a
Management Controller.
May also be shown as 4000000b00000000 or 0x4000000b00000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0011
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
42 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
4000000e-00000000 • 40000010-00000000
4000000e-00000000 Remote Login Successful. Login ID: [arg1] from [arg2] at IP address [arg3].
Explanation: This message is for the use case where a client successfully logs in to a Management Controller.
May also be shown as 4000000e00000000 or 0x4000000e00000000
Severity: Info
Alert Category: System - Remote Login
Serviceable: No
CIM Information: Prefix: IMM and ID: 0014
SNMP Trap ID: 30
Automatically notify Support: No
User response: Information only; no action is required.
40000010-00000000 Security: Userid: [arg1] had [arg2] login failures from WEB client at IP address [arg3].
Explanation: This message is for the use case where a client has failed to log in to a Management Controller from a
web browser.
May also be shown as 4000001000000000 or 0x4000001000000000
Severity: Warning
Alert Category: System - Remote Login
Serviceable: No
CIM Information: Prefix: IMM and ID: 0016
SNMP Trap ID: 30
Automatically notify Support: No
User response: Complete the following steps until the problem is solved: Make sure that the correct login ID and
password are being used. Have the system administrator reset the login ID or password.
Chapter 3. Diagnostics 43
40000011-00000000 • 40000013-00000000
40000011-00000000 Security: Login ID: [arg1] had [arg2] login failures from CLI at [arg3]..
Explanation: This message is for the use case where a client has failed to log in to a Management Controller from
the Legacy CLI.
May also be shown as 4000001100000000 or 0x4000001100000000
Severity: Warning
Alert Category: System - Remote Login
Serviceable: No
CIM Information: Prefix: IMM and ID: 0017
SNMP Trap ID: 30
Automatically notify Support: No
User response: Complete the following steps until the problem is solved: Make sure that the correct login ID and
password are being used. Have the system administrator reset the login ID or password.
40000012-00000000 Remote access attempt failed. Invalid userid or password received. Userid is [arg1] from WEB
browser at IP address [arg2].
Explanation: This message is for the use case where a client has failed to log in to a Management Controller from a
web browser.
May also be shown as 4000001200000000 or 0x4000001200000000
Severity: Info
Alert Category: System - Remote Login
Serviceable: No
CIM Information: Prefix: IMM and ID: 0018
SNMP Trap ID: 30
Automatically notify Support: No
User response: Make sure that the correct login ID and password are being used.
40000013-00000000 Remote access attempt failed. Invalid userid or password received. Userid is [arg1] from
TELNET client at IP address [arg2].
Explanation: This message is for the use case where a client has failed to log in to a Management Controller from a
telnet session.
May also be shown as 4000001300000000 or 0x4000001300000000
Severity: Info
Alert Category: System - Remote Login
Serviceable: No
CIM Information: Prefix: IMM and ID: 0019
SNMP Trap ID: 30
Automatically notify Support: No
User response: Make sure that the correct login ID and password are being used.
44 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
40000014-00000000 • 40000016-00000000
Chapter 3. Diagnostics 45
40000017-00000000 • 40000019-00000000
46 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
4000001a-00000000 • 4000001c-00000000
4000001b-00000000 Management Controller [arg1]: Configuration restored from a file by user [arg2].
Explanation: This message is for the use case where a client restores a Management Controller configuration from a
file.
May also be shown as 4000001b00000000 or 0x4000001b00000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0027
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
Chapter 3. Diagnostics 47
4000001d-00000000 • 4000001f-00000000
4000001f-00000000 Please ensure that the Management Controller [arg1] is flashed with the correct firmware. The
Management Controller is unable to match its firmware to the server.
Explanation: This message is for the use case where a Management Controller firmware version does not match the
server.
May also be shown as 4000001f00000000 or 0x4000001f00000000
Severity: Error
Alert Category: System - other
Serviceable: No
CIM Information: Prefix: IMM and ID: 0031
SNMP Trap ID: 22
Automatically notify Support: No
User response: Update the IMM firmware to a version that the server supports. Important: Some cluster solutions
require specific code levels or coordinated code updates. If the device is part of a cluster solution, verify that the
latest level of code is supported for the cluster solution before you update the code.
48 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
40000020-00000000 • 40000022-00000000
40000020-00000000 Management Controller [arg1] Reset was caused by restoring default values.
Explanation: This message is for the use case where a Management Controller has been reset due to a client
restoring the configuration to default values.
May also be shown as 4000002000000000 or 0x4000002000000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0032
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
40000021-00000000 Management Controller [arg1] clock has been set from NTP server [arg2].
Explanation: This message is for the use case where a Management Controller clock has been set from the Network
Time Protocol server.
May also be shown as 4000002100000000 or 0x4000002100000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0033
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
40000022-00000000 SSL data in the Management Controller [arg1] configuruation data is invalid. Clearing
configuration data region and disabling SSL.
Explanation: This message is for the use case where a Management Controller has detected invalid SSL data in the
configuration data and is clearing the configuration data region and disabling the SSL.
May also be shown as 4000002200000000 or 0x4000002200000000
Severity: Error
Alert Category: System - other
Serviceable: No
CIM Information: Prefix: IMM and ID: 0034
SNMP Trap ID: 22
Automatically notify Support: No
User response: Complete the following steps until the problem is solved: Make sure that the certificate that you are
importing is correct. Try to import the certificate again.
Chapter 3. Diagnostics 49
40000023-00000000 • 40000025-00000000
50 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
40000026-00000000 • 40000028-00000000
Chapter 3. Diagnostics 51
40000029-00000000 • 4000002b-00000000
40000029-00000000 Security: Userid: [arg1] had [arg2] login failures from an SSH client at IP address [arg3].
Explanation: This message is for the use case where a client has failed to log in to a Management Controller from
SSH.
May also be shown as 4000002900000000 or 0x4000002900000000
Severity: Info
Alert Category: System - Remote Login
Serviceable: No
CIM Information: Prefix: IMM and ID: 0041
SNMP Trap ID: 30
Automatically notify Support: No
User response: Complete the following steps until the problem is solved: Make sure that the correct login ID and
password are being used. Have the system administrator reset the login ID or password.
4000002a-00000000 [arg1] firmware mismatch internal to system [arg2]. Please attempt to flash the [arg3]
firmware.
Explanation: This message is for the use case where a specific type of firmware mismatch has been detected.
May also be shown as 4000002a00000000 or 0x4000002a00000000
Severity: Error
Alert Category: System - Other
Serviceable: No
CIM Information: Prefix: IMM and ID: 0042
SNMP Trap ID: 22
Automatically notify Support: No
User response: Reflash the IMM firmware to the latest version.
52 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
4000002c-00000000 • 4000002e-00000000
Chapter 3. Diagnostics 53
4000002f-00000000 • 40000031-00000000
54 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
40000032-00000000 • 40000034-00000000
Chapter 3. Diagnostics 55
40000035-00000000 • 40000037-00000000
56 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
40000038-00000000 • 4000003a-00000000
4000003a-00000000 IPv6 static address of network interface modified from [arg1] to [arg2] by user [arg3].
Explanation: This message is for the use case where a client modifies the IPv6 static address of a Management
Controller
May also be shown as 4000003a00000000 or 0x4000003a00000000
Severity: Info
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0058
SNMP Trap ID:
Automatically notify Support: No
User response: Information only; no action is required.
Chapter 3. Diagnostics 57
4000003b-00000000 • 80010202-0701xxxx
4000003b-00000000
Explanation: This message is for the use case where a DHCP6 server fails to assign an IP address to a Management
Controller.
May also be shown as 4000003b00000000 or 0x4000003b00000000
Severity: Warning
Alert Category: none
Serviceable: No
CIM Information: Prefix: IMM and ID: 0059
SNMP Trap ID:
Automatically notify Support: No
User response: Complete the following steps until the problem is solved: Make sure that the IMM network cable is
connected. Make sure that there is a DHCPv6 server on the network that can assign an IP address to the IMM.
80010002-2801xxxx Numeric sensor [NumericSensorElementName] going low (lower non-critical) has asserted.
(CMOS Battery)
Explanation: This message is for the use case when an implementation has detected a Lower Non-critical sensor
going low has asserted.
May also be shown as 800100022801xxxx or 0x800100022801xxxx
Severity: Warning
Alert Category: Warning - Voltage
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0476
SNMP Trap ID: 13
Automatically notify Support: No
User response: Replace the system battery.
80010202-0701xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has asserted. (Planar
12V)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has asserted.
May also be shown as 800102020701xxxx or 0x800102020701xxxx
Severity: Error
Alert Category: Critical - Voltage
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0480
SNMP Trap ID: 1
Automatically notify Support: Yes
User response: 1. Check power supply n LED. 2. Remove the failing power supply. 3. Follow actions for OVER
SPEC LED in Light path diagnostics LEDs. 4. (Trained technician only) Replace the system board. (n = power supply
number) Planar 3.3V : Planar 5V :
58 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80010202-2801xxxx • 80010204-1d02xxxx
80010202-2801xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has asserted.
(CMOS Battery)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has asserted.
May also be shown as 800102022801xxxx or 0x800102022801xxxx
Severity: Error
Alert Category: Critical - Voltage
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0480
SNMP Trap ID: 1
Automatically notify Support: Yes
User response: 1. Check power supply n LED. 2. Remove the failing power supply. 3. Follow actions for OVER
SPEC LED in Light path diagnostics LEDs. 4. (Trained technician only) Replace the system board. (n = power supply
number)
80010204-1d01xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has asserted. (Fan
1A Tach)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has asserted.
May also be shown as 800102041d01xxxx or 0x800102041d01xxxx
Severity: Error
Alert Category: Critical - Fan Failure
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0480
SNMP Trap ID: 11
Automatically notify Support: No
User response: 1. Reseat the failing fan n, which is indicated by a lit LED near the fan connector on the system
board. 2. Replace the failing fan. (n = fan number) Fan 1B Tach :
80010204-1d02xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has asserted. (Fan
2A Tach)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has asserted.
May also be shown as 800102041d02xxxx or 0x800102041d02xxxx
Severity: Error
Alert Category: Critical - Fan Failure
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0480
SNMP Trap ID: 11
Automatically notify Support: No
User response: 1. Reseat the failing fan n, which is indicated by a lit LED near the fan connector on the system
board. 2. Replace the failing fan. (n = fan number) Fan 2B Tach :
Chapter 3. Diagnostics 59
80010204-1d03xxxx • 80010901-0c01xxxx
80010204-1d03xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has asserted. (Fan
3A Tach)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has asserted.
May also be shown as 800102041d03xxxx or 0x800102041d03xxxx
Severity: Error
Alert Category: Critical - Fan Failure
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0480
SNMP Trap ID: 11
Automatically notify Support: No
User response: 1. Reseat the failing fan n, which is indicated by a lit LED near the fan connector on the system
board. 2. Replace the failing fan. (n = fan number) Fan 3B Tach :
80010701-0c01xxxx Numeric sensor [NumericSensorElementName] going high (upper non-critical) has asserted.
(Ambient Temp)
Explanation: This message is for the use case when an implementation has detected an Upper Non-critical sensor
going high has asserted.
May also be shown as 800107010c01xxxx or 0x800107010c01xxxx
Severity: Warning
Alert Category: Warning - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0490
SNMP Trap ID: 12
Automatically notify Support: No
User response: 1. Reduce the temperature. 2. Check the server airflow. Make sure that nothing is blocking the air
from coming into or preventing the air from exiting the server.
80010901-0c01xxxx Numeric sensor [NumericSensorElementName] going high (upper critical) has asserted.
(Ambient Temp)
Explanation: This message is for the use case when an implementation has detected an Upper Critical sensor going
high has asserted.
May also be shown as 800109010c01xxxx or 0x800109010c01xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0494
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Reduce the ambient temperature. 2. Check the server airflow. Make sure that nothing is blocking
the air from coming into or preventing the air from exiting the server.
60 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80010902-0701xxxx • 80030012-2301xxxx
80010902-0701xxxx Numeric sensor [NumericSensorElementName] going high (upper critical) has asserted.
(Planar 12V)
Explanation: This message is for the use case when an implementation has detected an Upper Critical sensor going
high has asserted.
May also be shown as 800109020701xxxx or 0x800109020701xxxx
Severity: Error
Alert Category: Critical - Voltage
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0494
SNMP Trap ID: 1
Automatically notify Support: Yes
User response: 1. Check power supply n LED. 2. Remove the failing power supply. 3. (Trained technician only)
Replace the system board. (n = power supply number) Planar 3.3V : Planar 5V :
Chapter 3. Diagnostics 61
80070201-0301xxxx • 80070201-0302xxxx
80070201-0301xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (CPU 1
OverTemp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702010301xxxx or 0x800702010301xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-0302xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (CPU 2
OverTemp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702010302xxxx or 0x800702010302xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
62 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070201-2001xxxx • 80070201-2002xxxx
80070201-2001xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 1
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012001xxxx or 0x800702012001xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-2002xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 2
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012002xxxx or 0x800702012002xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
Chapter 3. Diagnostics 63
80070201-2003xxxx • 80070201-2004xxxx
80070201-2003xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 3
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012003xxxx or 0x800702012003xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-2004xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 4
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012004xxxx or 0x800702012004xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
64 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070201-2005xxxx • 80070201-2006xxxx
80070201-2005xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 5
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012005xxxx or 0x800702012005xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-2006xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 6
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012006xxxx or 0x800702012006xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
Chapter 3. Diagnostics 65
80070201-2007xxxx • 80070201-2008xxxx
80070201-2007xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 7
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012007xxxx or 0x800702012007xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-2008xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 8
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012008xxxx or 0x800702012008xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
66 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070201-2009xxxx • 80070201-200axxxx
80070201-2009xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 9
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012009xxxx or 0x800702012009xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-200axxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 10
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 80070201200axxxx or 0x80070201200axxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
Chapter 3. Diagnostics 67
80070201-200bxxxx • 80070201-200cxxxx
80070201-200bxxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 11
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 80070201200bxxxx or 0x80070201200bxxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-200cxxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 12
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 80070201200cxxxx or 0x80070201200cxxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
68 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070201-200dxxxx • 80070201-200exxxx
80070201-200dxxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 13
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 80070201200dxxxx or 0x80070201200dxxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-200exxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 14
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 80070201200exxxx or 0x80070201200exxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
Chapter 3. Diagnostics 69
80070201-200fxxxx • 80070201-2010xxxx
80070201-200fxxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 15
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 80070201200fxxxx or 0x80070201200fxxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-2010xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 16
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012010xxxx or 0x800702012010xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
70 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070201-2011xxxx • 80070201-2012xxxx
80070201-2011xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 17
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012011xxxx or 0x800702012011xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070201-2012xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (DIMM 18
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012012xxxx or 0x800702012012xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
Chapter 3. Diagnostics 71
80070201-2d01xxxx • 80070202-0701xxxx
80070201-2d01xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (IOH Temp
Status)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702012d01xxxx or 0x800702012d01xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n is installed correctly. 4.
(Trained technician only) Replace microprocessor n. (n = microprocessor number)
80070202-0701xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (Planar
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702020701xxxx or 0x800702020701xxxx
Severity: Error
Alert Category: Critical - Voltage
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 1
Automatically notify Support: No
User response: 1. Check the system-event log. 2. Check for an error LED on the system board. 3. Replace any failing
device. 4. Check for a server firmware update. Important: Some cluster solutions require specific code levels or
coordinated code updates. If the device is part of a cluster solution, verify that the latest level of code is supported
for the cluster solution before you update the code. 5. (Trained technician only) Replace the system board.
72 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070204-0a01xxxx • 80070208-0a01xxxx
80070204-0a01xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (PS 1 Fan
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702040a01xxxx or 0x800702040a01xxxx
Severity: Error
Alert Category: Critical - Fan Failure
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 11
Automatically notify Support: No
User response: 1. Make sure that there are no obstructions, such as bundled cables, to the airflow from the
power-supply fan. 2. Replace power supply n. (n = power supply number)
80070204-0a02xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (PS 2 Fan
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702040a02xxxx or 0x800702040a02xxxx
Severity: Error
Alert Category: Critical - Fan Failure
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 11
Automatically notify Support: No
User response: 1. Make sure that there are no obstructions, such as bundled cables, to the airflow from the
power-supply fan. 2. Replace power supply n. (n = power supply number)
80070208-0a01xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (PS 1 OP
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702080a01xxxx or 0x800702080a01xxxx
Severity: Error
Alert Category: Critical - Power
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 4
Automatically notify Support: No
User response: 1. Make sure that there are no obstructions, such as bundled cables, to the airflow from the
power-supply fan. 2. Use the IBM Power Configurator utility to determine current system power consumption. For
more information and to download the utility, go to http://www-03.ibm.com/systems/bladecenter/resources/
powerconfig.html. 3. Replace power supply n. (n = power supply number) PS 1 Therm Fault :
Chapter 3. Diagnostics 73
80070208-0a02xxxx • 8007020f-2582xxxx
80070208-0a02xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (PS 2 OP
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702080a02xxxx or 0x800702080a02xxxx
Severity: Error
Alert Category: Critical - Power
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 4
Automatically notify Support: No
User response: 1. Make sure that there are no obstructions, such as bundled cables, to the airflow from the
power-supply fan. 2. Use the IBM Power Configurator utility to determine current system power consumption. For
more information and to download the utility, go to http://www-03.ibm.com/systems/bladecenter/resources/
powerconfig.html. 3. Replace power supply n. (n = power supply number) PS 2 Therm Fault :
8007020f-2582xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (No I/O
Resources)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 8007020f2582xxxx or 0x8007020f2582xxxx
Severity: Error
Alert Category: Critical - Other
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 50
Automatically notify Support: No
User response: Complete the following steps for PCI I/O resource error issue resolution: 1. Understand the I/O
resource requirements in a basic system. 2. Identify the I/O resource requirements for desired add-in adapters. For
examples, PCI-X or PCIe adapters. 3. Disable on-board devices that you can do without and that request I/O. 4. In F1
setup, select the System Settings then Device and I/O Ports menu. 5. Remove adapters or disable slots until the I/O
resource is less than 64 KB.
74 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070219-0701xxxx • 80070301-0301xxxx
80070219-0701xxxx Sensor [SensorElementName] has transitioned to critical from a less severe state. (Sys Board
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to critical
from less severe.
May also be shown as 800702190701xxxx or 0x800702190701xxxx
Severity: Error
Alert Category: Critical - Other
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0522
SNMP Trap ID: 50
Automatically notify Support: No
User response: 1. Check the system-event log. 2. Check for an error LED on the system board. 3. Replace any failing
device. 4. Check for a server firmware update. Important: Some cluster solutions require specific code levels or
coordinated code updates. If the device is part of a cluster solution, verify that the latest level of code is supported
for the cluster solution before you update the code. 5. (Trained technician only) Replace the system board.
80070301-0301xxxx Sensor [SensorElementName] has transitioned to non-recoverable from a less severe state.
(CPU 1 OverTemp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to
non-recoverable from less severe.
May also be shown as 800703010301xxxx or 0x800703010301xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0524
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n. 4. (Trained technician
only) Replace microprocessor n. (n = microprocessor number)
Chapter 3. Diagnostics 75
80070301-0302xxxx • 80070301-2d01xxxx
80070301-0302xxxx Sensor [SensorElementName] has transitioned to non-recoverable from a less severe state.
(CPU 2 OverTemp)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to
non-recoverable from less severe.
May also be shown as 800703010302xxxx or 0x800703010302xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0524
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n. 4. (Trained technician
only) Replace microprocessor n. (n = microprocessor number)
80070301-2d01xxxx Sensor [SensorElementName] has transitioned to non-recoverable from a less severe state.
(IOH Temp Status)
Explanation: This message is for the use case when an implementation has detected a Sensor transitioned to
non-recoverable from less severe.
May also be shown as 800703012d01xxxx or 0x800703012d01xxxx
Severity: Error
Alert Category: Critical - Temperature
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0524
SNMP Trap ID: 0
Automatically notify Support: No
User response: 1. Make sure that the fans are operating, that there are no obstructions to the airflow (front and rear
of the server), that the air baffles are in place and correctly installed, and that the server cover is installed and
completely closed. 2. Check the ambient temperature. You must be operating within the specifications (see Features
and specifications for more information). 3. Make sure that the heat sink for microprocessor n. 4. (Trained technician
only) Replace microprocessor n. (n = microprocessor number)
76 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
80070603-0701xxxx • 80070608-0a01xxxx
Chapter 3. Diagnostics 77
80070608-0a02xxxx • 800b0108-1381xxxx
78 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
800b010a-1e81xxxx • 800b010a-1e83xxxx
Chapter 3. Diagnostics 79
800b010c-2581xxxx • 800b050a-1e81xxxx
80 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
800b050a-1e82xxxx • 800b050c-2581xxxx
Chapter 3. Diagnostics 81
806f0007-0301xxxx • 806f0007-0302xxxx
82 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0008-0a01xxxx • 806f0009-1381xxxx
Chapter 3. Diagnostics 83
806f000d-0400xxxx • 806f000d-0402xxxx
84 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f000d-0403xxxx • 806f000d-0405xxxx
Chapter 3. Diagnostics 85
806f000d-0406xxxx • 806f000d-0408xxxx
86 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f000d-0409xxxx • 806f000d-040bxxxx
Chapter 3. Diagnostics 87
806f000d-040cxxxx • 806f000d-040exxxx
88 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f000d-040fxxxx • 806f000f-220102xx
806f000f-220101xx The System [ComputerSystemElementName] has detected no memory in the system. (ABR
Status)
Explanation: This message is for the use case when an implementation has detected that memory was detected in
the system.
May also be shown as 806f000f220101xx or 0x806f000f220101xx
Severity: Error
Alert Category: Critical - Memory
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0794
SNMP Trap ID: 41
Automatically notify Support: No
User response: 1. Make sure the server meets the minimum configuration to start (see Power-supply LEDs). 2.
Recover the server firmware from the backup page: a. Restart the server. b. At the prompt, press F3 to recover the
firmware. 3. Update the server firmware on the primary page. Important: Some cluster solutions require specific code
levels or coordinated code updates. If the device is part of a cluster solution, verify that the latest level of code is
supported for the cluster solution before you update the code. 4. Remove components one at a time, restarting the
server each time, to see if the problem goes away. 5. If the problem remains, (Trained technician only) replace the
system board. Firmware Error :
806f000f-220102xx Subsystem [MemoryElementName] has insufficient memory for operation. (ABR Status)
Explanation: This message is for the use case when an implementation has detected that the usable Memory is
insufficient for operation.
May also be shown as 806f000f220102xx or 0x806f000f220102xx
Severity: Error
Alert Category: Critical - Memory
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0132
SNMP Trap ID: 41
Automatically notify Support: No
User response: 1. Make sure the server meets the minimum configuration to start (see Power-supply LEDs). 2.
Update the server firmware on the primary page. Important: Some cluster solutions require specific code levels or
coordinated code updates. If the device is part of a cluster solution, verify that the latest level of code is supported
for the cluster solution before you update the code. 3. (Trained technician only) Replace the system board. Firmware
Error :
Chapter 3. Diagnostics 89
806f000f-220103xx • 806f000f-220107xx
90 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f000f-22010axx • 806f000f-22010bxx
Chapter 3. Diagnostics 91
806f000f-22010cxx • 806f0013-1701xxxx
92 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0021-2201xxxx • 806f0021-3001xxxx
Chapter 3. Diagnostics 93
806f0023-2101xxxx • 806f0107-0302xxxx
solution before you update the code. 4. Remove both adapters. 5. Replace the riser cards. 6. (Trained service
technicians only) Replace the system board. PCI 2 : PCI 3 : PCI 4 : PCI 5 :
94 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0108-0a01xxxx • 806f0109-1381xxxx
Chapter 3. Diagnostics 95
806f010c-2001xxxx • 806f010c-2002xxxx
96 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010c-2003xxxx • 806f010c-2004xxxx
Chapter 3. Diagnostics 97
806f010c-2005xxxx • 806f010c-2006xxxx
98 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010c-2007xxxx • 806f010c-2008xxxx
Chapter 3. Diagnostics 99
806f010c-2009xxxx • 806f010c-200axxxx
100 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010c-200bxxxx • 806f010c-200cxxxx
102 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010c-200fxxxx • 806f010c-2010xxxx
104 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010c-2581xxxx • 806f010d-0400xxxx
806f010d-0400xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 0)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0400xxxx or 0x806f010d0400xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0401xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 1)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0401xxxx or 0x806f010d0401xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0402xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 2)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0402xxxx or 0x806f010d0402xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0403xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 3)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0403xxxx or 0x806f010d0403xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
106 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010d-0404xxxx • 806f010d-0406xxxx
806f010d-0404xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 4)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0404xxxx or 0x806f010d0404xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0405xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 5)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0405xxxx or 0x806f010d0405xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0406xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 6)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0406xxxx or 0x806f010d0406xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0407xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 7)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0407xxxx or 0x806f010d0407xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0408xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 8)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0408xxxx or 0x806f010d0408xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-0409xxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 9)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d0409xxxx or 0x806f010d0409xxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
108 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010d-040axxxx • 806f010d-040cxxxx
806f010d-040axxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive
10)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d040axxxx or 0x806f010d040axxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-040bxxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive
11)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d040bxxxx or 0x806f010d040bxxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-040cxxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive
12)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d040cxxxx or 0x806f010d040cxxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-040dxxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive
13)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d040dxxxx or 0x806f010d040dxxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
806f010d-040exxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive
14)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d040exxxx or 0x806f010d040exxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
110 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f010d-040fxxxx • 806f011b-0701xxxx
806f010d-040fxxxx The Drive [StorageVolumeElementName] has been disabled due to a detected fault. (Drive 15)
Explanation: This message is for the use case when an implementation has detected a Drive was Disabled due to
fault.
May also be shown as 806f010d040fxxxx or 0x806f010d040fxxxx
Severity: Error
Alert Category: Critical - Hard Disk drive
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0164
SNMP Trap ID: 5
Automatically notify Support: Yes
User response: 1. Run the hard disk drive diagnostic test on drive n. 2. Reseat the following components: a. Hard
disk drive (wait 1 minute or more before reinstalling the drive) b. Cable from the system board to the backplane 3.
Replace the following components one at a time, in the order shown, restarting the server each time: a. Hard disk
drive b. Cable from the system board to the backplane c. Hard disk drive backplane (n = hard disk drive number)
112 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0125-0b03xxxx • 806f0125-1d01xxxx
114 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0207-0302xxxx • 806f020d-0400xxxx
116 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f020d-0404xxxx • 806f020d-0406xxxx
118 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f020d-040axxxx • 806f020d-040cxxxx
120 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0212-2584xxxx • 806f0308-0a01xxxx
122 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f030c-2002xxxx • 806f030c-2003xxxx
124 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f030c-2006xxxx • 806f030c-2007xxxx
126 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f030c-200axxxx • 806f030c-200bxxxx
128 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f030c-200exxxx • 806f030c-200fxxxx
130 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f030c-2012xxxx • 806f030c-2581xxxx
132 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f040c-2002xxxx • 806f040c-2004xxxx
to this memory event. If no memory fault is recorded in the logs and no DIMM connector error LED is lit, you can
re-enable the DIMM through the Setup utility or the Advanced Settings Utility (ASU).
134 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f040c-2007xxxx • 806f040c-2009xxxx
to this memory event. If no memory fault is recorded in the logs and no DIMM connector error LED is lit, you can
re-enable the DIMM through the Setup utility or the Advanced Settings Utility (ASU).
136 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f040c-200cxxxx • 806f040c-200exxxx
to this memory event. If no memory fault is recorded in the logs and no DIMM connector error LED is lit, you can
re-enable the DIMM through the Setup utility or the Advanced Settings Utility (ASU).
138 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f040c-2011xxxx • 806f040c-2581xxxx
and restart the server. 3. Check the IBM support website for an applicable retain tip or firmware update that applies
to this memory event. If no memory fault is recorded in the logs and no DIMM connector error LED is lit, you can
re-enable the DIMM through the Setup utility or the Advanced Settings Utility (ASU). One of the DIMMs :
140 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0507-0302xxxx • 806f050c-2001xxxx
142 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f050c-2004xxxx • 806f050c-2005xxxx
144 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f050c-2008xxxx • 806f050c-2009xxxx
146 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f050c-200cxxxx • 806f050c-200dxxxx
148 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f050c-2010xxxx • 806f050c-2011xxxx
150 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f050d-0400xxxx • 806f050d-0402xxxx
152 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f050d-0406xxxx • 806f050d-0408xxxx
154 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f050d-040cxxxx • 806f050d-040exxxx
156 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0607-0301xxxx • 806f0608-0a01xxxx
the problem goes away. 5. If the problem remains, (trained service technician) replace the system board.
806f0607-0301xxxx An SM BIOS Uncorrectable CPU complex error for [ProcessorElementName] has asserted.
(CPU 1)
Explanation: This message is for the use case when an SM BIOS Uncorrectable CPU complex error has asserted.
May also be shown as 806f06070301xxxx or 0x806f06070301xxxx
Severity: Error
Alert Category: Critical - CPU
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0816
SNMP Trap ID: 40
Automatically notify Support: No
User response: 1. Make sure that the installed microprocessors are compatible with each other (see Installing a
microprocessor and heat sink for information about microprocessor requirements). 2. Update the server firmware to
the latest level (see Updating the firmware). 3. (Trained technician only) Replace the incompatible microprocessor.
806f0607-0302xxxx An SM BIOS Uncorrectable CPU complex error for [ProcessorElementName] has asserted.
(CPU 2)
Explanation: This message is for the use case when an SM BIOS Uncorrectable CPU complex error has asserted.
May also be shown as 806f06070302xxxx or 0x806f06070302xxxx
Severity: Error
Alert Category: Critical - CPU
Serviceable: Yes
CIM Information: Prefix: PLAT and ID: 0816
SNMP Trap ID: 40
Automatically notify Support: No
User response: 1. Make sure that the installed microprocessors are compatible with each other (see Installing a
microprocessor and heat sink for information about microprocessor requirements). 2. Update the server firmware to
the latest level (see Updating the firmware). 3. (Trained technician only) Replace the incompatible microprocessor.
158 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f060d-0402xxxx • 806f060d-0404xxxx
160 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f060d-0408xxxx • 806f060d-040axxxx
162 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f060d-040exxxx • 806f070c-2001xxxx
164 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f070c-2005xxxx • 806f070c-2007xxxx
166 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f070c-200bxxxx • 806f070c-200dxxxx
168 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f070c-2011xxxx • 806f070c-2581xxxx
170 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f070d-0403xxxx • 806f070d-0405xxxx
172 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f070d-0409xxxx • 806f070d-040bxxxx
174 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f070d-040fxxxx • 806f0807-0302xxxx
176 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
806f0813-2584xxxx • 806f0a07-0301xxxx
81010002-2801xxxx Numeric sensor [NumericSensorElementName] going low (lower non-critical) has deasserted.
(CMOS Battery)
Explanation: This message is for the use case when an implementation has detected a Lower Non-critical sensor
going low has deasserted.
May also be shown as 810100022801xxxx or 0x810100022801xxxx
Severity: Info
Alert Category: Warning - Voltage
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0477
SNMP Trap ID: 13
Automatically notify Support: No
User response: No action; information only.
81010202-0701xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has deasserted.
(Planar 12V)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has deasserted.
May also be shown as 810102020701xxxx or 0x810102020701xxxx
Severity: Info
Alert Category: Critical - Voltage
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0481
SNMP Trap ID: 1
Automatically notify Support: No
User response: No action; information only. Planar 3.3V : Planar 5V :
178 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81010202-2801xxxx • 81010204-1d02xxxx
81010202-2801xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has deasserted.
(CMOS Battery)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has deasserted.
May also be shown as 810102022801xxxx or 0x810102022801xxxx
Severity: Info
Alert Category: Critical - Voltage
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0481
SNMP Trap ID: 1
Automatically notify Support: No
User response: No action; information only.
81010204-1d01xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has deasserted. (Fan
1A Tach)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has deasserted.
May also be shown as 810102041d01xxxx or 0x810102041d01xxxx
Severity: Info
Alert Category: Critical - Fan Failure
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0481
SNMP Trap ID: 11
Automatically notify Support: No
User response: No action; information only. Fan 1B Tach :
81010204-1d02xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has deasserted. (Fan
2A Tach)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has deasserted.
May also be shown as 810102041d02xxxx or 0x810102041d02xxxx
Severity: Info
Alert Category: Critical - Fan Failure
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0481
SNMP Trap ID: 11
Automatically notify Support: No
User response: No action; information only. Fan 2B Tach :
81010204-1d03xxxx Numeric sensor [NumericSensorElementName] going low (lower critical) has deasserted. (Fan
3A Tach)
Explanation: This message is for the use case when an implementation has detected a Lower Critical sensor going
low has deasserted.
May also be shown as 810102041d03xxxx or 0x810102041d03xxxx
Severity: Info
Alert Category: Critical - Fan Failure
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0481
SNMP Trap ID: 11
Automatically notify Support: No
User response: No action; information only. Fan 3B Tach :
81010701-0c01xxxx Numeric sensor [NumericSensorElementName] going high (upper non-critical) has deasserted.
(Ambient Temp)
Explanation: This message is for the use case when an implementation has detected an Upper Non-critical sensor
going high has deasserted.
May also be shown as 810107010c01xxxx or 0x810107010c01xxxx
Severity: Info
Alert Category: Warning - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0491
SNMP Trap ID: 12
Automatically notify Support: No
User response: No action; information only.
81010901-0c01xxxx Numeric sensor [NumericSensorElementName] going high (upper critical) has deasserted.
(Ambient Temp)
Explanation: This message is for the use case when an implementation has detected an Upper Critical sensor going
high has deasserted.
May also be shown as 810109010c01xxxx or 0x810109010c01xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0495
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
180 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81010902-0701xxxx • 81030012-2301xxxx
81010902-0701xxxx Numeric sensor [NumericSensorElementName] going high (upper critical) has deasserted.
(Planar 12V)
Explanation: This message is for the use case when an implementation has detected an Upper Critical sensor going
high has deasserted.
May also be shown as 810109020701xxxx or 0x810109020701xxxx
Severity: Info
Alert Category: Critical - Voltage
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0495
SNMP Trap ID: 1
Automatically notify Support: No
User response: No action; information only. Planar 3.3V : Planar 5V :
81070201-0301xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (CPU 1
OverTemp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702010301xxxx or 0x810702010301xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-0302xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (CPU 2
OverTemp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702010302xxxx or 0x810702010302xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2001xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 1
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012001xxxx or 0x810702012001xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
182 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81070201-2002xxxx • 81070201-2004xxxx
81070201-2002xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 2
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012002xxxx or 0x810702012002xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2003xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 3
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012003xxxx or 0x810702012003xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2004xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 4
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012004xxxx or 0x810702012004xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2005xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 5
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012005xxxx or 0x810702012005xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2006xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 6
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012006xxxx or 0x810702012006xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2007xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 7
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012007xxxx or 0x810702012007xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
184 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81070201-2008xxxx • 81070201-200axxxx
81070201-2008xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 8
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012008xxxx or 0x810702012008xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2009xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 9
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012009xxxx or 0x810702012009xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-200axxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 10
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 81070201200axxxx or 0x81070201200axxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-200bxxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 11
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 81070201200bxxxx or 0x81070201200bxxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-200cxxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 12
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 81070201200cxxxx or 0x81070201200cxxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-200dxxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 13
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 81070201200dxxxx or 0x81070201200dxxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
186 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81070201-200exxxx • 81070201-2010xxxx
81070201-200exxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 14
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 81070201200exxxx or 0x81070201200exxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-200fxxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 15
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 81070201200fxxxx or 0x81070201200fxxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2010xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 16
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012010xxxx or 0x810702012010xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2011xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 17
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012011xxxx or 0x810702012011xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2012xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (DIMM 18
Temp)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012012xxxx or 0x810702012012xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070201-2d01xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (IOH Temp
Status)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702012d01xxxx or 0x810702012d01xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
188 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81070202-0701xxxx • 81070204-0a02xxxx
81070202-0701xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (Planar
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702020701xxxx or 0x810702020701xxxx
Severity: Info
Alert Category: Critical - Voltage
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 1
Automatically notify Support: No
User response: No action; information only.
81070204-0a01xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (PS 1 Fan
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702040a01xxxx or 0x810702040a01xxxx
Severity: Info
Alert Category: Critical - Fan Failure
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 11
Automatically notify Support: No
User response: No action; information only.
81070204-0a02xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (PS 2 Fan
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702040a02xxxx or 0x810702040a02xxxx
Severity: Info
Alert Category: Critical - Fan Failure
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 11
Automatically notify Support: No
User response: No action; information only.
81070208-0a01xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (PS 1 OP
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702080a01xxxx or 0x810702080a01xxxx
Severity: Info
Alert Category: Critical - Power
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 4
Automatically notify Support: No
User response: No action; information only. PS 1 Therm Fault :
81070208-0a02xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (PS 2 OP
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702080a02xxxx or 0x810702080a02xxxx
Severity: Info
Alert Category: Critical - Power
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 4
Automatically notify Support: No
User response: No action; information only. PS 2 Therm Fault :
8107020f-2582xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (No I/O
Resources)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 8107020f2582xxxx or 0x8107020f2582xxxx
Severity: Info
Alert Category: Critical - Other
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 50
Automatically notify Support: No
User response: No action; information only.
190 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81070219-0701xxxx • 81070301-0302xxxx
81070219-0701xxxx Sensor [SensorElementName] has transitioned to a less severe state from critical. (Sys Board
Fault)
Explanation: This message is for the use case when an implementation has detected a Sensor transition to less
severe from critical.
May also be shown as 810702190701xxxx or 0x810702190701xxxx
Severity: Info
Alert Category: Critical - Other
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0523
SNMP Trap ID: 50
Automatically notify Support: No
User response: No action; information only.
81070301-0301xxxx Sensor [SensorElementName] has deasserted the transition to non-recoverable from a less
severe state. (CPU 1 OverTemp)
Explanation: This message is for the use case when an implementation has detected that the Sensor transition to
non-recoverable from less severe has deasserted.
May also be shown as 810703010301xxxx or 0x810703010301xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0525
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070301-0302xxxx Sensor [SensorElementName] has deasserted the transition to non-recoverable from a less
severe state. (CPU 2 OverTemp)
Explanation: This message is for the use case when an implementation has detected that the Sensor transition to
non-recoverable from less severe has deasserted.
May also be shown as 810703010302xxxx or 0x810703010302xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0525
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070301-2d01xxxx Sensor [SensorElementName] has deasserted the transition to non-recoverable from a less
severe state. (IOH Temp Status)
Explanation: This message is for the use case when an implementation has detected that the Sensor transition to
non-recoverable from less severe has deasserted.
May also be shown as 810703012d01xxxx or 0x810703012d01xxxx
Severity: Info
Alert Category: Critical - Temperature
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0525
SNMP Trap ID: 0
Automatically notify Support: No
User response: No action; information only.
81070603-0701xxxx Sensor [SensorElementName] has deasserted the transition to non-recoverable. (Pwr Rail A
Fault)
Explanation: This message is for the use case when an implementation has detected that the Sensor transition to
non-recoverable has deasserted.
May also be shown as 810706030701xxxx or 0x810706030701xxxx
Severity: Info
Alert Category: Critical - Power
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0531
SNMP Trap ID: 4
Automatically notify Support: No
User response: No action; information only. Pwr Rail B Fault : Pwr Rail C Fault : Pwr Rail D Fault : Pwr Rail E
Fault :
81070608-0701xxxx Sensor [SensorElementName] has deasserted the transition to non-recoverable. (VT Fault)
Explanation: This message is for the use case when an implementation has detected that the Sensor transition to
non-recoverable has deasserted.
May also be shown as 810706080701xxxx or 0x810706080701xxxx
Severity: Info
Alert Category: Critical - Power
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0531
SNMP Trap ID: 4
Automatically notify Support: No
User response: No action; information only.
192 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
81070608-0a01xxxx • 810b010a-1e81xxxx
81070608-0a01xxxx Sensor [SensorElementName] has deasserted the transition to non-recoverable. (PS 1 VCO
Fault)
Explanation: This message is for the use case when an implementation has detected that the Sensor transition to
non-recoverable has deasserted.
May also be shown as 810706080a01xxxx or 0x810706080a01xxxx
Severity: Info
Alert Category: Critical - Power
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0531
SNMP Trap ID: 4
Automatically notify Support: No
User response: No action; information only. PS1 12V OC Fault : PS1 12V OV Fault : PS1 12V UV Fault :
81070608-0a02xxxx Sensor [SensorElementName] has deasserted the transition to non-recoverable. (PS 2 VCO
Fault)
Explanation: This message is for the use case when an implementation has detected that the Sensor transition to
non-recoverable has deasserted.
May also be shown as 810706080a02xxxx or 0x810706080a02xxxx
Severity: Info
Alert Category: Critical - Power
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0531
SNMP Trap ID: 4
Automatically notify Support: No
User response: No action; information only. PS2 12V OC Fault : PS2 12V OV Fault : PS2 12V UV Fault :
194 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
810b030c-2581xxxx • 810b050a-1e82xxxx
196 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f0007-0302xxxx • 816f0008-0a02xxxx
198 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f000d-0402xxxx • 816f000d-0404xxxx
200 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f000d-0408xxxx • 816f000d-040axxxx
202 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f000d-040exxxx • 816f000f-2201ffff
816f000f-2201ffff The System [ComputerSystemElementName] has detected a POST Error deassertion. (ABR
Status)
Explanation: This message is for the use case when an implementation has detected that Post Error has deasserted.
May also be shown as 816f000f2201ffff or 0x816f000f2201ffff
Severity: Info
Alert Category: Critical - Other
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0185
SNMP Trap ID: 50
Automatically notify Support: No
User response: No action; information only. Firmware Error :
204 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f0021-3001xxxx • 816f0107-0302xxxx
206 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f010c-2002xxxx • 816f010c-2004xxxx
208 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f010c-2008xxxx • 816f010c-200axxxx
210 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f010c-200exxxx • 816f010c-2010xxxx
212 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f010d-0400xxxx • 816f010d-0402xxxx
214 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f010d-0406xxxx • 816f010d-0408xxxx
216 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f010d-040cxxxx • 816f010d-040exxxx
816f010f-2201xxxx The System [ComputerSystemElementName] has recovered from a firmware hang. (Firmware
Error)
Explanation: This message is for the use case when an implementation has recovered from a System Firmware
Hang.
May also be shown as 816f010f2201xxxx or 0x816f010f2201xxxx
Severity: Info
Alert Category: Critical - Other
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0187
SNMP Trap ID: 50
Automatically notify Support: No
User response: No action; information only.
218 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f0125-0b01xxxx • 816f0125-0b03xxxx
220 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f0125-1d03xxxx • 816f0207-0302xxxx
222 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f020d-0403xxxx • 816f020d-0405xxxx
224 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f020d-0409xxxx • 816f020d-040bxxxx
226 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f020d-040fxxxx • 816f0308-0a01xxxx
228 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f030c-2003xxxx • 816f030c-2005xxxx
230 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f030c-2009xxxx • 816f030c-200bxxxx
232 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f030c-200fxxxx • 816f030c-2011xxxx
234 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f040c-2001xxxx • 816f040c-2003xxxx
236 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f040c-2007xxxx • 816f040c-2009xxxx
238 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f040c-200dxxxx • 816f040c-200fxxxx
240 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f040c-2581xxxx • 816f0507-0301xxxx
242 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f050c-2003xxxx • 816f050c-2005xxxx
244 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f050c-2009xxxx • 816f050c-200bxxxx
246 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f050c-200fxxxx • 816f050c-2011xxxx
248 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f050d-0401xxxx • 816f050d-0403xxxx
250 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f050d-0407xxxx • 816f050d-0409xxxx
252 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f050d-040dxxxx • 816f050d-040fxxxx
816f0607-0301xxxx An SM BIOS Uncorrectable CPU complex error for [ProcessorElementName] has deasserted.
(CPU 1)
Explanation: This message is for the use case when an SM BIOS Uncorrectable CPU complex error has deasserted.
May also be shown as 816f06070301xxxx or 0x816f06070301xxxx
Severity: Info
Alert Category: Critical - CPU
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0817
SNMP Trap ID: 40
Automatically notify Support: No
User response: No action; information only.
816f0607-0302xxxx An SM BIOS Uncorrectable CPU complex error for [ProcessorElementName] has deasserted.
(CPU 2)
Explanation: This message is for the use case when an SM BIOS Uncorrectable CPU complex error has deasserted.
May also be shown as 816f06070302xxxx or 0x816f06070302xxxx
Severity: Info
Alert Category: Critical - CPU
Serviceable: No
CIM Information: Prefix: PLAT and ID: 0817
SNMP Trap ID: 40
Automatically notify Support: No
User response: No action; information only.
254 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f060d-0401xxxx • 816f060d-0403xxxx
256 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f060d-0407xxxx • 816f060d-0409xxxx
258 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f060d-040dxxxx • 816f060d-040fxxxx
260 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f070c-2004xxxx • 816f070c-2006xxxx
262 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f070c-200axxxx • 816f070c-200cxxxx
264 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f070c-2010xxxx • 816f070c-2012xxxx
266 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f070d-0402xxxx • 816f070d-0404xxxx
268 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f070d-0408xxxx • 816f070d-040axxxx
270 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f070d-040exxxx • 816f0807-0301xxxx
272 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
816f0813-2582xxxx • 816f0a07-0301xxxx
POST
When you turn on the server, it performs a series of tests to check the operation of
the server components and some optional devices in the server. This series of tests
is called the power-on self-test, or POST. This server does not use beep codes for
server status.
If a power-on password is set, you must type the password and press Enter, when
you are prompted, for POST to run.
274 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
0010002 Microprocessor not supported 1. Reseat the following components one at a time,
in the order shown, restarting the server each
time:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2 (if installed)
2. (Trained service technician only) Remove
microprocessor 2 and restart the server.
3. (Trained service technician only) Remove
microprocessor 1 and install microprocessor 2 in
the microprocessor 1 connector. Restart the server.
If the error is corrected, microprocessor 1 is bad
and must be replaced.
4. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. (Trained service technician only)
Microprocessor 1
b. (Trained service technician only)
Microprocessor 2
c. (Trained service technician only) System board
0011000 Invalid microprocessor type 1. Update the firmware to the latest level (see
“Updating the firmware” on page 491).
2. (Trained service technician only) Remove and
replace the affected microprocessor (error LED is
lit) with a supported type.
0011002 Microprocessor mismatch 1. Run the Setup utility and view the
microprocessor information to compare the
installed microprocessor specifications.
2. (Trained service technician only) Remove and
replace one of the microprocessors so that they
both match.
0011004 Microprocessor failed BIST 1. Update the firmware to the latest level (see
“Updating the firmware” on page 491).
2. (Trained service technician only) Reseat
microprocessor 2.
3. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. (Trained service technician only)
Microprocessor
b. (Trained service technician only) System
board
276 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
0051003 Uncorrectable DIMM error Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Manually re-enable all affected DIMMs if the
server firmware version is older than UEFI v1.10.
If the server firmware version is UEFI v1.10 or
newer, disconnect and reconnect the server to the
power source and restart the server.
3. If the problem remains, replace the failing DIMM
(see “Removing a memory module (DIMM)” on
page 448 and “Installing a memory module” on
page 450).
4. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 484 and“Replacing the system board” on
page 486).
5. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged
pins. If a damage is found, replace the system
board (see “Removing the system board” on page
484 and“Replacing the system board” on page
486).
6. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 474 and
“Replacing a microprocessor and heat sink” on
page 477).
0051004 DIMM presence detected read/write failure Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Update the server firmware to the latest level (see
“Updating the firmware” on page 491).
2. Reseat the DIMMs.
3. Install DIMMs in the correct sequence (see
“Installing a memory module” on page 450).
4. Replace the failing DIMM.
5. (Trained service technician only) Replace the
system board.
278 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
0058001 PFA threshold exceeded Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Check the IBM support website for an applicable
retain tip or firmware update that applies to this
memory error.
2. Swap the affected DIMMs (as indicated by the
error LEDs on the system board or the event
logs) to a different memory channel or
microprocessor (see “Installing a memory
module” on page 450).
3. If the error still occurs on the same DIMM,
replace the affected DIMM.
4. (Trained service technician only) If the problem
occurs on the same DIMM connector, check the
DIMM connector. If the connector contains any
foreign material or is damaged, replace the
system board (see “Removing the system board”
on page 484 and “Replacing the system board”
on page 486).
5. (Trained service technician only) Remove the
affected microprocessor and check the
microprocessor socket pins for any damaged
pins. If a damage is found, replace the system
board (see “Removing the system board” on page
484 and “Replacing the system board” on page
486).
6. (Trained Service technician only) Replace the
affected microprocessor (see “Removing a
microprocessor and heat sink” on page 474 and
“Replacing a microprocessor and heat sink” on
page 477).
0058007 DIMM population is unsupported Note: Each time you install or remove a DIMM, you
must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. Reseat the DIMMs and restart the server (see
“Removing a memory module (DIMM)” on page
448 and “Installing a memory module” on page
450).
2. Make sure that the DIMMs are installed in the
proper sequence (see “Installing a memory
module” on page 450).
280 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
00580A1 Invalid DIMM population for mirroring Note: Each time you install or remove a DIMM, you
mode must disconnect the server from the power source;
then, wait 10 seconds before restarting the server.
1. If a fault LED is lit, resolve the failure.
2. Install the DIMMs in the correct sequence (see
“Installing a memory module” on page 450).
00580A4 Memory population changed
Information only. Memory has been added, moved,
or changed.
00580A5 Mirror failover complete
Information only. Memory redundancy has been lost.
Check the event log for uncorrected DIMM failure
events (See “Event logs” on page 28 for more
information).
00580A6 Spare Memory Channel Activated
Information only. Memory online-spare channel has
been activated to backup a failed DIMM. Check the
event log for uncorrected DIMM failure events.
0068002 CMOS battery cleared 1. Reseat the battery.
2. Clear the CMOS memory (see “System-board
switches and jumpers” on page 18).
3. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Battery
b. (Trained service technician only) System
board
2011000 PCI-X PERR 1. Check the riser-card LEDs.
2. Reseat all affected adapters and riser cards.
3. Update the PCI device firmware.
4. Remove both adapters from the riser card.
5. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Riser card
b. (Trained service technician only) System
board
282 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
2018002 Option ROM resource allocation failure
Informational message that some devices might not
be initialized.
1. If possible, rearrange the order of the adapters in
the PCI slots to change the load order of the
optional-device ROM code.
2. Run the Setup utility, select Start Options, and
change the boot priority to change the load order
of the optional-device ROM code.
3. Run the Setup utility and disable some other
resources, if their functions are not being used, to
make more space available:
a. Select Start Options and Planar Ethernet
(PXE/DHCP) to disable the integrated
Ethernet controller ROM.
b. Select Advanced Functions, then PCI Bus
Control, then PCI ROM Control Execution to
disable the ROM of adapters in the PCI slots.
c. Select Devices and I/O Ports to disable any of
the integrated devices.
4. Replace the following components one at a time,
in the order shown, restarting the server each
time:
a. Each adapter
b. (Trained service technician only) System
board
3xx0007 (xx can Firmware fault detected, system halted 1. Recover the server firmware to the latest level.
be 00 - 19)
2. Undo any recent configuration changes, or clear
CMOS memory to restore the settings to the
default values.
3. Remove any recently installed hardware.
3038003 Firmware corrupted 1. Run the Setup utility, select Load Default
Settings, and save the settings to recover the
server firmware.
2. (Trained service technician only) Replace the
system board.
3048005 Booted secondary (backup) server firmware
image Information only. The backup switch was used to
boot the secondary bank.
284 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Error code Description Action
3108007 System configuration restored to default
settings Information only. This is message is usually
associated with the CMOS battery clear event.
3138002 Boot configuration error 1. Remove any recent configuration changes that
you made in the Setup utility.
2. Run the Setup utility, select Load Default
Settings, and save the settings.
3808000 IMM communication failure 1. Shut down the system and remove the power
cords from the server for 30 seconds; then,
reconnect the server to power and restart it.
2. Update the IMM firmware.
3. Make sure that the virtual media key is seated
and not damaged.
4. (Trained service technician only) Replace the
system board.
3808002 Error updating system configuration to 1. Remove power from the server, and then
IMM reconnect the server to power and restart it.
2. Run the Setup utility and select Save Settings.
3. Update the firmware.
3808003 Error retrieving system configuration from 1. Remove power from the server, and then
IMM reconnect the server to power and restart it.
2. Run the Setup utility and select Save Settings.
3. Update the IMM firmware.
3808004 IMM system event log full v When out-of-band, use the IMM Web interface or
IPMItool to clear the logs from the operating
system.
v When using the local console:
1. Run the Setup utility.
2. Select System Event Log.
3. Select Clear System Event Log.
4. Restart the server.
3818001 Core Root of Trust Measurement (CRTM) 1. Run the Setup utility, select Load Default
update failed Settings, and save the settings.
2. (Trained service technician only) Replace the
system board.
3818002 Core Root of Trust Measurement (CRTM) 1. Run the Setup utility, select Load Default
update aborted Settings, and save the settings.
2. (Trained service technician only) Replace the
system board.
286 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Diagnostic programs, messages, and error codes
The diagnostic programs are the primary method of testing the major components
of the server.
As you run the diagnostic programs, text messages are displayed on the screen
and are saved in the test log. A diagnostic text message indicates that a problem
has been detected and provides the action you should take as a result of the text
message.
Make sure that the server has the latest version of the diagnostic programs. To
download the latest version, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Software and device drivers.
4. Click IBM System x3650 M3 to display the matrix of downloadable files for the
server.
Utilities are available to reset and update the code on the integrated USB flash
device, if the diagnostic partition becomes damaged and does not start the
diagnostic programs. For more information and to download the utilities, go to
http://www.ibm.com/jct01004c/systems/support/supportsite.wss/
docdisplay?lndocid=MIGR-5072294&brandind=5000008.
Procedure
1. If the server is running, turn off the server and all attached devices.
2. Turn on all attached devices; then, turn on the server.
3. When the prompt Press F2 for Dynamic System Analysis (DSA) is displayed,
press F2.
Note: After you exit from the stand-alone memory diagnostic environment,
you must restart the server to access the stand-alone memory diagnostic
environment again.
5. Type gui to display the graphical user interface, or select cmd to display the
DSA interactive menu.
6. Follow the instructions on the screen to select the diagnostic test to run.
If the diagnostic programs do not detect any hardware errors but the problem
remains during normal server operations, a software error might be the cause. If
you suspect a software problem, see the information that comes with your
software.
A single problem might cause more than one error message. When this happens,
correct the cause of the first error message. The other error messages usually will
not occur the next time you run the diagnostic programs.
If the server stops during testing and you cannot continue, restart the server and
try running the diagnostic programs again. If the problem remains, replace the
component that was being tested when the server stopped.
Aborted: The test could not proceed because of the server configuration.
To view the test log when the tests are completed, type the view command in the
DSA interactive menu, or select Diagnostic Event Log in the graphical user
interface. To transfer DSA collections to an external USB device, type the copy
command in the DSA interactive menu.
288 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Diagnostic messages
The following table describes the messages that the diagnostic programs might
generate and suggested actions to correct the detected problems. Follow the
suggested actions in the order in which they are listed in the column.
Table 10. DSA Preboot messages
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
089-801-xxx CPU CPU Stress Aborted Internal 1. Turn off and restart the system.
Test program
2. Make sure that the DSA code is at the
error.
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
5. Run the test again.
6. Turn off and restart the system if
necessary to recover from a hung state.
7. Run the test again.
8. Replace the following components one at a
time, in the order shown, and run this test
again to determine whether the problem
has been solved:
a. (Trained service technician only)
Microprocessor board
b. (Trained service technician only)
Microprocessor
9. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
089-802-xxx CPU CPU Stress Aborted System 1. Turn off and restart the system.
Test resource
2. Make sure that the DSA code is at the
availability
latest level. For the latest level of DSA
error.
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
5. Run the test again.
6. Turn off and restart the system if
necessary to recover from a hung state.
7. Run the test again.
8. Make sure that the system firmware is at
the latest level. The installed firmware
level is shown in the DSA event log in
the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
9. Run the test again.
10. Replace the following components one at
a time, in the order shown, and run this
test again to determine whether the
problem has been solved:
a. (Trained service technician only)
Microprocessor board
b. (Trained service technician only)
Microprocessor
11. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
290 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
089-901-xxx CPU CPU Stress Failed Test failure. 1. Turn off and restart the system if
Test necessary to recover from a hung state.
2. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
3. Run the test again.
4. Make sure that the system firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
5. Run the test again.
6. Turn off and restart the system if
necessary to recover from a hung state.
7. Run the test again.
8. Replace the following components one at a
time, in the order shown, and run this test
again to determine whether the problem
has been solved:
a. (Trained service technician only)
Microprocessor board
b. (Trained service technician only)
Microprocessor
9. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-801-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test stopped: the the power source. You must disconnect the
IMM returned system from ac power to reset the IMM.
an incorrect
2. After 45 seconds, reconnect the system to
response
the power source and turn on the system.
length.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
292 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-802-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test stopped: the the power source. You must disconnect the
test cannot be system from ac power to reset the IMM.
completed for
2. After 45 seconds, reconnect the system to
an unknown
the power source and turn on the system.
reason.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-803-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test stopped: the the power source. You must disconnect the
node is busy; system from ac power to reset the IMM.
try later.
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
294 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-804-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test stopped: the power source. You must disconnect the
invalid system from ac power to reset the IMM.
command.
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-805-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
invalid system from ac power to reset the IMM.
command for
2. After 45 seconds, reconnect the system to
the given
the power source and turn on the system.
LUN.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
296 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-806-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
timeout while system from ac power to reset the IMM.
processing the
2. After 45 seconds, reconnect the system to
command.
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-807-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: out the power source. You must disconnect the
of space. system from ac power to reset the IMM.
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
298 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-808-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
reservation system from ac power to reset the IMM.
canceled or
2. After 45 seconds, reconnect the system to
invalid
the power source and turn on the system.
reservation
ID. 3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-809-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
request data system from ac power to reset the IMM.
was
2. After 45 seconds, reconnect the system to
truncated.
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
300 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-810-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
request data system from ac power to reset the IMM.
length is
2. After 45 seconds, reconnect the system to
invalid.
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-811-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
request data system from ac power to reset the IMM.
field length
2. After 45 seconds, reconnect the system to
limit is
the power source and turn on the system.
exceeded.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
302 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-812-xxx IMM IMM I2C Aborted IMM I2C Test 1. Turn off the system and disconnect it from
Test aborted: a the power source. You must disconnect the
parameter is system from ac power to reset the IMM.
out of range.
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-813-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
cannot return system from ac power to reset the IMM.
the number of
2. After 45 seconds, reconnect the system to
requested
the power source and turn on the system.
data bytes.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
304 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-814-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
requested system from ac power to reset the IMM.
sensor, data,
2. After 45 seconds, reconnect the system to
or record is
the power source and turn on the system.
not present.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-815-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
invalid data system from ac power to reset the IMM.
field in the
2. After 45 seconds, reconnect the system to
request.
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
306 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-816-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the the power source. You must disconnect the
command is system from ac power to reset the IMM.
illegal for the
2. After 45 seconds, reconnect the system to
specified
the power source and turn on the system.
sensor or
record type. 3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-817-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: a the power source. You must disconnect the
command system from ac power to reset the IMM.
response
2. After 45 seconds, reconnect the system to
could not be
the power source and turn on the system.
provided.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
308 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-818-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
cannot system from ac power to reset the IMM.
execute a
2. After 45 seconds, reconnect the system to
duplicated
the power source and turn on the system.
request.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-819-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: a the power source. You must disconnect the
command system from ac power to reset the IMM.
response
2. After 45 seconds, reconnect the system to
could not be
the power source and turn on the system.
provided; the
SDR 3. Run the test again.
repository is 4. Make sure that the DSA code is at the
in update latest level. For the latest level of DSA
mode. code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
166-820-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: a the power source. You must disconnect the
command system from ac power to reset the IMM.
response
2. After 45 seconds, reconnect the system to
could not be
the power source and turn on the system.
provided; the
device is in 3. Run the test again.
firmware 4. Make sure that the DSA code and IMM
update mode. firmware are at the latest level.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
310 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-821-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: a the power source. You must disconnect the
command system from ac power to reset the IMM.
response
2. After 45 seconds, reconnect the system to
could not be
the power source and turn on the system.
provided;
IMM 3. Run the test again.
initialization 4. Make sure that the DSA code is at the
is in progress. latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-822-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the the power source. You must disconnect the
destination is system from ac power to reset the IMM.
unavailable.
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
312 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-823-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test aborted: the power source. You must disconnect the
cannot system from ac power to reset the IMM.
execute the
2. After 45 seconds, reconnect the system to
command;
the power source and turn on the system.
insufficient
privilege 3. Run the test again.
level. 4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-824-xxx IMM IMM I2C Aborted IMM I2C test 1. Turn off the system and disconnect it from
Test canceled: the power source. You must disconnect the
cannot system from ac power to reset the IMM.
execute the
2. After 45 seconds, reconnect the system to
command.
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at the
latest level. The installed firmware level is
shown in the diagnostic event log in the
Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
314 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-901-xxx IMM IMM I2C Failed The IMM 1. Turn off the system and disconnect it
Test indicates a from the power source. You must
failure in the disconnect the system from ac power to
H8 bus (Bus reset the IMM.
0)
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Shut down the system and remove the
power cords from the server.
8. (Trained service technician only) Replace
the system board.
9. Reconnect the system to power and turn
on the system.
10. Run the test again.
11. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-902-xxx IMM IMM I2C Failed The IMM 1. Turn off the system and disconnect it
Test indicates a from the power source. You must
failure in the disconnect the system from ac power to
light path bus reset the IMM.
(Bus 1).
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Turn off the system and disconnect it
from the power source.
8. Reseat the light path diagnostics panel.
9. Reconnect the system to the power
source and turn on the system.
10. Run the test again.
11. Turn off the system and disconnect it
from the power source.
12. (Trained service technician only) Replace
the system board.
13. Reconnect the system to the power
source and turn on the system.
14. Run the test again.
15. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
316 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-903-xxx IMM IMM I2C Failed The IMM 1. Turn off the system and disconnect it
Test indicates a from the power source. You must
failure in the disconnect the system from ac power to
DIMM bus reset the IMM.
(Bus 2).
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Disconnect the system from the power
source.
8. Replace the DIMMs one at a time, and
run the test again after replacing each
DIMM.
9. Reconnect the system to the power
source and turn on the system.
10. Run the test again.
11. Turn off the system and disconnect it
from the power source.
12. Reseat all of the DIMMs.
13. Reconnect the system to the power
source and turn on the system.
14. Run the test again.
15. Turn off the system and disconnect it
from the power source.
16. (Trained service technician only) Replace
the system board.
17. Reconnect the system to the power
source and turn on the system.
18. Run the test again.
19. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
Chapter 3. Diagnostics 317
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-904-xxx IMM IMM I2C Failed The IMM 1. Turn off the system and disconnect it
Test indicates a from the power source. You must
failure in the disconnect the system from ac power to
power supply reset the IMM.
bus (Bus 3).
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Reseat the power supply.
8. Run the test again.
9. Turn off the system and disconnect it
from the power source.
10. Trained service technician only) Reseat
the system board.
11. Reconnect the system to the power
source and turn on the system.
12. Run the test again.
13. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
318 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-905-xxx IMM IMM I2C Failed The IMM Note: Ignore the error if the hard disk drive
Test indicates a backplane is not installed.
failure in the 1. Turn off the system and disconnect it
HDD bus from the power source. You must
(Bus 4). disconnect the system from ac power to
reset the IMM.
2. After 45 seconds, reconnect the system to
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Turn off the system and disconnect it
from the power source.
8. Reseat the hard disk drive backplane.
9. Reconnect the system to the power
source and turn on the system.
10. Run the test again.
11. Turn off the system and disconnect it
from the power source.
12. Trained service technician only) Replace
the system board.
13. Reconnect the system to the power
source and turn on the system.
14. Run the test again.
15. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
166-906-xxx IMM IMM I2C Failed The IMM 1. Turn off the system and disconnect it
Test indicates a from the power source. You must
failure in the disconnect the system from ac power to
memory reset the IMM.
configuration
2. After 45 seconds, reconnect the system to
bus (Bus 5).
the power source and turn on the system.
3. Run the test again.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the IMM firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Turn off the system and disconnect it
from the power source.
8. Trained service technician only) Reseat
the system board.
9. Reconnect the system to the power
source and turn on the system.
10. Run the test again.
11. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
320 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-801-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test the server
2. Run the test again.
firmware
programmed 3. Make sure that the server firmware is at
the memory the latest level. The installed firmware
controller level is shown in the diagnostic event log
with an in the Firmware/VPD section for this
invalid CBAR component. For more information, see
address “Updating the firmware” on page 491.
4. Run the test again.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
201-802-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test the end
2. Run the test again.
address in the
E820 function 3. Make sure that all DIMMs are enabled in
is less than 16 the Setup utility.
MB. 4. Make sure that the server firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
5. Run the test again.
6. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
201-803-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test could not
2. Run the test again.
enable the
processor 3. Make sure that the server firmware is at
cache. the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
4. Run the test again.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-804-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test the memory
2. Run the test again.
controller
buffer request 3. Make sure that the server firmware is at
failed. the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
4. Run the test again.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
201-805-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test the memory
2. Run the test again.
controller
display/alter 3. Make sure that the server firmware is at
write the latest level. The installed firmware
operation was level is shown in the diagnostic event log
not in the Firmware/VPD section for this
completed. component. For more information, see
“Updating the firmware” on page 491.
4. Run the test again.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
201-806-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test the memory
2. Run the test again.
controller fast
scrub 3. Make sure that the server firmware is at
operation was the latest level. The installed firmware
not level is shown in the diagnostic event log
completed. in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
4. Run the test again.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
322 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-807-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test the memory
2. Run the test again.
controller
buffer free 3. Make sure that the server firmware is at
request failed. the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
4. Run the test again.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
201-808-xxx Memory Memory Aborted Test canceled: 1. Turn off and restart the system.
Test memory
2. Run the test again.
controller
display/alter 3. Make sure that the server firmware is at
buffer execute the latest level. The installed firmware
error. level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
4. Run the test again.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-809-xxx Memory Memory Aborted Test canceled 1. Turn off and restart the system.
Test program
2. Run the test again.
error:
operation 3. Make sure that the DSA code is at the
running fast latest level. For the latest level of DSA
scrub. code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
4. Make sure that the server firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
5. Run the test again.
6. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
201-810-xxx Memory Memory Aborted Test stopped: 1. Turn off and restart the system.
Test unknown
2. Run the test again.
error code xxx
received in 3. Make sure that the DSA code is at the
COMMONEXIT latest level. For the latest level of DSA
procedure. code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
4. Make sure that the server firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
5. Run the test again.
6. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
324 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-901-xxx Memory Memory Failed Test failure: 1. Turn off the system and disconnect it
Test single-bit from the power source.
error, failing
2. Reseat DIMM z.
DIMM z.
3. Reconnect the system to power and turn
on the system.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the server firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Replace the failing DIMMs.
8. Re-enable all memory in the Setup utility
(see “Using the Setup utility” on page
493).
9. Run the test again.
10. Replace the failing DIMM.
11. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
201-902-xxx Memory Memory Failed Test failure: 1. Turn off the system and disconnect it
Test single-bit and from the power source.
multi-bit
2. Reseat DIMM z.
error, failing
DIMM z 3. Reconnect the system to power and turn
on the system.
4. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
5. Make sure that the server firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
6. Run the test again.
7. Replace the failing DIMMs.
8. Re-enable all memory in the Setup utility
see “Using the Setup utility” on page
493).
9. Run the test again.
10. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
326 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
202-801-xxx Memory Memory Aborted Internal 1. Turn off and restart the system.
Stress Test program
2. Make sure that the DSA code is at the
error.
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
3. Make sure that the server firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
4. Run the test again.
5. Turn off and restart the system if
necessary to recover from a hung state.
6. Run the memory diagnostics to identify
the specific failing DIMM.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
202-802-xxx Memory Memory Failed General error: 1. Make sure that all memory is enabled by
Stress Test memory size checking the Available System Memory in
is insufficient the Resource Utilization section of the
to run the DSA event log. If necessary, enable all
test. memory in the Setup utility (see “Using
the Setup utility” on page 493).
2. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
3. Run the test again.
4. Run the standard memory test to validate
all memory.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
202-901-xxx Memory Memory Failed Test failure. 1. Run the standard memory test to validate
Stress Test all memory.
2. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
3. Turn off the system and disconnect it from
power.
4. Reseat the DIMMs.
5. Reconnect the system to power and turn
on the system.
6. Run the test again.
7. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
328 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
215-801-xxx Optical v Verify Aborted Unable to 1. Make sure that the DSA code is at the
Drive Media communicate latest level. For the latest level of DSA
Installed with the code, go to http://www.ibm.com/
device driver. support/entry/portal/
v Read/
Write Test docdisplay?lndocid=SERV-DSA.
v Self-Test 2. Run the test again.
3. Check the drive cabling at both ends for
Messages loose or broken connections or damage to
and actions the cable. Replace the cable if it is
apply to all damaged.
three tests.
4. Run the test again.
5. For additional troubleshooting
information, go to http://
www.ibm.com/support/
docview.wss?uid=psg1MIGR-41559.
6. Run the test again.
7. Make sure that the system firmware is at
the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
8. Run the test again.
9. Replace the CD/DVD drive.
10. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
215-802-xxx Optical v Verify Aborted The media 1. Close the media tray and wait 15
Drive Media tray is open. seconds.
Installed 2. Run the test again.
v Read/ 3. Insert a new CD/DVD into the drive and
Write Test wait for 15 seconds for the media to be
v Self-Test recognized.
4. Run the test again.
Messages
and actions 5. Check the drive cabling at both ends for
apply to all loose or broken connections or damage to
three tests. the cable. Replace the cable if it is
damaged.
6. Run the test again.
7. Make sure that the DSA code is at the
latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
8. Run the test again.
9. For additional troubleshooting
information, go to http://
www.ibm.com/support/
docview.wss?uid=psg1MIGR-41559.
10. Run the test again.
11. Replace the CD/DVD drive.
12. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
215-803-xxx Optical v Verify Failed The disc 1. Wait for the system activity to stop.
Drive Media might be in
2. Run the test again
Installed use by the
system. 3. Turn off and restart the system.
v Read/
4. Run the test again.
Write Test
5. Replace the CD/DVD drive.
v Self-Test
6. If the failure remains, go to the IBM Web
Messages site for more troubleshooting information
and actions at http://www.ibm.com/support/entry/
apply to all portal/docdisplay?lndocid=SERV-CALL.
three tests.
330 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
215-901-xxx Optical v Verify Aborted Drive media 1. Insert a CD/DVD into the drive or try a
Drive Media is not new media, and wait for 15 seconds.
Installed detected.
2. Run the test again.
v Read/ 3. Check the drive cabling at both ends for
Write Test loose or broken connections or damage to
v Self-Test the cable. Replace the cable if it is
damaged.
Messages
4. Run the test again.
and actions
apply to all 5. For additional troubleshooting
three tests. information, go to http://www.ibm.com/
support/docview.wss?uid=psg1MIGR-
41559.
6. Run the test again.
7. Replace the CD/DVD drive.
8. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
215-902-xxx Optical v Verify Failed Read 1. Insert a CD/DVD into the drive or try a
Drive Media miscompare. new media, and wait for 15 seconds.
Installed 2. Run the test again.
v Read/ 3. Check the drive cabling at both ends for
Write Test loose or broken connections or damage to
v Self-Test the cable. Replace the cable if it is
damaged.
Messages
4. Run the test again.
and actions
apply to all 5. For additional troubleshooting
three tests. information, go to http://www.ibm.com/
support/docview.wss?uid=psg1MIGR-
41559.
6. Run the test again.
7. Replace the CD/DVD drive.
8. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
215-903-xxx Optical v Verify Aborted Could not 1. Insert a CD/DVD into the drive or try a
Drive Media access the new media, and wait for 15 seconds.
Installed drive.
2. Run the test again.
v Read/ 3. Check the drive cabling at both ends for
Write Test loose or broken connections or damage to
v Self-Test the cable. Replace the cable if it is
damaged.
Messages
4. Run the test again.
and actions
apply to all 5. Make sure that the DSA code is at the
three tests. latest level. For the latest level of DSA
code, go to http://www.ibm.com/
support/entry/portal/
docdisplay?lndocid=SERV-DSA.
6. Run the test again.
7. For additional troubleshooting
information, go to http://
www.ibm.com/support/
docview.wss?uid=psg1MIGR-41559.
8. Run the test again.
9. Replace the CD/DVD drive.
10. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
215-904-xxx Optical v Verify Failed A read error 1. Insert a CD/DVD into the drive or try a
Drive Media occurred. new media, and wait for 15 seconds.
Installed 2. Run the test again.
v Read/ 3. Check the drive cabling at both ends for
Write Test loose or broken connections or damage to
v Self-Test the cable. Replace the cable if it is
damaged.
Messages
4. Run the test again.
and actions
apply to all 5. For additional troubleshooting
three tests. information, go to http://www.ibm.com/
support/docview.wss?uid=psg1MIGR-
41559.
6. Run the test again.
7. Replace the CD/DVD drive.
8. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
332 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
217-901-xxx SAS/SATA Disk Drive Failed 1. Reseat all hard disk drive backplane
Hard Drive Test connections at both ends.
2. Reseat the all drives.
3. Run the test again.
4. Make sure that the firmware is at the
latest level.
5. Run the test again.
6. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
405-901-xxx Broadcom Test Control Failed 1. Make sure that the component firmware is
Ethernet Registers at the latest level. The installed firmware
Device level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
2. Run the test again.
3. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
4. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
405-901-xxx Broadcom Test MII Failed 1. Make sure that the component firmware is
Ethernet Registers at the latest level. The installed firmware
Device level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
2. Run the test again.
3. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
4. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
405-902-xxx Broadcom Test Failed 1. Make sure that the component firmware is
Ethernet EEPROM at the latest level. The installed firmware
Device level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
2. Run the test again.
3. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
4. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
334 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
405-903-xxx Broadcom Test Internal Failed 1. Make sure that the component firmware is
Ethernet Memory at the latest level. The installed firmware
Device level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
2. Run the test again.
3. Check the interrupt assignments in the
PCI Hardware section of the DSA event
log. If the Ethernet device is sharing
interrupts, if possible, use the Setup utility
see “Using the Setup utility” on page 493)
to assign a unique interrupt to the device.
4. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
405-904-xxx Broadcom Test Failed 1. Make sure that the component firmware is
Ethernet Interrupt at the latest level. The installed firmware
Device level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
2. Run the test again.
3. Check the interrupt assignments in the
PCI Hardware section of the DSA event
log. If the Ethernet device is sharing
interrupts, if possible, use the Setup utility
see “Using the Setup utility” on page 493)
to assign a unique interrupt to the device.
4. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
336 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 10. DSA Preboot messages (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
405-906-xxx Broadcom Test Loop Failed 1. Check the Ethernet cable for damage and
Ethernet back at make sure that the cable type and
Device Physical connection are correct.
Layer
2. Make sure that the component firmware is
at the latest level. The installed firmware
level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
3. Run the test again.
4. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
5. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
405-907-xxx Broadcom Test Loop Failed 1. Make sure that the component firmware is
Ethernet back at at the latest level. The installed firmware
Device MAC-Layer level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
2. Run the test again.
3. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
4. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
Trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Message
number Component Test State Description Action
405-908-xxx Broadcom Test LEDs Failed 1. Make sure that the component firmware is
Ethernet at the latest level. The installed firmware
Device level is shown in the diagnostic event log
in the Firmware/VPD section for this
component. For more information, see
“Updating the firmware” on page 491.
2. Run the test again.
3. Replace the component that is causing the
error. If the error is caused by an adapter,
replace the adapter. Check the PCI
Information and Network Settings
information in the DSA event log to
determine the physical location of the
failing component.
4. If the failure remains, go to the IBM Web
site for more troubleshooting information
at http://www.ibm.com/support/entry/
portal/docdisplay?lndocid=SERV-CALL.
Checkout procedure
The checkout procedure is the sequence of tasks that you should follow to
diagnose a problem in the server.
338 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
microprocessor socket. See “Microprocessor problems” on page 349 for
information about diagnosing microprocessor problems.
v Before you run the diagnostic programs, you must determine whether the failing
server is part of a shared hard disk drive cluster (two or more servers sharing
external storage devices). If it is part of a cluster, you can run all diagnostic
programs except the ones that test the storage unit (that is, a hard disk drive in
the storage unit) or the storage adapter that is attached to the storage unit. The
failing server might be part of a cluster if any of the following conditions is true:
– You have identified the failing server as part of a cluster (two or more servers
sharing external storage devices).
– One or more external storage units are attached to the failing server and at
least one of the attached storage units is also attached to another server or
unidentifiable device.
– One or more servers are located near the failing server.
Important: If the server is part of a shared hard disk drive cluster, run one test
at a time. Do not run any suite of tests, such as “quick” or “normal” tests,
because this might enable the hard disk drive diagnostic tests.
v If the server is halted and a POST error code is displayed, see “Event logs” on
page 28. If the server is halted and no error message is displayed, see
“Troubleshooting tables” on page 340 and “Solving undetermined problems” on
page 378.
v For information about power-supply problems, see “Solving power problems”
on page 376.
v For intermittent problems, check the error log; see “Event logs” on page 28 and
“Diagnostic programs, messages, and error codes” on page 287.
Troubleshooting tables
Use the troubleshooting tables to find solutions to problems that have identifiable
symptoms.
If you cannot find a problem in these tables, see “Running the diagnostic
programs” on page 287 for information about testing the server.
If you have just added new software or a new optional device and the server is
not working, complete the following steps before you use the troubleshooting
tables:
1. Check the system-error LED on the operator information panel; if it is lit, check
the LEDs on the system board (see “System-board LEDs” on page 22).
2. Remove the software or device that you just added.
3. Run the diagnostic tests to determine whether the server is running correctly.
4. Reinstall the new software or new device.
340 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
DVD drive problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The optional DVD drive is not 1. Make sure that:
recognized.
v The SATA channel to which the DVD drive is attached (primary) is enabled
in the Setup utility.
v All cables and jumpers are installed correctly (see “Internal cable routing
and connectors” on page 398).
v The signal cable and connector are not damaged and the connector pins are
not bent.
v All damaged parts are repaired or replaced.
v The correct device driver is installed for the DVD drive.
2. Run the DVD drive diagnostic programs and select the optical drive test. See
“Running the diagnostic programs” on page 287.
3. Reseat the following components:
a. DVD drive
b. DVD drive cable
4. Replace the components listed in step 3 one at a time, in the order shown,
restarting the server each time.
The CD or DVD drive is not 1. Clean the CD or DVD.
working correctly.
2. Replace the CD or DVD with new CD or DVD media
3. Run the DVD drive diagnostic programs.
4. Reseat the DVD drive.
5. Replace the DVD drive.
The DVD drive tray is not 1. Make sure that the server is turned on.
working.
2. Insert the end of a straightened paper clip into the manual tray-release
opening.
3. Reseat the DVD drive.
4. Replace the DVD drive.
342 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom Action
An installed hard disk drive is 1. Observe the associated amber hard disk drive status LED. If the LED is lit, it
not recognized. indicates a drive fault.
2. If the LED is lit, remove the drive from the bay, wait 45 seconds, and reinsert
the drive, making sure that the drive assembly connects to the hard disk drive
backplane.
3. Observe the associated green hard disk drive activity LED and the amber
status LED:
v If the green activity LED is flashing and the amber status LED is not lit, the
drive is recognized by the controller and is working correctly. Run the DSA
hard disk drive test to determine whether the drive is detected.
v If the green activity LED is flashing and the amber status LED is flashing
slowly, the drive is recognized by the controller and is rebuilding.
v If neither LED is lit or flashing, check the hard disk drive backplane (go to
step 4).
v If the green activity LED is flashing and the amber status LED is lit, replace
the drive. If the activity of the LEDs remains the same, go to step 4. If the
activity of the LEDs changes, return to step 1.
4. Make sure that the hard disk drive backplane is correctly seated. When it is
correctly seated, the drive assemblies correctly connect to the backplane
without bowing or causing movement of the backplane.
5. Move the hard disk drives to different bays to determine if the drive or the
backplane is not functioning.
6. Reseat the backplane power cable and repeat steps 1 through 3.
7. Reseat the backplane signal cable and repeat steps 1 through 3.
8. Suspect the backplane signal cable or the backplane:
v If the server has eight hot-swap bays:
a. Replace the affected backplane signal cable.
b. Replace the affected backplane.
v If the server has 12 hot-swap bays:
a. Replace the backplane signal cable.
b. Replace the backplane.
c. Replace the SAS expander card.
9. See “Problem determination tips” on page 379.
Multiple hard disk drives fail. Make sure that the hard disk drive, SAS RAID controller, and server device
drivers and firmware are at the latest level.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
Multiple hard disk drives are 1. Review the storage subsystem logs for indications of problems within the
offline. storage subsystem, such as backplane or cable problems.
2. See “Problem determination tips” on page 379.
Hypervisor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
If an optional hypervisor device 1. Make sure that the optional hypervisor device is selected on the boot menu (in
is not listed in the expected the Setup utility and in F12).
boot order, doesn't appear in
2. Make sure that the hypervisor internal flash memory device is seated in the
the list of boot devices at all, or
connector correctly (see s and “Replacing a USB hypervisor memory key” on
a similar problem has occurred.
page 413).
3. See the documentation that comes with your optional hypervisor device for
setup and configuration information.
4. Make sure that other software works on the server.
344 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Intermittent problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
A problem occurs only 1. Make sure that:
occasionally and is difficult to v All cables and cords are connected securely to the rear of the server and
diagnose. attached devices.
v When the server is turned on, air is flowing from the fan grille. If there is no
airflow, the fans are not working. This can cause the server to overheat and
shut down.
2. Check the system event log or IMM event log (see “Event logs” on page 28).
3. Make sure that the server and IMM firmware has been updated to the most
recent code levels.
4. Review the operating system logs.
5. Contact your operating-system vendor to set up any available tools that are
capable of monitoring the server.
6. If an error occurs, run the DSA program and forward the results to IBM
service and support for analysis.
7. See “Solving undetermined problems” on page 378.
The server resets (restarts) 1. If the reset occurs during POST and the POST watchdog timer is enabled (click
occasionally. Advanced Setup --> Integrated Management Module (IMM) Setting --> IMM
Post Watchdog in the Setup utility to see the POST watchdog setting), make
sure that sufficient time is allowed in the watchdog timeout value (IMM POST
Watchdog Timeout). See the Installation and User’s Guide for information about
the settings in the Setup utility.
If the server continues to reset during POST, see “POST” on page 274 and
“Diagnostic programs, messages, and error codes” on page 287.
2. If the reset occurs after the operating system starts, disable any automatic
server restart (ASR) utilities, such as the IBM Automatic Server Restart IPMI
Application for Windows, or any ASR devices that are installed.
Note: ASR utilities operate as operating-system utilities and are related to the
IPMI device driver.
If the reset continues to occur after the operating system starts, the operating
system might have a problem; see “Software problems” on page 359.
3. If neither condition applies, check the system-event log (see “Event logs” on
page 28).
346 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Memory problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v For additional memory troubleshooting information, refer to the "Troubleshooting Memory - IBM BladeCenter
and System x" document at http://www-947.ibm.com/support/entry/portal/docdisplay?brand=5000020
&lndocid=MIGR-5081319.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The amount of system memory Note: Each time you install or remove a DIMM, you must disconnect the server
that is displayed is less than the from the power source; then, wait 10 seconds before restarting the server.
amount of installed physical 1. Make sure that:
memory.
v No error LEDs are lit on the operator information panel.
v Memory mirroring does not account for the discrepancy.
v The memory modules are seated correctly.
v You have installed the correct type of memory (see “Installing a memory
module” on page 450).
v If you changed the memory, you updated the memory configuration in the
Setup utility.
v All banks of memory are enabled. The server might have automatically
disabled a memory bank when it detected a problem, or a memory bank
might have been manually disabled.
2. Check the POST event log for error message 289:
v If a DIMM was disabled by a systems-management interrupt (SMI), replace
the DIMM.
v If a DIMM was disabled by the user or by POST, run the Setup utility and
enable the DIMM.
3. Run memory diagnostics (see “Running the diagnostic programs” on page
287).
4. Make sure that there is no memory mismatch when the server is at the
minimum memory configuration (one 1 GB DIMM in slot 3).
5. Add one pair of DIMMs at a time, making sure that the DIMMs in each pair
match. Install the DIMMs in the sequence that is described in “Installing a
memory module” on page 450.
6. Reseat the DIMMs, and then restart the server.
7. Reverse the DIMMs between the channels (of the same microprocessor), and
then restart the server. If the problem is related to a DIMM, replace the failing
DIMM.
8. (Trained service technician only) Install the failing DIMM into a DIMM
connector for microprocessor 2 (if installed) to verify that the problem is not
the microprocessor or the DIMM connector.
9. (Trained service technician only) Replace the system board.
348 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Microprocessor problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The server goes directly to the 1. Correct any errors that are indicated by the LEDs (see “Light path diagnostics
POST Event Viewer when LEDs” on page 363).
turned on.
2. Make sure that the server supports all the microprocessors and that the
microprocessors match in speed and cache size. To compare the microprocessor
information, run the Setup utility and select System Information, then select
System Summary , and then Processor Details.
3. (Trained service technician only) Reseat the microprocessors.
4. (Trained service technician only) Remove microprocessor 2 and restart the
server.
5. (Trained service technician only) Replace the following components, in the
order shown, restarting the server each time:
v Microprocessors
v System board
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
Testing the monitor. 1. Make sure that the monitor cables are firmly connected.
2. Try using the other video port.
3. Try using a different monitor on the server, or try testing the monitor on a
different server.
4. Run the diagnostic programs (see “Running the diagnostic programs” on page
287). If the monitor passes the diagnostic programs, the problem might be a
video device driver.
5. (Trained service technician only) Replace the system board
350 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The monitor has screen jitter, or 1. If the monitor self-tests show that the monitor is working correctly, consider
the screen image is wavy, the location of the monitor. Magnetic fields around other devices (such as
unreadable, rolling, or transformers, appliances, fluorescent lights, and other monitors) can cause
distorted. screen jitter or wavy, unreadable, rolling, or distorted screen images. If this
happens, turn off the monitor.
Attention: Moving a color monitor while it is turned on might cause screen
discoloration.
Move the device and the monitor at least 305 mm (12 in.) apart, and turn on
the monitor.
Note:
a. To prevent diskette drive read/write errors, make sure that the distance
between the monitor and any external diskette drive is at least 76 mm (3
in.).
b. Non-IBM monitor cables might cause unpredictable problems.
2. Reseat the monitor cable
3. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Monitor cable
b. Video adapter (if one is installed)
c. Monitor
d. (Trained service technician only) System board
Wrong characters appear on the 1. If the wrong language is displayed, update the server firmware with the
screen. correct language.
2. Reseat the monitor cable.
3. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Monitor
b. (Trained service technician only) System board
352 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Power problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The power-control button does 1. Make sure Make sure that both power supplies installed in the server are of
not work, and the reset button the same type. Mixing different power supplies in the server will cause a
does not work (the server does system error (the system-error LED on the front panel turns on and the PS and
not start). CNFG LEDs on the operator information panel are lit).
Note: The power-control button
2. Make sure that:
will not function until
v The power cords are correctly connected to the server and to a working
approximately 3 minutes after
electrical outlet.
the server has been connected
v The type of memory that is installed is correct.
to power.
v The LEDs on the power supply do not indicate a problem (see
“Power-supply LEDs” on page 367).
v The microprocessors are installed in the correct sequence.
3. Make sure that the power-control button and the reset button are working
correctly:
a. Disconnect the server power cords.
b. Reseat the operator information panel assembly cable.
c. Reconnect the power cords.
d. Press the power-control button to restart the server. If the button does not
work, replace the operator information panel assembly.
e. Press the reset button (on the light path diagnostics panel) to restart the
server. If the button does not work, replace the operator information panel
assembly.
4. Replace the following components one at a time, in the order shown, restarting
the server each time:
a. Hot-swap power supplies
b. (Trained service technician only) System board
5. See “Solving power problems” on page 376.
6. See “Solving undetermined problems” on page 378.
354 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The OVER SPEC LED on the 1. Disconnect the server power cords.
light path diagnostics panel is
2. Remove the following components:
lit, and the 12v channel C LED
on the system board is lit. v Tape drive, if one is installed (see “Removing a tape drive” on page 446for
more information)
v SAS riser-card assembly
v DIMMs 1 through 9
v (Trained service technician only) Microprocessor 1
3. Move switch 2 on switch block 4 (SW4) to the On position to force power on;
then, restart the server. If the OVER SPEC and 12v channel LEDs are still lit,
(trained service technician only) replace the system board.
4. Reinstall the components listed in step 2 one at a time, in the order shown,
restarting the server each time. If the 12v channel C LED is lit, the component
that you just reinstalled is defective. Replace the defective component.
v (Trained service technician only) Microprocessor 1
v DIMMs 1 through 9
v SAS riser-card assembly
v Tape drive, if one is installed (see “Replacing a tape drive” on page 447 for
more information)
The OVER SPEC LED on the 1. Disconnect the server power cords.
light path diagnostics panel is
2. (Trained service technician only) Remove microprocessor 1.
lit, and the 12v channel D LED
on the system board is lit. 3. Move switch 2 on switch block 4 (SW4) to the On position to force power on;
then, restart the server. If the OVER SPEC and 12v channel LEDs are still lit,
(trained service technician only) replace the system board.
4. (Trained service technician only) Move switch on switch block back to the Off
position; then, reinstall microprocessor 1.
5. Restart the server. If the 12v channel D LED is lit, (trained service technician
only) replace microprocessor 1.
356 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
The server unexpectedly shuts See “Solving undetermined problems” on page 378.
down, and the LEDs on the
operator information panel are
not lit.
358 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Software problems
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
Symptom Action
You suspect a software 1. To determine whether the problem is caused by the software, make sure that:
problem. v The server has the minimum memory that is needed to use the software. For
memory requirements, see the information that comes with the software. If
you have just installed an adapter or memory, the server might have a
memory-address conflict.
v The software is designed to operate on the server.
v Other software works on the server.
v The software works on another server.
2. If you received any error messages when using the software, see the
information that comes with the software for a description of the messages and
suggested solutions to the problem.
3. Contact the software vendor.
When LEDs are lit to indicate an error, they remain lit when the server is turned
off, provided that the server is still connected to power and the power supply is
operating correctly.
Before you work inside the server to view light path diagnostics LEDs, read the
safety information that begins on page “Safety” on page vii and “Handling
static-sensitive devices” on page 398.
If an error occurs, view the light path diagnostics LEDs in the following order:
1. Look at the operator information panel on the front of the server.
v If the information LED is lit, it indicates that information about a suboptimal
condition in the server is available in the IMM event log or in the system
event log.
v If the system-error LED is lit, it indicates that an error has occurred; go to
step 2.
The following illustration shows the operator information panel.
Operator information
panel
Light path
diagnostics LEDs
Release latch
2. To view the light path diagnostics panel, slide the latch to the left on the front
of the operator information panel and pull the panel forward. This reveals the
light path diagnostics panel. Lit LEDs on this panel indicate the type of error
that has occurred.
360 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 18. Operator information panel
Checkpoint
code display
Note any LEDs that are lit, and then push the light path diagnostics panel back
into the server.
Note:
a. Do not run the server for an extended period of time while the light path
diagnostics panel is pulled out of the server.
b. Light path diagnostics LEDs remain lit only while the server is connected to
power.
Look at the system service label on the top of the server, which gives an
overview of internal components that correspond to the LEDs on the light path
diagnostics panel. This information and the information in “Light path
diagnostics LEDs” on page 363 can often provide enough information to
diagnose the error.
3. Remove the server cover and look inside the server for lit LEDs. A lit LED on
or beside a component identifies the component that is causing the error.
The following illustration shows the LEDs on the system board.
12v channel error LEDs indicate an overcurrent condition. Table 12 on page 376
identifies the components that are associated with each power channel, and the
order in which to troubleshoot the components.
The following illustration shows the LEDs on the riser card.
362 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Remind button
You can use the remind button on the light path diagnostics panel to put the
system-error LED on the operator information panel into Remind mode.
When you press the remind button, you acknowledge the error but indicate that
you will not take immediate action. The system-error LED flashes while it is in
Remind mode and stays in Remind mode until one of the following conditions
occurs:
v All known errors are corrected.
v The server is restarted.
v A new error occurs, causing the system-error LED to be lit again.
Note: Check the system-event log and the IMM event log for additional
information before you replace a FRU.
364 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
LED Problem Action
DASD A hard disk drive error has occurred. A 1. Check the LEDs on the hard disk drives for the drive with
hard disk drive has failed or is missing. a lit status LED and reseat the hard disk drive.
2. Reseat the hard disk drive backplane.
3. For more information, see “Hard disk drive problems” on
page 342.
4. If the error remains, replace the following components in
the order listed, restarting the server after each:
a. Replace the hard disk drive (see “Removing a hot-swap
hard disk drive” on page 440 for more information).
b. Replace the hard disk drive backplane (see “Removing
the SAS hard disk drive backplane” on page 469 for
more information).
5. If the problem remains, go to http://www.ibm.com/
support/entry/portal/docdisplay?lndocid=SERV-CALL.
FAN A fan has failed, is operating too slowly, 1. Reseat the failing fan, which is indicated by a lit LED near
or has been removed. The TEMP LED the fan connector on the system board..
might also be lit.
2. Replace the failing fan, which is indicated by a lit LED
near the fan connector on the system board (see
“Removing a hot-swap fan” on page 457 for more
information).
Note:
1. If an LED that is next to an unused fan connector on the
system board is lit, a PCI riser-card assembly might be
missing; replace the PCI riser-card assembly. One PCI
riser-card assembly must always be present in PCI riser
connector 2.
2. When the BRD LED is lit and all cooling zones all asserted
at the same time by removing the cover and check if PCI
riser-card 2 LED is on.
LINK Reserved.
LOG An error message has been written to Check the IMM system event log and the system-error log for
the system-event log information about the error. Replace any components that are
identified in the error logs. (See “Event logs” on page 28 for
more information.)
366 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
LED Problem Action
SP The service processor (the IMM) has 1. Remove power from the server; then, reconnect the server
failed. to power and restart the server.
2. Update the firmware on the IMM.
3. If the failure remains, go to http://www.ibm.com/
support/entry/portal/docdisplay?lndocid=SERV-CALL for
additional troubleshooting information.
TEMP The system temperature has exceeded a 1. Check the error log to identify where the over-temperature
threshold level. A failing fan can cause condition was measured. If a fan has failed, replace it.
the TEMP LED to be lit. 2. Make sure that the room temperature is not too high. See
“Features and specifications” on page 7 for temperature
information.
3. Make sure that the air vents are not blocked.
4. If the failure remains, go to http://www.ibm.com/
support/entry/portal/docdisplay?lndocid=SERV-CALL for
additional troubleshooting information.
VRM Reserved.
Power-supply LEDs
The following minimum configuration is required for the DC LED on the power
supply to be lit:
v Power supply
v Power cord
v
The following table describes the problems that are indicated by various
combinations of the power-supply LEDs and the power-on LED on the operator
information panel and suggested actions to correct the detected problems.
Table 11. Power-supply LEDs
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
AC power-supply LEDs
AC DC Error Description Action Notes
Off Off Off No ac power to 1. Check the ac power to the server. This is a normal
the server or a condition when no
2. Make sure that the power cord is
problem with the ac power is present.
connected to a functioning power
ac power source
source.
3. Turn the server off and then turn the
server back on.
4. If the problem remains, replace the
power supply.
Off Off On No ac power to 1. Replace the power supply. This happens only
the server or a when a second
2. Make sure that the power cord is
problem with the power supply is
connected to a functioning power
ac power source providing power to
source.
and the power the server.
supply had
detected an
internal problem
Off On Off Faulty power Replace the power supply.
supply
Off On On Faulty power Replace the power supply.
supply
368 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 11. Power-supply LEDs (continued)
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is
solved.
v See Chapter 4, “Parts listing,” on page 381 to determine which components are customer replaceable units
(CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
v Go to the IBM support Web site at http://www.ibm.com/systems/support/ to check for technical information,
hints, tips, and new device drivers or to submit a request for information.
AC power-supply LEDs
AC DC Error Description Action Notes
On Off Off Power-supply 1. (Trained service technician only) Typically indicates
not fully seated, Reseat the power supply. that a power supply
faulty system is not fully seated.
2. If a power channel error LED on the
board, or faulty
system board is not lit, replace the
power-supply
power-supply (see the documentation
that comes with the power supply for
instructions).
3. If a power channel error LED on the
system board is lit, (trained service
technician only) replace the system
board.
On Off or On Faulty power Replace the power supply.
Flashing supply
On On Off Normal
operation
On On On Power supply is Replace the power supply.
faulty but still
operational
The following table describes the problems that are indicated by various
combinations of the power-supply LEDs on a dc power supply and suggested
actions to correct the detected problems.
DC power-supply LEDs
IN OK OUT OK Error (!) Description Action Notes
On On Off Normal operation
Off Off Off No dc power to the 1. Check the dc power to the This is a normal
server or a problem server. condition when no dc
with the dc power power is present.
2. Make sure that the power
source.
cord is connected to a
functioning power source.
3. Restart the server. If the error
remains, check the
power-supply LEDs.
4. Replace the power-supply.
Tape alert flags are numbered 1 through 64 and indicate specific media-changer
error conditions. Each tape alert is returned as an individual log parameter, and its
state is indicated in bit 0 of the 1-byte Parameter Value field of the log parameter.
When this bit is set to 1, the alert is active.
Each tape alert flag has one of the following severity levels:
C: Critical
W: Warning
I: Information
Different tape drives support some or all of the following flags in the tape alert
log:
370 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Flag 2: Library Hardware B (W) This flag is set when an unrecoverable
mechanical error occurs.
Flag 4: Library Hardware D (C) This flag is set when the tape drive fails the
power-on self-test or a mechanical error occurs that requires a power cycle to
recover. This flag is internally cleared when the drive is powered-off.
Flag 13: Library Pick Retry (W) This flag is set when a high retry count
threshold is passed during an operation to pick a cartridge from a slot before
the operation succeeds. This flag is internally cleared when another pick
operation is attempted.
Flag 14: Library Place Retry (W) This flag is set when a high retry count
threshold is passed during an operation to place a cartridge back into a slot
before the operation succeeds. This flag is internally cleared when another place
operation is attempted.
Flag 15: Library Load Retry (W) This flag is set when a high retry count
threshold is passed during an operation to load a cartridge into a drive before
the operation succeeds. This flag is internally cleared when another load
operation is attempted. Note that if the load operation fails because of a media
or drive problem, the drive sets the applicable tape alert flags.
Flag 16: Library Door (C) This flag is set when media move operations cannot
be performed because a door is open. This flag is internally cleared when the
door is closed.
Flag 23: Library Scan Retry (W) This flag is set when a high retry count
threshold is passed during an operation to scan the bar code on a cartridge
before the operation succeeds. This flag is internally cleared when another bar
code scanning operation is attempted.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
If the server firmware has become corrupted, such as from a power failure during
an update, you can recover the server firmware in one of two ways:
v In-band method: Recover server firmware, using either the boot block jumper
(Automated Boot Recovery) and a server Firmware Update Package Service
Pack.
v Out-of-band method: Use the IMM Web Interface to update the firmware, using
the latest server firmware update package.
Note: You can obtain a server firmware update package from one of the following
sources:
v Download the server firmware update from the World Wide Web.
v Contact your IBM service representative.
To download the server firmware update package from the World Wide Web,
complete the following steps.
The flash memory of the server consists of a primary bank and a backup bank. It is
essential that you maintain the backup bank with a bootable firmware image. If the
primary bank becomes corrupted, you can either manually boot the backup bank
with the boot block jumper, or in the case of image corruption, this will occur
automatically with the Automated Boot Recovery function.
To recover the server firmware and restore the server operation to the primary
bank, complete the following steps:
Procedure
1. Turn off the server, and disconnect all power cords and external cables.
2. Remove the server cover. See “Removing the cover” on page 402 for more
information.
3. Unlock and remove the server cover (see “Removing the cover” on page 402
for more information.
4. Locate the UEFI boot recovery jumper block (J29) on the system board.
372 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
IMM recovery jumper
(J147)
1
2
UEFI boot recovery 3 3
jumper (J29) 2
1
5. Remove any adapters that impede access to the UEFI boot recovery jumper
block (J29) (see “Removing a PCI adapter from a PCI riser-card assembly” on
page 417).
6. Move the jumper from pins 1 and 2 to pins 2 and 3 to enable the UEFI
recovery mode.
7. Reinstall any adapter that you removed before (see “Installing a PCI adapter
in a PCI riser-card assembly” on page 418).
8. Reinstall the server cover (see “Replacing the cover” on page 403).
9. Reconnect all power cords and external cables and restart the server. The
power-on self-test (POST) starts.
10. Boot the server to an operating system that is supported by the firmware
update package that you downloaded.
11. Perform the firmware update by following the instructions that are in the
firmware update package readme file.
12. Copy the downloaded firmware update package into a directory.
Results
See “System-board switches and jumpers” on page 18 for more information about
the switches and jumpers.
Note: Use this method if the BOARD LED on the light path diagnostics panel is lit
and there is a log entry or Booting Backup Image is displayed on the firmware
splash screen; otherwise, use the in-band manual recovery method.
1. Boot the server to an operating system that is supported by the firmware
update package that you downloaded.
2. Perform the firmware update by following the instructions that are in the
firmware update package readme file.
3. Restart the server.
4. At the firmware splash screen, press F3 when prompted to restore to the
primary bank. The server boots from the primary bank.
374 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Automatic boot failure recovery (ABR)
While the server is starting, if the integrated management module detects problems
with the server firmware in the primary bank, the server automatically switches to
the backup firmware bank and gives you the opportunity to recover the firmware
in the primary bank.
For instructions for recovering the UEFI firmware, see “Recovering the server
firmware” on page 371. After you have recovered the firmware in the primary
bank, complete the following steps:
Procedure
1. Restart the server.
2. When the prompt Press F3 to restore to primary is displayed. Press F3 to
recover the primary bank. Pressing F3 will restart the server.
Nx boot failure
Configuration changes, such as added devices or adapter firmware updates, and
firmware or application code problems can cause the server to fail POST (the
power-on self-test). If this occurs, the server responds in either of the following
ways:
v The server restarts automatically and attempts POST again.
v The server hangs, and you must manually restart the server for the server to
attempt POST again.
To specify the number of consecutive restart attempts that will trigger the Nx boot
failure feature, in the Setup utility, click System Settings → Operating Modes →
POST Attempts Limit. The available options are 3, 6, 9, and 255 (disable Nx boot
failure).
Power problems can be difficult to solve. For example, a short circuit can exist
anywhere on any of the power distribution buses. Usually, a short circuit will
cause the power subsystem to shut down because of an overcurrent condition. To
diagnose a power problem, use the following general procedure:
Procedure
1. Turn off the server and disconnect all power cords.
2. Check for loose cables in the power subsystem. Also check for short circuits, for
example, if a loose screw is causing a short circuit on a circuit board.
3. Check the LEDs on the operator information panel (see “Light path diagnostics
LEDs” on page 363).
4. If a power-channel error LED on the system board is lit, complete the following
steps; otherwise, go to step 5. See “System-board LEDs” on page 22 for the
location of the power-channel error LEDs. Table 12 identifies the components
that are associated with each power channel and the order in which to
troubleshoot the components.
a. Disconnect the cables and power cords to all internal and external devices
(see “Internal cable routing and connectors” on page 398). Leave the
power-supply cords connected.
b. Remove each component that is associated with the LED, one at a time, in
the sequence indicated in Table 12, restarting the server each time, until the
cause of the overcurrent condition is identified.
Important: Only a trained service technician should remove or replace a
FRU, such as a microprocessor or the system board. See Chapter 4, “Parts
listing,” on page 381 to determine whether a component is a FRU.
Table 12. Components associated with power-channel error LEDs
Power-channel
error LED Components
A CD or DVD drive (optical drive), fans, hard disk drives, hard disk drive
backplanes
B PCI riser-card assembly in PCI connector 1 on the system board,
DIMMs 1 through 16, microprocessor 2
C Tape drive if one is installed, SAS riser card assembly, DIMMs 1
through 8, microprocessor 1
D Microprocessor 1, system board
E Optional PCI video graphics adapter power cable if one is installed
(connector J154 on the system board), optional PCI video graphics
adapter if one is installed, PCI riser card assembly in PCI connector 2
on the system board, microprocessor 2
AUX Power All PCI adapters and PCI riser-card assemblies, SAS riser-card
assembly, operator information panel assembly, optional two-port
Ethernet card if installed
Results
If the server does not start from the minimum configuration, replace the
components in the minimum configuration one at a time until the problem is
isolated.
The method that you use to test the Ethernet controller depends on which
operating system you are using. See the operating-system documentation for
information about Ethernet controllers, and see the Ethernet controller
device-driver readme file.
If the Ethernet controller still cannot connect to the network but the hardware
appears to be working, the network administrator must investigate other possible
causes of the error.
If the diagnostic tests did not diagnose the failure or if the server is inoperative,
use the information in this section.
Check the LEDs on all the power supplies (see “Power-supply LEDs” on page 367).
If the LEDs indicate that the power supplies are working correctly, complete the
following steps:
1. Turn off the server.
2. Make sure that the server is cabled correctly.
3. Remove or disconnect the following devices, one at a time, until you find the
failure. Turn on the server and reconfigure it each time.
v Any external devices.
v Surge-suppressor device (on the server).
v Modem, printer, mouse, and non-IBM devices.
v Each adapter.
v Hard disk drives.
v Memory modules. The minimum configuration requirement is 2 GB DIMM
per installed microprocessor.
v Service processor (IMM).
The following minimum configuration is required for the server to start:
v One microprocessor (slot 1)
v One 2 GB DIMM per installed microprocessor (slot 3 if only one
microprocessor is installed)
v One power supply
v Power cord
v Three cooling fans
v One PCI riser-card assembly in PCI riser connector 2
v ServeRAID SAS controller
4. Turn on the server. If the problem remains, suspect the system board.
If the problem is solved when you remove an adapter from the server but the
problem recurs when you reinstall the same adapter, suspect the adapter; if the
problem recurs when you replace the adapter with a different one, suspect the riser
card.
If you suspect a networking problem and the server passes all the system tests,
suspect a network cabling problem that is external to the server.
378 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Problem determination tips
Because of the variety of hardware and software combinations that you can
encounter, use the following information to assist you in problem determination. If
possible, have this information available when you request assistance from IBM.
v Machine type and model
v Microprocessor and hard disk upgrades
v Failure symptom
– Does the server fail the diagnostics tests?
– What occurs? When? Where?
– Does the failure occur on a single server or on multiple servers?
– Is the failure repeatable?
– Has this configuration ever worked?
– What changes, if any, were made before the configuration failed?
– Is this the original reported failure?
v Diagnostics program type and version level
v Hardware configuration (print screen of the system summary)
v BIOS code level
v Operating-system type and version level
You can solve some problems by comparing the configuration and software setups
between working and nonworking servers. When you compare servers to each
other for diagnostic purposes, consider them identical only if all the following
factors are exactly the same in all the servers:
v Machine type and model
v BIOS level
v Adapters and attachments, in the same locations
v Address jumpers, terminators, and cabling
v Software versions and levels
v Diagnostic program type and version level
v Setup utility settings
v Operating-system control-file setup
See Appendix A, “Getting help and technical assistance,” on page 517 for
information about calling IBM for service.
The following replaceable components are available for all the Series x3650 M3
Type 4255, 7945, or 7949 server models, except as specified otherwise in
“Replaceable server components.” To check for an updated parts listing on the
Web, complete the following steps.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Parts documents lookup.
4. From the Product family menu, select System x3650 M3 and click Go.
For information about the terms of the warranty and getting service and assistance,
see the Warranty Information document on the IBM Documentation CD.
The following illustration shows the major components in the server. The
illustrations in this document might differ slightly from your hardware.
382 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 13. Parts listing, Types 4255, 7945, and 7949 (continued)
CRU part CRU part
number (Tier number (Tier FRU part
Index Description 1) 2) number
2 PCI Express riser-card assembly (x 8) 69Y4324
2 PCI Express riser-card assembly (x 8) (for Emulex 10 GbE 59Y3818
adapter, 49Y4202)
2 PCI Express riser-card assembly (x 16) 69Y4325
3 PCI-X riser-card assembly 69Y4326
4 Heat sink, 95 watt 49Y4820
4 Heat sink, 130 watt 69Y1207
5 Microprocessor - Quad-Core Intel Xeon X5570 2.93 GHz 46D1262
(8 MB cache) 95 W
5 Microprocessor - Quad-Core Intel Xeon X5560 2.80 GHz 46D1263
(8 MB cache) 95 W
5 Microprocessor - Quad-Core Intel Xeon X5550 2.66 GHz 46D1264
(8 MB cache) 95 W
5 Microprocessor - Quad-Core Intel Xeon E5540 2.53 GHz 46D1265
(8 MB cache) 80 W
5 Microprocessor - Quad-Core Intel Xeon E5530 2.40 GHz 46D1266
(8 MB cache) 80 W
5 Microprocessor - Quad-Core Intel Xeon E5520 2.26 GHz 46D1267
(8 MB cache) 80 W
5 Microprocessor - Quad-Core Intel Xeon E5506 2.13 GHz 46D1270
(4 MB cache) 80 W (model A2x)
5 Microprocessor - Quad-Core Intel Xeon E5504 2.00 GHz 46D1271
(4 MB cache) 80 W
5 Microprocessor - Hex-Core Intel Xeon X5670 2.93 GHz 49Y7038
(12 MB cache) 95 W (model M2x)
5 Microprocessor - Hex-Core Intel Xeon X5660 2.80 GHz 49Y7039
(12 MB cache) 95 W (models L2x and L4x)
5 Microprocessor - Hex-Core Intel Xeon X5650 2.66 GHz 49Y7040
(12 MB cache) 95 W (models J2x, JSx, and D4x)
5 Microprocessor - Quad-Core Intel Xeon X5667 3.06 GHz 49Y7050
(12 MB cache) 95 W
5 Microprocessor - Quad-Core Intel Xeon E5640 2.66 GHz 49Y7051
(12 MB cache) 80 W (model G2x)
5 Microprocessor - Quad-Core Intel Xeon E5630 2.53 GHz 49Y7052
(12 MB cache) 80 W (model F2x)
5 Microprocessor - Quad-Core Intel Xeon E5620 2.40 GHz 49Y7053
(12 MB cache) 80 W (models D2x and D4x)
5 Microprocessor - Hex-Core Intel Xeon L5640 2.26 GHz (12 49Y7054
MB cache) 60 W (models H2x and H4x)
5 Microprocessor - Quad-Core Intel Xeon L5630 2.13 GHz 59Y3691
(12 MB cache) 40 W (model C2x)
5 Microprocessor - Quad-Core Intel Xeon E5503 2.00 GHz 69Y0781
(4 MB cache) 80 W
5 Microprocessor - Quad-Core Intel Xeon E5507 2.26 GHz 69Y0782
(4 MB cache) 80 W (models B2x and 32x)
384 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 13. Parts listing, Types 4255, 7945, and 7949 (continued)
CRU part CRU part
number (Tier number (Tier FRU part
Index Description 1) 2) number
11 DIMM - 4 GB (1Gb 2Rx8 1.35V ) ECC RDIMM 49Y1412
11 DIMM - 8 GB (2Gb 2Rx4) RDIMM 49Y1415
11 DIMM - 8 GB (2Gb 2Rx4) RDIMM 49Y1416
11 DIMM - 16 GB (2Gb 4Rx4) RDIMM 49Y1418
11 DIMM - 2 GB (1x2GB, 1Rx8) UDIMM 49Y1421
11 DIMM - 4 GB (2Gb 2Rx8) UDIMM 49Y1422
11 DIMM - 2 GB (2Gb 1Rx8) RDIMM 49Y1423
11 DIMM - 4 GB (2Gb 1Rx4) RDIMM 49Y1424
11 DIMM - 4 GB (2Gb 2Rx8) RDIMM 49Y1425
12 Power supply, 460 Watt, ac 39Y7229
12 Power supply, 460 Watt, ac 39Y7231
12 Power supply, 460 Watt, ac 69Y5907
12 Power supply, 460 Watt, 69Y5939
12 Power supply, 675 Watt, dc 39Y7215
12 Power supply, 675 Watt, dc 69Y5909
12 Power supply, 675 Watt HE, ac 39Y7218
12 Power supply, 675 Watt HE, ac 69Y5901
12 Power supply, 675 Watt HE, ac 69Y5903
12 Power supply, 675 Watt, ac 39Y7225
12 Power supply, 675 Watt, ac 39Y7227
12 Power supply, 675 Watt, ac 39Y7236
12 Power supply, 675 Watt, ac 69Y5909
12 Power supply, 675 Watt, ac 69Y5919
12 Power supply, 675 Watt 69Y5941
12 Power supply, 675 Watt 69Y5943
13 Power-supply bay filler (all models except F2x and JSx) 49Y4821
14 DVD drive, SATA 44W3254
14 DVD drive, SATA (model JSx) 44W3256
15 Operator information panel 44E4372
16A 4-drive filler panel, hot-swap 49Y5359
16A 4-drive filler panel, simple swap 49Y5360
16B Hot-swap HDD filler (models A2x, B2x, C2x, D2x, F2x, 44T2248
G2x, H2x, J2x, L2x, M2x, and N2x)
17 Rack latch bracket kit (all models) contains: 49Y5356
v Bracket, EIA left assembly (1)
v Bracket, EIA right assembly (1)
v Screw, M 3.5 steel (2)
18 Bezel, 16 HDD 59Y3780
19A Backplane, SAS, 4 hard disk drives 43V7070
386 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 13. Parts listing, Types 4255, 7945, and 7949 (continued)
CRU part CRU part
number (Tier number (Tier FRU part
Index Description 1) 2) number
Emulex 10GbE Virtual Fabric Adapter II 49Y7952
NetXtreme II 1000 Express quad port Ethernet adapter 49Y4222
Qlogic 10Gb CNA card assembly 00Y3274
Qlogic 10Gb Dual-Port CN 42C1802
Qlogic 10Gb SFP+SR 42C1816
10Gb SFP+ SR optical transceiver 46C9297
Brocade 10Gb SFP+ SR optical transceiver 42C1819
Brocade 10Gb CNA for IBM System x 42C1822
10Gb SFP+ transceiver module assembly 46C3449
IBM 10 GbE SW SFP+ transceiver 49Y8579
ServeRAID-BR10i SAS/SATA controller 46C9043
ServeRAID-BR10il SAS/SATA controller (model A2x) 49Y4737
ServeRAID M1015 SAS/SATA Controller 46C8933
ServeRAID-M5014 SAS/SATA controller 46C8929
ServeRAID-M5014 SAS/SATA controller (models F2x and 46M0918
G2x)
ServeRAID-M5015 SAS/SATA controller 46C8927
ServeRAID-M5015 SAS/SATA controller (models J2x, JSx, 46M0851
L2x, and M2x)
ServeRAID-B5015 SAS/SATA controller 46M0970
ServeRAID-M1015 SAS/SATA controller 46C8933
ServeRAID-M1015 SAS/SATA controller (models B2x, 46M0861
C2x, D2x, H2x, and N2x)
ServeRAID-MR10i SAS/SATA controller 46C9037
ServeRAID M5000 series advanced feature key 46M0931
ServeRAID M1000 series advanced feature key 46M0864
Carrier daughter card 44E8763
Daughter card mechanic kit, 1 GB 69Y4586
388 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 13. Parts listing, Types 4255, 7945, and 7949 (continued)
CRU part CRU part
number (Tier number (Tier FRU part
Index Description 1) 2) number
Cable, backplane configuration cable for 8 drives with 69Y4228
tape
Cable, backplane power cable for 4 drives 69Y4234
Cable, backplane power cable for 8 drive 69Y0649
Cable, KVM conversion 39M2911
Cable, operator information panel 46C4139
Cable, ServeRAID battery 90Y7310
Cable, SAS signal, 245 mm 59Y3786
Cable, SAS signal, 250 mm 69Y1332
Cable, SAS signal, 300 mm 49Y5390
Cable, SAS signal, 240 mm 49Y5392
Cable, SAS signal, 450 mm 59Y3921
Cable, SAS signal, 710 mm 69Y1328
Cable, SATA DVD 43V6914
Cable, TWINAX 1M 45W2444
Cable, TWINAX 3M 45W2526
Cable, TWINAX 5M 45W3044
Cable, USB conversion 39M2909
Cable, USB/video 46C4146
Cable, VGA power 59Y3455
Cord, 2.8 meter 39M5377
CPU extraction tool 81Y9398
Alcohol wipes 59P4739
Emulex 10GbE virtual fabric adapter 49Y4252
Thermal grease 41Y9292
Label, service 59Y3789
Labels, chassis 59Y3788
LP latch 69Y5119
Cable, USB power 39M6797
HBA adapter, 8 GB 00Y5629
Mechanical chassis 81Y6842
NVIDIA FX 3800 43V5894
NVIDIA FX 1800 43V5886
NVIDIA FX 1700 43V5765
NVIDIA FX 580 43V5890
NVIDIA FX 570 43V5782
DVI-A Dongle adapter 25R9043
Serve RAID M5016 card 00AE809
ServeRAID-MR10i Li-Ion battery 46C9040
Consumable parts are not covered by the IBM Statement of Limited Warranty. The
following consumable parts are available for purchase from the retail store.
Table 14. Consumable parts, Types 4255 and 7945
Index Description Part number
ServeRAID M5000 battery 43W4342
If you need help with your order, call the toll-free number that is listed on the
retail parts page, or contact your local IBM representative for assistance.
390 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 15. Product recovery CDs (continued)
Description CRU part number
Microsoft Windows Server 2008 Enterprise Edition 32/64 Bit 49Y0897
1-8 microprocessors, Traditional Chinese
VMware ESX Server 3i Recovery Tools CDs version 3.5 46D0762
VMware ESX Server 3i Version 3.5 Update 2 46M9236
VMware ESX Server 3i Version 3.5 Update 3 46M9237
VMware ESX Server 3i Version 3.5 Update 4 46M9238
VMware ESX Server 3i Version 3.5 Update 5 68Y9633
VMware ESXi 4.0 49Y8747
VMware ESXi 4.0 Update 1 68Y9634
Power cords
For your safety, IBM provides a power cord with a grounded attachment plug to
use with this IBM product. To avoid electrical shock, always use the power cord
and plug with a properly grounded outlet.
IBM power cords used in the United States and Canada are listed by Underwriter's
Laboratories (UL) and certified by the Canadian Standards Association (CSA).
For units intended to be operated at 115 volts: Use a UL-listed and CSA-certified
cord set consisting of a minimum 18 AWG, Type SVT or SJT, three-conductor cord,
a maximum of 15 feet in length and a parallel blade, grounding-type attachment
plug rated 15 amperes, 125 volts.
For units intended to be operated at 230 volts (U.S. use): Use a UL-listed and
CSA-certified cord set consisting of a minimum 18 AWG, Type SVT or SJT,
three-conductor cord, a maximum of 15 feet in length and a tandem blade,
grounding-type attachment plug rated 15 amperes, 250 volts.
For units intended to be operated at 230 volts (outside the U.S.): Use a cord set
with a grounding-type attachment plug. The cord set should have the appropriate
safety approvals for the country in which the equipment will be installed.
IBM power cords for a specific country or region are usually available only in that
country or region.
392 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
IBM power cord part
number Used in these countries and regions
39M5226 India
39M5233 Brazil
For information about the terms of the warranty and getting service and assistance,
see the Warranty Information document.
Installation guidelines
Use the installation guidelines to install the System x3650 M3 Types 4255, 7945,
and 7949
396 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
System reliability guidelines
The system reliability guidelines to ensure proper system cooling.
To help ensure proper system cooling and system reliability, make sure that the
following requirements are met:
v Each of the drive bays has a drive or a filler panel and electromagnetic
compatibility (EMC) shield installed in it.
v If the server has redundant power, each of the power-supply bays has a power
supply installed in it.
v There is adequate space around the server to allow the server cooling system to
work properly. Leave approximately 50 mm (2.0 in.) of open space around the
front and rear of the server. Do not place objects in front of the fans. For proper
cooling and airflow, replace the server cover before you turn on the server.
Operating the server for extended periods of time (more than 30 minutes) with
the server cover removed might damage server components.
v You have followed the cabling instructions that come with optional adapters.
v You have replaced a failed fan within 48 hours.
v You have replaced a hot-swap fan within 30 seconds of removal.
v You have replaced a hot-swap drive within 2 minutes of removal.
v You do not operate the server without the air baffles installed. Operating the
server without the air baffles might cause the microprocessors to overheat.
v Microprocessor 2 air baffle and DIMM air baffle are installed.
v The light path diagnostics panel is not pulled out of the server.
The server supports hot-plug, hot-add, and hot-swap devices and is designed to
operate safely while it is turned on and the cover is removed. Follow these
guidelines when you work inside a server that is turned on:
v Avoid wearing loose-fitting clothing on your forearms. Button long-sleeved
shirts before working inside the server; do not wear cuff links while you are
working inside the server.
v Do not allow your necktie or scarf to hang inside the server.
v Remove jewelry, such as bracelets, necklaces, rings, and loose-fitting wrist
watches.
v Remove items from your shirt pocket, such as pens and pencils, that might fall
into the server as you lean over it.
v Avoid dropping any metallic objects, such as paper clips, hairpins, and screws,
into the server.
The following illustration shows the internal routing and connectors for the two
SAS signal cables.
398 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 24. SAS signal cables
The following illustration shows the internal routing and connectors for the
SAS/SATA signal and power cables.
Attention: To disconnect the optional optical drive cable, you must first press the
connector release tab, and then disconnect the cable from the connector on the
system board. Do not disconnect the cable by using excessive force. Failing to
disconnect the cable properly may damage the connector on the system board. Any
damage to the connector may require replacing the system board.
The following illustration shows the internal routing and connector for the
operator information panel cable. The following notes describe additional
information you must consider when you install or remove the operator
information panel cable:
v You may remove the optional optical drive cable to obtain more room before
you install or remove the operator information panel cable.
v To remove the operator information panel cable, slightly press the cable toward
the chassis; then, pull to remove the cable from the connector on the system
board. Pulling the cable out of the connector by excessive force might cause
damage to the cable or connector.
v To connect the operator information panel cable on the system board, press
evenly on the cable. Pressing on one side of the cable might cause damage to the
cable or connector.
Attention: Failing to install or remove the cable with care may damage the
connectors on the system board. Any damage to the connectors may require
replacing the system board.
400 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 27. Operator information panel cable
The following illustration shows the internal routing and connector for the
USB/video cable.
The following illustration shows the internal routing for the configuration cable.
The illustrations in this document might differ slightly from your hardware.
Cover-release
latch
402 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. If you are planning to view the error LEDs that are on the system board and
components, leave the server connected to power and go directly to step 4.
3. If you are planning to install or remove a microprocessor, memory module, PCI
adapter, battery, or other non-hot-swap optional device, turn off the server and
all attached devices and disconnect all external cables and power cords.
4. Press down on the left and right side latches and slide the server out of the
rack enclosure until both slide rails lock.
Note: You can reach the cables on the back of the server when the server is in
the locked position.
5. Push the cover-release latch back 1, then lift it up 2. Slide the cover back
3, and then lift off the server. Set the cover aside.
Attention: For proper cooling and airflow, replace the cover before you turn
on the server. Operating the server for extended periods of time (over 30
minutes) with the cover removed might damage server components.
6. If you are instructed to return the cover, follow all packaging instructions, and
use any packaging materials for shipping that are supplied to you.
1. Make sure that all internal cables are correctly routed (see “Internal cable
routing and connectors” on page 398.)
2. Place the cover-release latch in the open (up) position.
3. Insert the bottom tabs of the top cover into the matching slots in the server
chassis.
4. Press down on the cover-release latch to lock the cover in place.
5. Slide the server into the rack.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Remove the cover (see “Removing the cover” on page 402).
4. Remove PCI riser-card assembly 2 (see “Removing a PCI riser-card assembly”
on page 414).
5. Grasp the top of the air baffle and lift the air baffle out of the server.
Attention: For proper cooling and airflow, replace all air baffles, making sure
all cables are out of the way, before you turn on the server. Operating the
server with any air baffle removed might damage server components.
404 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
6. If you are instructed to return the microprocessor air baffle, follow all
packaging instructions, and use any packaging materials for shipping that are
supplied to you.
Procedure
1. Align the tab on the left side of the microprocessor 2 air baffle with the slot in
the right side of the power-supply cage.
2. Align the pin on the bottom of the microprocessor air baffle with the hole on
the system board retention bracket.
3. Lower the microprocessor 2 air baffle into the server, making sure all cables are
out of the way.
Attention: For proper cooling and airflow, replace all air baffles before you
turn on the server. Operating the server with any air baffle removed might
damage server components.
4. Install PCI riser-card assembly 2.
PCI riser-card
assembly 1
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Remove the cover (see “Removing the cover” on page 402).
4. If necessary, remove riser-card assembly 1 (see “Removing a PCI riser-card
assembly” on page 414).
5. Place your fingers under the front and back of the top of the air baffle; then, lift
the air baffle out of the server.
Attention: For proper cooling and airflow, replace all the air baffles before
you turn on the server. Operating the server with any air baffle removed might
damage server components.
406 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing the DIMM air baffle
Use this information to replace the DIMM air baffle.
PCI riser-card
assembly 1
Procedure
1. Align the DIMM air baffle with the DIMMs and the back of the fans.
2. Lower the air baffle into place, making sure all cables are out of the way.
3. Replace PCI riser-card assembly 1, if necessary.
4. Install the cover (see “Replacing the cover” on page 403).
5. Slide the server into the rack.
6. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
Results
Attention: For proper cooling and airflow, replace all air baffles before you turn
on the server. Operating the server with any air baffle removed might damage
server components.
Note: To remove or install a fan, it is not necessary to remove the fan bracket. See
“Removing a hot-swap fan” on page 457 and “Replacing a hot-swap fan” on page
458.
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Remove the cover (see “Removing the cover” on page 402).
4. Remove the fans (see “Removing a hot-swap fan” on page 457).
5. Remove the PCI riser-card assemblies and the DIMM air baffle (see “Removing
a PCI riser-card assembly” on page 414 and “Removing the DIMM air baffle”
on page 406).
6. Press the fan-bracket release latches toward each other and lift the fan bracket
out of the server.
408 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing the fan bracket
Use this information to replace the fan bracket.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Slide the server out of the rack.
4. Remove the cover (see “Removing the cover” on page 402).
5. Locate the virtual media key on the system board. Grasp it and carefully pull it
off the virtual media key connector pins.
410 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing an IBM virtual media key
Use this information to replace an IBM virtual media key.
Procedure
1. Align the virtual media key with the virtual media key connector pins on the
system board as shown in the illustration.
2. Insert the virtual media key onto the pins until it clicks into place.
3. Install the server cover (see “Replacing the cover” on page 403).
4. Slide the server into the rack.
5. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
To remove a USB hypervisor memory key from a SAS riser card, complete the
following steps:
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Slide the server out of the rack.
4. Remove the cover (see “Removing the cover” on page 402).
5. Locate the SAS riser-card assembly, which is near the left-front corner of the
server.
6. Push the blue locking collar on the USB hypervisor connector back toward the
SAS riser card to unlock it from the connector.
7. Pull the hypervisor memory key out of the USB hypervisor connector.
8. If you are instructed to return the hypervisor memory key, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
Note: You must configure the server not to look for the hypervisor USB drive. See
Chapter 6, “Configuration information and instructions,” on page 491 for
information about disabling hypervisor support.
412 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing a USB hypervisor memory key
Use this information to replace a USB hypervisor memory key.
To install a USB hypervisor memory key in the SAS riser card, complete the
following steps:
1. Locate the SAS riser-card assembly, which is near the left-front corner of the
server.
2. Push the blue locking collar on the USB hypervisor connector on the SAS riser
card toward the SAS riser card (the unlocked position).
3. Insert the hypervisor memory key into the dedicated USB connector.
4. Slide the blue locking collar on the USB hypervisor connector forward, toward
the hypervisor memory key as far as it will go, to secure the hypervisor
memory key in position.
5. Install the server cover (see “Replacing the cover” on page 403).
6. Slide the server into the rack.
7. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
Note: You will have to configure the server to boot from the hypervisor USB drive.
See Chapter 6, “Configuration information and instructions,” on page 491 for
information about enabling the hypervisor memory key.
The server comes with one riser-card assembly that contains two PCI Express x8
Gen 2 connectors. You can replace a PCI Express riser-card assembly with a
riser-card assembly that contains one PCI Express Gen 2 x16 connector or that
contains two PCI-X 64-bit 133 MHz connectors. See http://www.ibm.com/
systems/info/x86servers/serverproven/compat/us/ for a list of riser-card
assemblies that you can use with the server.
PCI riser-card
assembly 2
PCI riser-card
assembly 1
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Slide the server out of the rack.
4. Remove the server cover (see “Removing the cover” on page 402).
414 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
5. Grasp the riser-card assembly at the front tab and rear edge and lift it to
remove it from the server. Place the riser-card assembly on a flat,
static-protective surface.
PCI riser-card
assembly 1
PCI riser
connector 2
Alignment
slots
Alignment
brackets
PCI riser
connector 1
Procedure
1. Reinstall any adapters and reconnect any internal cables you might have
removed in other procedures (see “Internal cable routing and connectors” on
page 398.)
2. Align the PCI riser-card assembly with the selected PCI connector on the
system board:
v PCI connector 1: Carefully fit the two alignment slots on the side of the
assembly onto the two alignment brackets in the side of the chassis.
v PCI connector 2: Carefully align the bottom edge (the contact edge) of the
riser-card assembly with the riser-card connector on the system board.
3. Press down on the assembly. Make sure that the riser-card assembly is fully
seated in the riser-card connector on the system board.
4. Install the server cover (see “Replacing the cover” on page 403).
5. Slide the server into the rack.
6. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
416 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Removing a PCI adapter from a PCI riser-card assembly
Use this information to remove a PCI adapter from a PCI riser-card assembly.
This topic describes removing an adapter from a PCI expansion slot in a PCI
riser-card assembly. These instructions apply to PCI adapters such as video graphic
adapters and network adapters. To remove a ServeRAID SAS controller from the
SAS riser card, go to “Removing a ServeRAID SAS controller from the SAS riser
card” on page 430.
The following illustration shows the locations of the adapter expansion slots from
the rear of the server.
Note:
1. If a PCI Express Gen 2x16 adapter is installed in a PCI riser-card assembly, the
second expansion slot is not available.
2. If you are replacing a high power graphics adapter, you might need to
disconnect the internal power cable from the system board before removing the
adapter.
To remove an adapter from a PCI expansion slot, complete the following steps.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
Some high end video adapters are supported by your server. See
http://www.ibm.com/systems/info/x86servers/serverproven/compat/us/for
more information.
Note:
1. If you are installing a video adapter in your server, do not set the maximum
digital video resolution above 1600 x 1200 at 60 Hz for an LCD monitor. This is
the highest resolution supported for any video adapter in this server.
2. Any high-definition video-out connector or stereo connector on the video
adapter is not supported.
3. If you are installing x3650 M3 PCI Express Gen 2 x8 riser card (part number
59Y3818), you cannot install any adaptor on slot 1.
The following illustration shows the locations of the adapter expansion slots from
the rear of the server.
418 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 46. PCI adapter expansion slots
Adapter Expansion-slot
cover
PCI
riser-card
assembly
b. Align the adapter with the PCI connector on the riser card and the guide on
the external end of the riser-card assembly.
c. Press the adapter firmly into the PCI connector on the riser card.
2. Connect any required cables to the adapter (see “Internal cable routing and
connectors” on page 398.)
Attention:
v When you route cables, do not block any connectors or the ventilated space
around any of the fans.
v Make sure that cables are not routed on top of components under the PCI
riser-card assembly.
v Make sure that cables are not pinched by the server components.
3. Align the PCI riser-card assembly with the selected PCI connector on the
system board:
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Remove the cover (see “Removing the cover” on page 402).
4. Remove the PCI riser-card assembly 1 (see “Removing a PCI riser-card
assembly” on page 414).
5. Grasp the Ethernet adapter and disengage it from the standoffs and the
connector on the system board; then, slide the Ethernet adapter out of the port
openings on the rear of the chassis and remove it from the server.
6. If you are instructed to return the Ethernet adapter, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
420 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing the optional two-port Ethernet adapter
Use this information to replace the optional two-port Ethernet adapter.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Remove the cover (see “Removing the cover” on page 402).
3. Attach the rubber stopper on the chassis, along the edge of the system board,
as shown in the following illustration.
Rubber
stopper
Rubber
stopper
Ethernet
adapter connector
4. Remove the adapter filler panel on the rear of the chassis (if it has not been
removed already).
7. While you slightly press the top of the metal clip, rotate the metal clip toward
the front of the server until the metal clip clicks into place. Make sure the
metal clip is securely engaged on the chassis.
Attention: Pressing the top of the metal clip with excessive force may cause
damage to the metal clip.
8. Touch the static-protective package that contains the new adapter to any
unpainted metal surface on the server. Then, remove the adapter from the
package.
9. Align the adapter with the adapter connector on the system board; then, tilt
the adapter so that the port connectors on the adapter line up with the port
openings on the chassis.
10. Slide the port connectors on the adapter into the port openings on the chassis;
then, press the adapter firmly until the two standoffs engage the adapter.
Make sure the adapter is securely seated on the connector on the system
board.
Make sure the port connectors on the adapter do not set on the rubber
stopper. The following illustration shows the side view of the adapter in the
422 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
server.
Attention: Make sure the port connectors on the adapter are aligned
properly with the chassis on the rear of the server. An incorrectly seated
adapter might cause damage to the system board or the adapter.
11. Install PCI riser 1 (see “Replacing a PCI riser-card assembly” on page 415).
12. Install the cover (see “Replacing the cover” on page 403).
13. Slide the server into the rack.
14. Reconnect the power cord and any cables that you removed.
15. Turn on the peripheral devices and the server.
If you are removing a full-length adapter in the upper riser-card PCI slot and will
replace it with a shorter adapter or no adapter, you must remove the
full-length-adapter bracket from the end of the riser-card assembly and return the
bracket to its storage location.
To remove and store the full-length-adapter bracket, complete the following steps:
1. Press the bracket tab 3 and slide the bracket to the left until the bracket falls
free of the riser-card assembly.
2. Align the bracket with the storage location on the riser-card assembly as
shown.
3. Place the two hooks 1 in the two openings 2 in the storage location on the
riser-card assembly.
4. Press the bracket tab 3 and slide the bracket toward the expansion-lot-
opening end of the assembly until the bracket clicks into place.
To remove the SAS riser-card and controller assembly from the server, complete the
steps for the applicable server model.
v 16-drive-capable server model
424 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 56. SAS riser-card and controller assembly removal
1. Press the release tab toward the rear of the server and lift the back end of the
SAS controller card slightly.
2. Place your fingers underneath the upper portion of the SAS riser card and
lift the assembly from the system board.
3. Slide the front end of the SAS controller card out of the retention bracket and
lift the assembly out of the server.
v Tape-enabled server model
1. Press down on the assembly release latch and lift up on the tab to release the
SAS controller assembly, which includes the SAS riser card, from the system
board.
2. Lift the front and back edges of the assembly to remove the assembly from
the server.
To install the SAS riser-card and controller assembly in the server, complete the
steps for the applicable server model.
v 16-drive-capable server model
426 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 59. Controller retention brackets
a. Remove the SAS controller front retention bracket from the server.
b. Remove the rear controller retention bracket located in the battery bay
above the power supplies by pulling up the release tab 1 and sliding
the bracket outward 2.
c. Install the controller retention bracket from step b by aligning the
retention bracket controller slot and then placing the bracket tabs in the
holes on the chassis and slide the bracket to left until it clicks into place.
d. Install the controller retention bracket from step a by sliding the
bracket inward 1 and pressing down the release tab into place 2.
428 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
2. Place the front end of the SAS controller in the retention bracket and align
the SAS riser card with the SAS riser-card connector on the system board.
3. Press down on the SAS riser card and the rear edge of the SAS controller
until the SAS riser card is firmly seated and the SAS controller card retention
latch clicks into place. One or two pins (depending on the size of the card)
clicks into the corner holes of the SAS controller card when the controller
card is correctly seated.
v Tape-enabled server model
1. Align the pins on the backside of the riser with the slots on the side of the
chassis.
2. Align the SAS riser card of the SAS controller assembly with the SAS
riser-card connector on the system board.
3. Press the SAS controller assembly into place; make sure that the SAS riser
card is firmly seated and that the release latch and retention latch holds the
assembly securely.
Note: For brevity, in this documentation the ServeRAID SAS controller is often
referred to simply as the SAS controller.
A ServeRAID SAS controller is installed in a dedicated slot on the SAS riser card.
To remove a ServeRAID SAS controller from a SAS riser card, complete the
following steps:
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Remove the cover (see “Removing the cover” on page 402).
4. Locate the SAS riser-card and controller assembly near the left-front corner of
the server.
5. Disconnect the SAS signal cables from the connectors on the SAS controller
(see “Internal cable routing and connectors” on page 398).
6. Remove the SAS controller assembly, which includes the SAS riser card, from
the server (see “Removing the SAS riser-card and controller assembly” on
page 424).
7. If the server is a tape-enabled-model, press the tab on the SAS-controller
retention bracket away from the SAS riser card and lift the right edge of the
SAS controller card out of the bracket.
8. Pull the SAS controller horizontally out of the connector on the SAS riser card.
Note: If you have installed the optional ServeRAID adapter advanced feature
key, remove it and keep it in future use (see “Removing an optional
ServeRAID adapter advanced feature key” on page 433).
430 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
9. If you are replacing the SAS controller card, then remove the battery but keep
the cables connected.
10. If you are instructed to return the ServeRAID SAS controller, follow all
packaging instructions, and use any packaging materials for shipping that are
supplied to you.
To ensure that a RAID adapter works correctly in your server, make sure that the
adapter firmware is at the latest level.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
To install a SAS controller on the SAS riser card, complete the following steps.
Note: If you have the optional ServeRAID adapter advanced feature key,
install it first (see “Replacing an optional ServeRAID adapter advanced feature
key” on page 434).
2. If you are replacing a SAS controller that uses a battery, you can continue to
use that battery with your new SAS controller.
3. If the new SAS controller is a different physical size than the SAS controller
that you removed, you might have to move the controller retention bracket
(tape-enabled model servers only) to the correct location for the new SAS
controller.
4. Turn the SAS controller so that the keys on the bottom edge align correctly
with the connector on the SAS riser card in the SAS controller assembly.
5. Firmly press the SAS controller horizontally into the connector on the SAS
riser card.
6. (Tape-enabled model server only) Gently press the opposite edge of the SAS
controller into the controller retention bracket.
7. Install the SAS riser-card and controller assembly (see “Replacing the SAS
riser-card and controller assembly” on page 426).
8. Install the server cover (see “Replacing the cover” on page 403).
9. Slide the server into the rack.
10. Reconnect the external cables; then, reconnect the power cords and turn on
the peripheral devices and the server.
Results
Note:
1. When you restart the server for the first time after you install a SAS controller
with a battery, the monitor screen remains blank while the controller initializes
the battery. This might take a few minutes, after which the startup process
continues. This is a one-time occurrence.
Important: You must allow the initialization process to be completed. If you do
not, the battery pack will not work, and the server might not start.
The battery comes partially charged, at 30% or less of capacity. Run the server
for 4 to 6 hours to fully charge the controller battery. The LED just above the
battery on the controller remains lit until the battery is fully charged.
Until the battery is fully charged, the controller firmware sets the controller
cache to write-through mode; after the battery is fully charged, the controller
firmware re-enables write-back mode.
2. When you restart the server, you will be given the opportunity to import the
existing RAID configuration to the new ServeRAID SAS controller.
432 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Removing an optional ServeRAID adapter advanced feature
key
About this task
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect the power cords.
3. Remove the cover (see “Removing the cover” on page 402).
4. Grasp the feature key and lift to remove it from connector on the ServeRAID
adapter.
5. If you are instructed to return the feature key, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect the power cords.
3. Remove the cover (see “Removing the cover” on page 402).
4. Align the feature key with the connector on the ServeRAID adapter and push it
into the connector until it is firmly seated.
434 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 70. ServeRAID M1000 advanced eature key installation
5. Reconnect the power cord and any cables that you removed.
6. Install the cover “Replacing the cover” on page 403.
7. Slide the server into the rack.
8. Turn on the peripheral devices and the server.
To remove a ServeRAID SAS controller battery from the remote battery tray,
complete the following steps:
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Remove the cover (see “Removing the cover” on page 402).
4. Locate the remote battery tray in the server and remove the battery that you
want to replace:
a. Remove the battery retention clip from the tabs that secure the battery to
the remote battery tray.
436 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 72. Battery retention clip removal
b. Lift the battery and battery carrier from the tray and carefully disconnect
the remote battery cable from the interposer card on the ServeRAID
controller.
Note: If your battery and battery carrier are attached with screws instead of
a locking-clip mechanism, remove the three screws to remove the battery
from the battery carrier.
To install a ServeRAID SAS controller battery on the remote battery tray, complete
the following steps:
Procedure
1. Install the replacement battery on the remote battery tray:
a. Place the replacement battery on the battery carrier from which the former
battery had been removed, and connect the battery carrier cable to the
replacement battery.
b. Connect the remote battery cable to the interposer card.
Attention: To avoid damage to the hardware, make sure that you align the
black dot on the cable connector with the black dot on the connector on the
interposer card. Do not force the remote battery cable into the connector.
438 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 74. Connecting remote battery cable
c. On the remote battery tray, find the pattern of recessed rings that matches
the posts on the battery and battery carrier.
d. Press the posts into the rings and underneath the tabs on the remote battery
tray.
e. Secure the battery to the tray with the battery retention clip.
2. Install the cover “Replacing the cover” on page 403
Latch
Handle
Attention: To maintain proper system cooling, do not operate the server for more
than 10 minutes without either a drive or a filler panel installed in each bay.
To remove a hard disk drive from a hot-swap bay, complete the following steps.
Procedure
1. Read the safety information that begins on page “Safety” on page vii,
“Handling static-sensitive devices” on page 398, and “Installation guidelines”
on page 395.
2. Press up on the release latch at the top of the drive front.
3. Rotate the handle on the drive downward to the open position.
4. Pull the hot-swap drive assembly out of the bay approximately 25 mm (1 inch).
Wait approximately 45 seconds while the drive spins down before you remove
the drive assembly completely from the bay.
5. If you are instructed to return the hot-swap drive, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
Locate the documentation that comes with the hard disk drive and follow those
instructions in addition to the instructions in this section.
For information about the type of hard disk drive that the server supports and
other information that you must consider when installing a hard disk drive, see the
Installation and User’s Guide on the IBM Documentation CD.
440 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Latch
Handle
Attention: To maintain proper system cooling, do not operate the server for more
than 10 minutes without either a drive or a filler panel installed in each bay.
Procedure
1. Orient the drive as shown in the illustration.
2. Make sure that the tray handle is open.
3. Align the drive assembly with the guide rails in the bay.
4. Gently push the drive assembly into the bay until the drive stops.
5. Push the tray handle to the closed (locked) position.
6. If the system is turned on, check the hard disk drive status LED to verify that
the hard disk drive is operating correctly.
Results
After you replace a failed hard disk drive, the green activity LED flashes as the
disk spins up. The amber LED turns off after approximately 1 minute. If the new
drive starts to rebuild, the amber LED flashes slowly, and the green activity LED
remains lit during the rebuild process. If the amber LED remains lit, see “Hard
disk drive problems” on page 342.
Note: You might have to reconfigure the disk arrays after you install hard disk
drives. See the RAID documentation on the IBM ServeRAID Support CD for
information about RAID controllers.
Attention: To maintain proper system cooling, do not operate the server for more
than 10 minutes without either a drive or a filler panel installed in each bay.
To remove a hard disk drive from a simple-swap bay, complete the following steps.
Procedure
1. Read the safety information that begins on page “Safety” on page vii,
“Handling static-sensitive devices” on page 398, and “Installation guidelines”
on page 395
2. Turn off the server and peripheral devices and disconnect the power cords and
all external cables.
Note: When you disconnect the power source from the server, you lose the
ability to view the LEDs because the LEDs are not lit when the power source is
removed. Before you disconnect the power source, make a note of which LEDs
are lit, including the LEDs that are lit on the operation information panel, on
the light path diagnostics panel, and LEDs inside the server on the system
board; then, see Chapter 3, “Diagnostics,” on page 27 for information about
how to solve the problem.
3. Press up on the release latch at the top of the drive front.
4. Rotate the handle on the drive downward to the open position.
5. If you are replacing the 2.5 inch simple-swap hard disk drive backplane,
remove it now.
6. If you are instructed to return the simple-swap drive, follow all packaging
instructions, and use any packaging materials for shipping that are supplied to
you.
442 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing a simple-swap hard disk drive
Use this information to replace a simple-swap hard disk drive .
Locate the documentation that comes with the hard disk drive and follow those
instructions in addition to the instructions in this section.
Simple-swap models do not support the SAS hot-swap backplane or the SAS riser
card.
For information about the type of hard disk drive that the server supports and
other information that you must consider when installing a hard disk drive, see the
Installation and User’s Guide on the IBM Documentation CD.
Attention: To maintain proper system cooling, do not operate the server for more
than 10 minutes without either a drive or a filler panel installed in each bay.
Procedure
1. Install the 2.5 inch simple-swap hard disk drive backplane.
2. Remove the drive filler panel from the front of the server.
3. Orient the drive as shown in the illustration.
4. Make sure that the tray handle is open.
5. Align the drive assembly with the guide rails in the bay.
6. Gently push the drive assembly into the bay until the drive stops.
7. Push the tray handle to the closed (locked) position.
8. If the system is turned on, check the hard disk drive status LED to verify that
the hard disk drive is operating correctly.
After you replace a failed hard disk drive, the green activity LED flashes as the
disk spins up. The amber LED turns off after approximately 1 minute. If the new
drive starts to rebuild, the amber LED flashes slowly, and the green activity LED
remains lit during the rebuild process. If the amber LED remains lit, see “Hard
disk drive problems” on page 342.
Note: You might have to reconfigure the disk arrays after you install hard disk
drives. See the RAID documentation on the IBM ServeRAID Support CD for
information about RAID controllers.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Slide the server out of the rack; then, remove the cover (see “Removing the
cover” on page 402).
4. Press the release tab down to release the drive; then, while you press the tab,
push the drive toward the front of the server.
5. From the front of the server, pull the drive out of the bay.
444 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Release tab
CD/DVD drive
Alignment pins
Alignment pins
The following illustration shows how to remove an optional tape drive from the
server.
To remove a tape drive from the server, complete the following steps:
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices, and disconnect the power cords and
all external cables.
3. Slide the server out of the rack; then, remove the cover (see “Removing the
cover” on page 402).
4. Open the tape drive tray release latch and slide the drive tray out of the bay
approximately 25 mm (1 inch).
5. Disconnect the power and signal cables from the rear of the tape drive.
6. Pull the drive completely out of the bay.
7. Remove the tape drive from the drive tray by removing the four screws on the
sides of the tray.
446 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 85. Tape drive removal
8. If you are not installing another drive in the bay, insert the tape drive filler
panel into the empty tape drive bay.
9. If you are instructed to return the drive, follow all packaging instructions, and
use any packaging materials for shipping that are supplied to you.
3. Prepare the drive according to the instructions that come with the drive,
setting any switches or jumpers.
4. Slide the tape-drive assembly most of the way into the tape-drive bay.
5. Using the cables from the former tape drive, connect the signal and power
cables to the back of the tape drive.
6. Make sure all the cables are out of the way, and slide the tape-drive assembly
the rest of the way into the tape-drive bay.
7. Push the tray handle to the closed (locked) position.
8. Install the cover (see “Replacing the cover” on page 403).
9. Slide the server into the rack.
10. Reconnect the external cables; then, reconnect the power cords and turn on
the peripheral devices and the server.
448 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 88. DIMM removal
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices and disconnect all power cords and
external cables.
3. Slide the server out of the rack.
4. Remove the cover (see “Removing the cover” on page 402).
5. If riser-card assembly 1 contains one or more adapters, remove it (see
“Removing a PCI riser-card assembly” on page 414).
6. Remove the air baffle over the DIMMs (see “Removing the DIMM air baffle”
on page 406
Attention: To avoid breaking the retaining clips or damaging the DIMM
connectors, open and close the clips gently.
7. Open the retaining clip on each end of the DIMM connector and lift the DIMM
from the connector.
8. If you are instructed to return the DIMM, follow all packaging instructions, and
use any packaging materials for shipping that are supplied to you.
450 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
2 = dual-rank
4 = quad-rank
- ff is the device organization (bit width)
4 = x4 organization (4 DQ lines per SDRAM)
8 = x8 organization
16 = x16 organization
- wwwww is the DIMM bandwidth, in MBps
6400 = 6.40 GBps (PC3-800 SDRAMs, 8-byte primary data bus)
8500 = 8.53 GBps (PC3-1066 SDRAMs, 8-byte primary data bus)
10600 = 10.66 GBps (PC3-1333 SDRAMs, 8-byte primary data bus)
12800 = 12.80 GBps PC3-1600 SDRAMs, 8-byte primary data bus)
- m is the DIMM type
E = Unbuffered DIMM (UDIMM) with ECC (x72-bit module data bus)
R = Registered DIMM (RDIMM)
U = Unbuffered DIMM with no ECC (x64-bit primary data bus)
- aa is the CAS latency, in clocks at maximum operating frequency
- bb is the JEDEC SPD Revision Encoding and Additions level
- cc is the reference design file for the design of the DIMM
- d is the revision number of the reference design of the DIMM
v The following rules apply to DDR3 DIMM speed as it relates to the number of
DIMMs in a channel:
– When you install 1 DIMM per channel, the memory runs at 1333 MHz
– When you install 2 DIMMs per channel, the memory runs at 1066 MHz
– When you install 3 DIMMs per channel, the memory runs at 800 MHz
– All channels in a server run at the fastest common frequency.
– Do not install registered and unbuffered DIMMs in the same server.
v The maximum memory speed is determined by the combination of the
microprocessor, DIMM speed, and the number of DIMMs installed in each
channel.
v In two-DIMM-per-channel configuration, a server with an Intel Xeon X5600
series microprocessor automatically operates with a maximum memory speed of
up to 1333 MHz when one of the following conditions is met:
– Two 1.5 V single-rank or dual-rank RDIMMs are installed in the same
channel. In the Setup utility, Memory speed is set to Max performance mode
– Two 1.35 V single-rank or dual-ranl RDIMMs are installed in the same
channel. In the Setup utility, Memory speed is set to Max performance and
LV-DIMM power is set to Enhance performance mode. The 1.35 V RDIMMs
will function at 1.5 V
v The server supports a maximum of 18 single-rank or dual-rank RDIMMs. The
server supports up to 12 single-rank or dual-rank UDIMMs or quad-rank
RDIMMs.
Note: To determine the type of a DIMM, see the label on the DIMM. The
information on the label is in the format xxxxx nRxxx PC3-xxxxx-xx-xx-xxx. The
numeral in the sixth numerical position indicates whether the DIMM is
single-rank (n=1), dual-rank (n=2), or quad-rank (n=4).
v The RDIMM options that are available for the server are 2 GB, 4 GB, 8 GB, and
16 GB. The server supports a minimum of 2 GB and a maximum of 288 GB of
system memory using RDIMMs.
For 32-bit operating systems only: Some memory is reserved for various system
resources and is unavailable to the operating system. The amount of memory
that is reserved for system resources depends on the operating system, the
configuration of the server, and the configured PCI devices.
v The UDIMM options that are available for the server are 2 GB and 4 GB. The
server supports a minimum of 2 GB and a maximum of 48 GB of system
memory using UDIMMs.
452 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
DIMM installation sequence
The server comes with a minimum of one 2 GB DIMM installed in slot 3. When
you install additional DIMMs, install them in the order shown in the following
table to optimize system performance. In non-mirroring mode, all three channels
on the memory interface for each microprocessor can be populated in any order
and have no matching requirements.When you install additional DIMMs, install
them in the order shown in Table 17, to maintain performance.
Important: If you have configured the server to use memory mirroring, do not use
the order in Table 17; go to “Memory mirroring” and use the installation order
shown there.
Table 17. DIMM installation sequence for non-mirroring (normal) mode
Installed microprocessors DIMM connector population sequence
Microprocessor socket 1 Install the DIMMs in the following sequence: 3, 6, 9, 2, 5, 8,
1, 4, 7
Microprocessor socket 2 Install the DIMMs in the following sequence: 12, 15, 18, 11,
14, 17, 10, 13, 16
Memory mirroring
Memory-mirroring mode replicates and stores data on two pairs of DIMMs within
two channels simultaneously. If a failure occurs, the memory controller switches
from the primary pair of memory DIMMs to the backup pair of DIMMs. To enable
memory mirroring through the Setup utility, select System Settings → Memory. For
details about enabling memory mirroring, see “Using the Setup utility” on page
493. When you use the memory mirroring feature, consider the following
information:
v When you use memory mirroring, you must install a pair of DIMMs at a time.
One DIMM must be in channel 0, and the mirroring DIMM must be in the same
connector in channel 1. The two DIMMs in each pair must be identical in size,
type, rank (single, dual, or quad), and organization. They do not have to be
identical in speed. The channels run at the speed of the slowest DIMM in any of
the channels. See Table 19 on page 455 for the DIMM connectors that are in each
pair.
v Channel 2, DIMM connectors 7, 8, 9, 16, 17, and 18 are not used in
memory-mirroring mode.
v The maximum available memory is reduced to half of the installed memory
when memory mirroring is enabled. For example, if you install 64 GB of
memory using RDIMMs, only 32 GB of addressable memory is available when
you use memory mirroring.
The following diagram shows the memory channel interface layout with the
DIMM installation sequence for mirroring mode. The numbers within the boxes
indicate the DIMM population sequence in pairs within the channels, and the
numbers next to the boxes indicate the DIMM connectors within the channels. For
example, the following illustration shows the first pair of DIMMs (indicated by
ones (1) inside the boxes) should be installed in DIMM connectors 1 on channel 0
and DIMM connector 2 on channel 1. DIMM connectors 3, 6, 9, 12, 15, and 18 on
channel 2 are not used in memory-mirroring mode.
3 2 1 10 11 12
CPU1 CPU2
QPI
1 2 3 CH1 CH1 6 5 4
6 5 4 13 14 15
CH2 CH2
9 8 7 16 17 18
The following table lists the DIMM connectors on each memory channel.
Table 18. Connectors on each memory channel
Memory channel DIMM connectors
Channel 0 1, 2, 3, 10, 11, 12
Channel 1 4, 5, 6, 13, 14, 15
Channel 2 7, 8, 9, 16, 17, 18
The following illustration shows the memory connector layout that is associated
with each microprocessor. For example, DIMM connectors 10, 11, 12, 13, 14, 15, 16,
17, and 18 (DIMM connectors are shown underneath the boxes) are associated with
microprocessor 2 slot (CPU2) and DIMM connectors 1, 2, 3, 4, 5, 6, 7, 8, and 9 are
associated with microprocessor 1 slot (CPU1). The numbers within the boxes
indicates the installation sequence of the DIMM pairs. For example, the first DIMM
pair (indicated within the boxes by ones (1)) should be installed in DIMM
connectors 1 and 2, which is associated with microprocessor 1 (CPU1).
Note: You can install DIMMs for microprocessor 2 as soon as you install
microprocessor 2; you do not have to wait until all of the DIMM connectors for
microprocessor 1 are filled.
CPU2 6 5 4 6 5 4
10 11 12 13 14 15 16 17 18
1 2 3 1 2 3 CPU1
9 8 7 6 5 4 3 2 1
The following table lists the installation sequence for installing DIMMs in
memory-mirroring mode.
454 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Table 19. Memory-mirroring mode DIMM population sequence
Number of installed
DIMMs microprocessors DIMM connector
First pair of DIMMs 1 3, 6
Second pair of DIMMs 1 2, 5
Third pair of DIMMs 1 1, 4
Fourth pair of DIMMs 2 12, 15
Fifth pair of DIMMs 2 11, 14
Sixth pair of DIMMs 2 10, 13
Note: DIMM connectors 7, 8, 9, 16, 17, and 18 are not used in memory-mirroring mode.
When you install or remove DIMMs, the server configuration information changes.
When you restart the server, the system displays a message that indicates that the
memory configuration has changed.
Online-spare memory
The server supports online-spare memory. This feature disables the failed memory
from the system configuration and activates an online-spare DIMM to replace the
failed active DIMM. You can enable either online-spare memory or memory
mirroring in the Setup utility (see “Using the Setup utility” on page 493). When
you use the memory online-spare feature, consider the following information:
v The memory online-spare feature is supported on server models with an Intel
Xeon™ 5600 series microprocessor.
v When you enable the memory online-spare feature, you must install three
DIMMs per microprocessor at a time. The first DIMM must be in channel 0, the
second DIMM in channel 1, and the online-spare DIMM in channel 2. The
DIMMs must be identical in size, type, rank, and organization, but not in speed.
The channels run at the speed of the slowest DIMM in any of the channels.
v The maximum available memory is reduced to 2/3 of the installed memory
when memory online-spare mode is enabled. For example, if you install 72 GB
of memory using RDIMMs, only 48 GB of addressable memory is available
when you use memory online-spare.
The following table shows the installation sequence for installing DIMMs for each
microprocessor and the online-spare DIMM in memory online-spare mode:
Table 20. Memory online-spare mode DIMM population sequence
Installed
Microprocessor DIMM connector
Microprocessor 1 3, 6, 9
3, 6, 9, 2, 5, 8
3, 6, 9, 2, 5, 8, 1, 4, 7
Microprocessor 2 12, 15, 18,
12, 15, 18, 11, 14, 17,
12, 15, 18, 11, 14, 17, 10, 13, 16
456 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
12. Reconnect the external cables; then, reconnect the power cords and turn on
the peripheral devices and the server.
13. Go to the Setup utility and make sure all the installed DIMMs are present and
enabled.
To remove any of the three replaceable fans, complete the following steps.
Vertical tabs
Fan 3
Fan 2 Fan 1
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Leave the server connected to power.
3. Slide the server out of the rack and remove the cover (see “Removing the
cover” on page 402). The LED on the system board near the connector for the
failing fan will be lit.
Attention: To ensure proper system cooling, do not remove the top cover for
more than 30 minutes during this procedure.
4. Grasp the fan by the finger grips on the sides of the fan.
5. Lift the fan out of the server.
6. Replace the fan within 30 seconds.
7. If you are instructed to return the fan, follow all packaging instructions, and
use any packaging materials for shipping that are supplied to you.
For proper cooling, the server requires that all three fans be installed at all times.
See “System-board internal connectors” on page 17 for the locations of the fan
connectors.
To install any of the three replaceable fans, complete the following steps.
Vertical tabs
Fan 3
Fan 2 Fan 1
Procedure
1. Orient the new fan over its position in the fan bracket so that the connector on
the bottom aligns with the fan connector on the system board.
2. Align the vertical tabs on the fan with the slots on the fan cage bracket.
3. Push the new fan into the fan connector on the system board. Press down on
the top surface of the fan to seat the fan fully. (Make sure that the LED has
turned off.)
4. Repeat steps 1 through 3 until all the new or replacement fans are installed.
5. Install the cover (see “Replacing the cover” on page 403).
6. Slide the server into the rack.
458 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Removing a hot-swap fan
The server comes with three replaceable fans.
Attention: To ensure proper server operation and cooling, if you remove a fan
with the system running, you must install a replacement fan within 30 seconds or
the system will shut down.
Vertical tabs
Fan 3
Fan 2 Fan 1
Procedure
1. Read the safety information that begins on page “Safety” on page vii,
“Installation guidelines” on page 395.
2. Leave the server connected to power.
3. Slide the server out of the rack and remove the cover (see Removing the cover).
The LED near the failing fan will be lit.
Attention: To ensure proper system cooling, do not remove the top cover for
more than 30 minutes during this procedure.
4. Lift the fan out of the server.
5. Replace the fan within 30 seconds (see Installing a hot-swap fan).
Results
The following notes describe the type of ac power supply that the server supports
and other information that you must consider when you install a power supply:
v Make sure that the devices that you are installing are supported. For a list of
supported optional devices for the server, see http://www.ibm.com/systems/
info/x86servers/serverproven/compat/us/.
v Before you install an additional power supply or replace a power supply with
one of a different wattage, you may use the IBM Power Configurator utility to
determine current system power consumption. For more information and to
download the utility, go to http://www.ibm.com/systems/bladecenter/
resources/powerconfig.html.
v The server comes with one hot-swap 12-volt output power supply that connects
to power supply bay 1. The input voltage is 110 V ac or 220 V ac auto-sensing.
v You cannot mix 460-watt and 675-watt power supplies, high-efficiency and
non-high-efficiency power supplies, or ac and dc power supplies in the server.
v The following information applies when you install 460-watt power supplies in
the server:
– A warning message is generated if the total power consumption exceeds 400
watts and the server only has one operational 460-watt power supply. In this
case, the server can still operate under normal condition. Before you install
additional components in the server, you must install an additional power
supply
– The server automatically shuts down if the total power consumption exceeds
the total power supply output capacity
– You can enable the power capping feature in the Setup utility to control and
monitor power consumption in the server (see “Setup utility menu choices”
on page 494)
The following table lists the system status when you install 460-watt power
supplies in the server:
Table 21. System status with 460-watt power supplies installed
Total system power
Number of 460-watt power supplies installed
consumption (in
watts) One Two Two with one failure
< 400 Normal Normal, redundant Normal
power
400 ~ 460 Normal, status Normal, redundant Normal, status
warning power warning
> 460 System shutdown Normal System shutdown
460 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v For instructions on how to install a hot-swap dc power supply, refer to the
documentation that comes with the dc power supply.
Statement 5
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current from
the device, ensure that all power cords are disconnected from the power source.
2
1
Statement 8
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any component
that has this label attached. There are no serviceable parts inside these
components. If you suspect a problem with one of these parts, contact a service
technician.
Note: The procedure below describes how to install a hot-swap ac power supply,
for instructions on how to install a hot-swap dc power supply, refer to the
documentation that comes with the dc power supply.
Attention: During normal operation, each power-supply bay must contain either
a power supply or power-supply filler for proper cooling.
Procedure
1. Slide the power supply into the bay until the retention latch clicks into place.
Attention: Do not install 460-watt and 675-watt power supplies,
high-efficiency and non-high-efficiency power supplies, or ac and dc power
supplies in the server.
2. Connect the power cord for the new power supply to the power-cord connector
on the power supply.
The following illustration shows the power-cord connectors on the back of the
server.
3. Route the power cord through the power-supply handle and through any cable
clamps on the rear of the server, to prevent the power cord from being
accidentally pulled out when you slide the server in and out of the rack.
4. Connect the power cord to a properly grounded electrical outlet.
5. Make sure that the error LED on the power supply is not lit, and that the dc
power LED and ac power LED on the power supply are lit, indicating that the
power supply is operating correctly.
6. If you are replacing a power supply with one of a different wattage, apply the
power information label provided with the new power supply over the existing
462 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
power information label on the server.
x.x/x.x
xx/xx HZ
Statement 2
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has a
module containing a lithium battery, replace it only with the same module type
made by the same manufacturer. The battery contains lithium and can explode if
not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
Attention: Neither tilt nor push the battery by using excessive force.
c. Use your thumb and index finger to lift the battery from the socket.
464 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Attention: Do not lift the battery by using excessive force. Failing to remove
the battery properly may damage the socket on the system board. Any damage
to the socket may require replacing the system board.
9. Dispose of the battery as required by local ordinances or regulations. See the
IBM Environmental Notices and User's Guide on the IBM Documentation CD for
more information.
The following notes describe information that you must consider when you replace
the battery in the server.
v You must replace the battery with a lithium battery of the same type from the
same manufacturer.
v After you replace the battery, you must reconfigure the server and reset the
system date and time.
v To avoid possible danger, read and follow the following safety statement.
v To order replacement batteries, call 1-800-IBM-SERV within the United States,
and 1-800-465-7999 or 1-800-465-6666 within Canada. Outside the U.S. and
Canada, call your support center or business partner.
Statement 2
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has a
module containing a lithium battery, replace it only with the same module type
made by the same manufacturer. The battery contains lithium and can explode if
not properly used, handled, or disposed of.
Do not:
v Throw or immerse into water
v Heat to more than 100°C (212°F)
v Repair or disassemble
See the IBM Environmental Notices and User's Guide on the IBM Documentation CD
for more information.
Procedure
1. Follow any special handling and installation instructions that come with the
replacement battery.
2. Insert the new battery:
b. Press the battery down into the socket until it clicks into place. Make sure
that the battery clip holds the battery securely.
c. If you removed a rubber cover from the battery holder, use your fingers to
install the battery cover on top of the battery connector.
3. Reinstall any adapters that you removed.
4. Reconnect the internal cables that you disconnected (see “Internal cable routing
and connectors” on page 398).
5. Install the cover (see “Replacing the cover” on page 403).
6. Slide the server into the rack.
7. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
Note: You must wait approximately 2.5 minutes after you connect the power
cord of the server to an electrical outlet before the power-control button
becomes active.
8. Start the Setup utility and reset the configuration.
v Set the system date and time.
v Set the power-on password.
v Reconfigure the server.
See Chapter 6, “Configuration information and instructions,” on page 491
To remove the operator information panel assembly, complete the following steps.
466 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 102. Operator information panel removal
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Remove the cover (see “Removing the cover” on page 402).
3. Disconnect the cable from the back of the operator information panel assembly.
4. Reach inside the server and press the release tab; then, while you hold the
release tab down, push the assembly toward the front of the server.
5. From the front of the server, carefully pull the operator information panel
assembly out of the server.
6. If you are instructed to return the operator information panel assembly, follow
all packaging instructions, and use any packaging materials for shipping that
are supplied to you.
The illustrations in this document might differ slightly from your hardware.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Remove all the cables that are connected to the front of the server.
3. Remove the screws from the bezel.
4. Rotate the top of the bezel away from the server.
468 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing the bezel
Use this information to replace the bezel.
Procedure
1. Insert the tabs on the bottom of the bezel into the slots on the underside of the
chassis and attach it with the screws.
2. Connect any cables you previously removed from the front of the server.
To remove the SAS hard disk drive backplane, complete the following steps.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices, and disconnect the power cord and
all external cables.
3. Slide the server out of the rack.
4. Remove the cover (see “Removing the cover” on page 402).
5. Pull the hard disk drives or fillers out of the server slightly to disengage them
from the backplane. See “Removing a hot-swap hard disk drive” on page 440
for details.
6. To obtain more working room, remove the fans (see “Removing a hot-swap
fan” on page 457).
7. Lift the backplane out of the server by pulling it toward the rear of the server
and then lifting it up.
8. Disconnect the backplane power, signal, and configuration cables (see “Internal
cable routing and connectors” on page 398).
9. If you are instructed to return the backplane, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
470 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing the SAS hard disk drive backplane
Use this information to replace the SAS hard disk drive backplane.
To install the replacement SAS hard disk drive backplane, complete the following
steps.
Procedure
1. Connect the power and signal cables to the replacement backplane (see
“Internal cable routing and connectors” on page 398).
2. Align the backplane with the backplane slot in the chassis and the small slots
on top of the hard disk drive cage.
3. Lower the backplane into the slots on the chassis.
4. Rotate the top of the backplane until the front tab clicks into place into the
latches on the chassis.
5. Insert the hard disk drives and the fillers the rest of the way into the bays.
6. Replace the fan bracket and fans if you removed them (see “Replacing the fan
bracket” on page 409 and “Replacing a hot-swap fan” on page 458).
7. Install the cover (see “Replacing the cover” on page 403).
8. Slide the server into the rack.
9. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
To remove the simple-swap hard disk drive backplate, complete the following
steps.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server and peripheral devices, and disconnect the power cord and
all external cables.
3. Slide the server out of the rack.
4. Remove the cover (see “Removing the cover” on page 402).
5. Pull the hard disk drives or fillers out of the server slightly to disengage them
from the backplate. See “Removing a simple-swap hard disk drive” on page
442 for details.
6. To obtain more working room, remove the fans (see “Removing a hot-swap
fan” on page 457).
7. Lift the backplate out of the server by pulling it and lifting it up.
8. Disconnect the backplate power, signal, and configuration cables (see “Internal
cable routing and connectors” on page 398).
9. If you are instructed to return the backplate, follow all packaging instructions,
and use any packaging materials for shipping that are supplied to you.
472 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing the simple-swap hard disk drive backplate
Use this information to replace the simple-swap hard disk drive backplate.
To install the replacement simple-swap hard disk drive backplate, complete the
following steps.
Procedure
1. Connect the power and signal cables to the replacement backplate (see
“Internal cable routing and connectors” on page 398).
2. Align the backplate with the backplate slot in the chassis and the small slots on
top of the hard disk drive cage.
3. Lower the backplate into the slots on the chassis.
4. Rotate the top of the backplate until the front tab clicks into place into the
latches on the chassis.
5. Insert the hard disk drives and the fillers the rest of the way into the bays.
6. Replace the fan bracket and fans if you removed them (see “Replacing the fan
bracket” on page 409 and “Replacing a hot-swap fan” on page 458).
7. Install the cover (see “Replacing the cover” on page 403).
8. Slide the server into the rack.
9. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
The illustrations in this document might differ slightly from the hardware.
Attention:
v Always use the microprocessor installation tool to remove a microprocessor.
Failing to use the microprocessor installation tool may damage the
microprocessor sockets on the system board. Any damage to the microprocessor
sockets may require replacing the system board.
v Do not allow the thermal grease on the microprocessor and heat sink to come in
contact with anything. Contact with any surface can compromise the thermal
grease and the microprocessor socket.
v Dropping the microprocessor during installation or removal can damage the
contacts.
v Do not touch the microprocessor contacts; handle the microprocessor by the
edges only. Contaminants on the microprocessor contacts, such as oil from your
skin, can cause connection failures between the contacts and the socket.
Procedure
1. Read the safety information that begins on page “Safety” on page vii,
“Handling static-sensitive devices” on page 398, and “Installation guidelines”
on page 395.
2. Turn off the server and peripheral devices and disconnect the power cord and
all external cables.
3. Remove the cover (see “Removing the cover” on page 402).
4. Depending on which microprocessor you are removing, remove the following
components, if necessary:
v Microprocessor 1: PCI riser-card assembly 1 and DIMM air baffle (see
“Removing a PCI riser-card assembly” on page 414 and “Removing the
DIMM air baffle” on page 406)
v Microprocessor 2: PCI riser-card assembly 2 and microprocessor 2 air baffle
(see “Removing a PCI riser-card assembly” on page 414 and “Removing the
DIMM air baffle” on page 406).
5. Open the heat-sink release lever to the fully open position.
474 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 110. Heat-sink release lever
6. Lift the heat sink out of the server. If the heat sink sticks to the
microprocessor, slightly twist the heat sink back and forth to break the seal.
After removal, place the heat sink on its side on a clean, flat surface.
7. Release the microprocessor retention latch by pressing down on the end,
moving it to the side, and releasing it to the open (up) position.
8. Open the microprocessor bracket frame by lifting up the tab on the top edge.
Keep the bracket frame in the open position.
Microprocessor
Microprocessor bracket frame
release lever
9. Locate the microprocessor installation tool that comes with the new
microprocessor.
10. Twist the handle on the microprocessor tool counterclockwise so that it is in
the open position.
Handle
Installation tool
Installation tool
Microprocessor
Alignment pins
12. Twist the handle on the installation tool clockwise and lift the microprocessor
out of the socket.
Handle
Installation tool
13. Carefully lift the microprocessor straight up and out of the socket, and place it
on a static-protective surface. Remove the microprocessor from the installation
tool by twisting the handle counterclockwise.
14. If you do not intend to install a microprocessor in the socket, install the socket
dust cover on the socket.
Attention: The pins on the socket are fragile. Any damage to the pins may
require replacing the system board.
15. If you are instructed to return the microprocessor, follow all packaging
instructions, and use any packaging materials for shipping that are supplied
to you.
476 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Replacing a microprocessor and heat sink
Use this information to replace a microprocessor and heat sink.
For information about the type of microprocessor that the server supports and
other information that you must consider when you install a microprocessor, see
the Installation and User’s Guide on the IBM Documentation CD.
Read the documentation that comes with the microprocessor to determine whether
you must update the IBM System x Server Firmware.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
To download the most current level of server firmware, complete the following
steps:
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Software and device drivers.
4. Click System x3650 M3 to display the matrix of downloadable files for the
server.
Important:
v Always use the microprocessor installation tool to install a microprocessor.
Failing to use the microprocessor installation tool may damage the
microprocessor sockets on the system board. Any damage to the microprocessor
sockets may require replacing the system board.
v A startup (boot) microprocessor must always be installed in microprocessor
connector 1 on the system board.
v Make sure the microprocessor that you are about to install is supported. See
Chapter 4, “Parts listing,” on page 381 for a list of supported microprocessors.
v Do not install an Intel Xeon™ 5500 series microprocessor and an Intel Xeon™
5600 series microprocessor in the same server.
v To ensure correct server operation, make sure that you use microprocessors that
are compatible and you have installed an additional DIMM for microprocessor 2.
Compatible microprocessors must have the same QuickPath Interconnect (QPI)
link speed, integrated memory controller frequency, core frequency, power
segment, cache size, and type.
v Microprocessors with different stepping levels are supported in this server. If
you install microprocessors with different stepping levels, it does not matter
which microprocessor is installed in microprocessor connector 1 or connector 2.
v If you are installing a microprocessor that has been removed, make sure that it is
paired with its original heat sink or a new replacement heat sink. Do not reuse a
heat sink from another microprocessor; the thermal grease distribution might be
different and might affect conductivity.
v If you are installing a new heat sink, remove the protective backing from the
thermal material that is on the underside of the new heat sink.
v If you are installing a new heat-sink assembly that did not come with thermal
grease, see “Thermal grease” on page 482 for instructions for applying thermal
grease; then, continue with step 1 on page 478 of this procedure.
Procedure
1. Touch the static-protective package that contains the microprocessor to any
unpainted metal surface on the server. Then, remove the microprocessor from
the package.
2. Rotate the microprocessor release lever on the socket from its closed and
locked position until it stops in the fully open position.
Attention:
v Do not touch the microprocessor contact; handle the microprocessor by the
edges only. Contaminants on the microprocessor contacts, such as oil from
your skin, can cause connection failures between the contacts and the
socket.
v Handle the microprocessor carefully. Dropping the microprocessor during
installation or removal can damage the contacts.
v Do not use excessive force when you press the microprocessor into the
socket.
v Make sure that the microprocessor is oriented and aligned and positioned
in the socket before you try to close the lever.
3. If there is a plastic protective cover on the bottom of the microprocessor,
carefully remove it.
Protective
cover
Microprocessor
4. Locate the microprocessor installation tool that comes with the new
microprocessor.
5. Twist the handle of the installation tool counterclockwise so that it is in the
open position.
478 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Handle
Installation tool
6. Align the microprocessor alignment slots with the alignment pins on the
microprocessor installation tool and place the microprocessor on the underside
of the tool so that the tool can grasp the microprocessor correctly.
Microprocessor
Installation tool
Alignment
pin slots
Alignment
pins
7. Twist the handle of the installation tool clockwise to secure the microprocessor
in the tool.
Handle
Installation tool
Installation tool
Microprocessor
Alignment pins
Handle
Installation tool
480 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 121. Heat sink
a. Make sure that the heat-sink release lever is in the open position.
b. Remove the plastic protective cover from the bottom of the heat sink.
c. If the new heat sink did not come with thermal grease, apply thermal
grease on the microprocessor before you install the heat sink (see “Thermal
grease” on page 482).
d. Align the heat sink above the microprocessor with the thermal grease side
down.
e. Slide the flange of the heat sink into the opening in the retainer bracket.
f. Press down firmly on the heat sink until it is seated securely.
g. Rotate the heat-sink release lever to the closed position and hook it
underneath the lock tab.
13. Replace the components that you removed in “Removing a microprocessor
and heat sink” on page 474
v Microprocessor 1: DIMM air baffle and PCI riser-card assembly 1 (see
“Replacing the DIMM air baffle” on page 407 and “Replacing a PCI
riser-card assembly” on page 415)
v Microprocessor 2: Microprocessor 2 air baffle and PCI riser-card assembly 2
(see “Replacing the microprocessor 2 air baffle” on page 405 and “Replacing
a PCI riser-card assembly” on page 415).
14. Install the cover (see “Replacing the cover” on page 403).
15. Slide the server into the rack.
16. Reconnect the external cables; then, reconnect the power cords and turn on
the peripheral devices and the server.
Procedure
1. Place the heat-sink assembly on a clean work surface.
2. Remove the cleaning pad from its package and unfold it completely.
3. Use the cleaning pad to wipe the thermal grease from the bottom of the heat
exchanger.
0.02 mL of thermal
grease
Microprocessor
5. Use the thermal-grease syringe to place nine uniformly spaced dots of 0.02 mL
each on the top of the microprocessor.
Note: 0.01mL is one tick mark on the syringe. If the grease is properly applied,
approximately half (0.22 mL) of the grease will remain in the syringe.
482 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Removing a heat-sink retention module
Use this information to remove a heat-sink retention module.
5. Remove the four screws that secure the heat-sink retention module to the
system board; then, lift the heat-sink retention module from the system board.
6. If you are instructed to return the heat-sink retention module, follow all
packaging instructions, and use any packaging materials for shipping that are
supplied to you.
2. Install the four screws that secure the module to the system board.
Attention: Make sure that you install each heat sink with its paired
microprocessor (see steps 3 on page 483 and 4 on page 483).
3. Install the microprocessor, heat sink, and applicable air baffle (see “Replacing a
microprocessor and heat sink” on page 477).
4. Install the cover (see “Replacing the cover” on page 403).
5. Slide the server into the rack.
6. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
484 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
10. Remove the fans (see “Removing a hot-swap fan” on page 457).
11. Disconnect all cables from the system board (see “Internal cable routing and
connectors” on page 398).
Attention:
v In the following step, do not allow the thermal grease to come in contact
with anything, and keep each heat sink paired with its microprocessor for
reinstallation. Contact with any surface can compromise the thermal grease
and the microprocessor socket; a mismatch between the microprocessor and
its original heat sink can require the installation of a new heat sink.
v Disengage all latches, release tabs or locks on cable connectors when you
disconnect all cables from the system board. Please refer to “Internal cable
routing and connectors” on page 398 for more information. Failing to
release them before removing the cables will damage the cable sockets on
the system board. The cable sockets on the system board are fragile. Any
damage to the cable sockets may require replacing the system board.
12. Remove each microprocessor heat sink and microprocessor; then, place them
on a static-protective surface for reinstallation (see “Removing a
microprocessor and heat sink” on page 474).
13. Push in and lift up the two system-board release latches on each side of the
fan cage.
System board
release latches
14. Slide the system board forward and tilt it away from the power supplies.
Using the two lift handles on the system board, pull the system board out of
the server.
Attention: When you pull the system board out of the server, be careful not
to damage any surrounding components and not to bend the pin inside the
microprocessor socket.
15. If you are instructed to return the system board, follow all packaging
instructions, and use any packaging materials for shipping that are supplied
to you.
16. Remove the socket covers from the microprocessor sockets on the new system
board and place them on the microprocessor sockets of the system board you
are removing.
Attention: Make sure to place the socket covers for the microprocessor
sockets on the system board before you return the old system board.
Notes:
1. When you reassemble the components in the server, be sure to route all cables
carefully so that they are not exposed to excessive pressure (see “Internal cable
routing and connectors” on page 398).
2. When you replace the system board, you must either update the server with
the latest firmware or restore the pre-existing firmware that the customer
provides on a diskette or CD image. Make sure that you have the latest
firmware or a copy of the pre-existing firmware before you proceed. See
“Updating the firmware” on page 491, “Updating the Universal Unique
Identifier (UUID)” on page 511, and “Updating the DMI/SMBIOS data” on
page 514 for more information.
486 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
4. If you see the error message Non-compatible/non-supported CPU, see PDSG for
more information appears, the microprocessor that you installed is not
supported. See Chapter 4, “Parts listing,” on page 381 for a list of supported
microprocessors.
Procedure
1. Align the system board at an angle, as shown in the illustration; then, rotate
and lower it flat and slide it back toward the rear of the server. Make sure
that the rear connectors extend through the rear of the chassis.
2. Reconnect to the system board the cables that you disconnected in step 11 on
page 485 of “Removing the system board” on page 484 (see “Internal cable
routing and connectors” on page 398).
3. Rotate the system-board release latch toward the rear of the server until the
latch clicks into place.
4. Install the fans.
5. Install each microprocessor with its matching heat sink (see “Replacing a
microprocessor and heat sink” on page 477).
6. Install the DIMMs (see “Installing a memory module” on page 450).
7. Install the air baffles (see “Replacing the DIMM air baffle” on page 407) and
“Removing the microprocessor 2 air baffle” on page 404, making sure that all
cables are out of the way.
8. Install the SAS riser-card and controller assembly (see “Replacing the SAS
riser-card and controller assembly” on page 426).
9. If necessary, install the Ethernet adapter.
10. If necessary, install the virtual media key.
11. Install the PCI riser-card assemblies and all adapters (see “Replacing a PCI
riser-card assembly” on page 415).
12. Install the cover (see “Replacing the cover” on page 403).
13. Push the power supplies back into the server.
14. Slide the server into the rack.
15. Reconnect the external cables; then, reconnect the power cords and turn on
the peripheral devices and the server.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
Procedure
1. Read the safety information that begins on page “Safety” on page vii and
“Installation guidelines” on page 395.
2. Turn off the server, and disconnect all power cords and external cables.
3. Pull the server out of the rack.
4. Remove the server cover (see “Removing the cover” on page 402).
5. Remove the SAS riser card assembly (see “Removing the SAS riser-card and
controller assembly” on page 424).
488 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Figure 130. 240 VA safety cover removal
Procedure
1. Line up and insert the tabs on the bottom of the safety cover into the slots on
the system board.
2. Slide the safety cover toward the back of the server until it is secure.
3. Connect the hard disk drive backplane power cables to the connector in front
of the safety cover.
4. Install the screw into the safety cover.
5. Install the SAS riser-card assembly (see “Replacing the SAS riser-card and
controller assembly” on page 426).
6. Install the server cover (see “Replacing the cover” on page 403).
7. Slide the server into the rack.
8. Reconnect the external cables; then, reconnect the power cords and turn on the
peripheral devices and the server.
490 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Chapter 6. Configuration information and instructions
This chapter provides information about updating the firmware and using the
configuration utilities.
Important: Some cluster solutions require specific code levels or coordinated code
updates. If the device is part of a cluster solution, verify that the latest level of
code is supported for the cluster solution before you update the code.
The firmware for the server is periodically updated and is available for download
from the Web. To check for the latest level of firmware, such as server firmware,
vital product data (VPD) code, device drivers, and service processor firmware
complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Software and device drivers.
4. Click System x3650 M3 to display the matrix of downloadable files for the
server.
Download the latest firmware for the server; then, install the firmware, using the
instructions that are included with the downloaded files.
When you replace a device in the server, you might have to either update the
firmware that is stored in memory on the device or restore the pre-existing
firmware from a diskette or CD image.
v BIOS code is stored in ROM on the system board.
v IMM firmware is stored in ROM on the IMM on the system board.
v Ethernet firmware is stored in ROM on the Ethernet controller.
v ServeRAID firmware is stored in ROM on the ServeRAID adapter.
v SATA firmware is stored in ROM on the integrated SATA controller.
v SAS/SATA firmware is stored in ROM on the SAS/SATA controller on the
system board.
492 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Use the LSI Configuration Utility program to configure the integrated
SAS/SATA controller with RAID capabilities and the devices that are attached to
it. For information about using this program, see “Using the LSI Configuration
Utility program” on page 507.
The following table lists the different server configurations and the applications
that are available for configuring and managing RAID arrays.
Table 22. Server configuration and applications for configuring and managing RAID arrays
RAID array configuration RAID array management
(before operating system is (after operating system is
Server configuration installed) installed)
ServeRAID-BR10i adapter LSI Utility (Setup utility, MegaRAID Storage Manager
(LSI 1068E) press Ctrl+C), ServerGuide (for monitoring storage only)
ServeRAID-BR10il v2 adapter LSI Utility (Setup utility, MegaRAID Storage Manager
(LSI 1064E) press Ctrl+C), ServerGuide and IBM Director
ServeRAID-MR10i adapter MegaRAID Storage Manager MegaRAID Storage Manager
(LSI 1078) (MSM), MegaRAID BIOS (MSM)
Configuration Utility (press
C to start), ServerGuide
ServeRAID-M5014 adapter MegaRAID Storage Manager MegaRAID Storage Manager
(LSI SAS2108) (MSM), MegaCLI (Command and IBM Director
Line Interface), ServerGuide
ServeRAID-M5015 adapter MegaRAID Storage Manager MegaRAID Storage Manager
(LSI SAS2108) (MSM), MegaCLI (Command and IBM Director
Line Interface), ServerGuide
ServeRAID-M1050 adapter MegaRAID Storage Manager MegaRAID Storage Manager
(LSI SAS2008) (MSM), MegaCLI (Command and IBM Director
Line Interface), ServerGuide
Use the Setup utility, formerly called the Configuration/Setup Utility program, to
perform the following tasks:
v View configuration information
v View and change assignments for devices and I/O ports
v Set the date and time
v Set the startup characteristics of the server and the order of startup devices
v Set and change settings for advanced hardware features
v View, set, and change settings for power-management features
v View and clear error logs
v Resolve configuration conflicts
Procedure
1. Turn on the server.
494 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
controller. If you disable a device, it cannot be configured, and the operating
system will not be able to detect it (this is equivalent to disconnecting the
device).
– Power
Select this choice to view or change power capping to control consumption,
processors, and performance states.
– Operating Modes
Select this choice to view or change the operating profile (for example,
performance and power utilization).
– Legacy Support
Select this choice to view or set legacy support.
- Force Legacy Video on Boot
Select this choice to force INT video support, if the operating system does
not support UEFI video output standards.
- Rehook INT
Select this choice to enable or disable devices from taking control of the
boot process. The default is Disable.
- Legacy Thunk Support
Select this choice to enable or disable the UEFI to interact with PCI mass
storage devices that are not UEFI-compliant.
– Integrated Management Module
Select this choice to view or change the settings for the integrated
management module.
- POST Watchdog Timer
Select this choice to view or enable the POST watchdog timer.
- POST Watchdog Timer Value
Select this choice to view or set the POST loader watchdog timer value.
- Reboot System on NMI
Enable or disable restarting the system whenever a nonmaskable interrupt
(NMI) occurs. Disabled is the default.
- Commands on USB Interface Preference
Select this choice to enable or disable the Ethernet over USB interface on
IMM.
- Network Configuration
Select this choice to view the system management network interface port,
the IMM MAC address, the current IMM IP address, and host name; define
the static IMM IP address, subnet mask, and gateway address; specify
whether to use the static IP address or have DHCP assign the IMM IP
address; save the network changes; and reset the IMM.
- Reset IMM to Defaults
Select this choice to view or reset IMM to the default settings.
- Reset IMM
Select this choice to reset IMM.
– Adapters and UEFI Drivers
Select this choice to view information about the adapters and drivers in the
server that are compliant with EFI 1.10 and UEFI 2.0.
v Network
496 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Select this choice to clear the system-event log.
v User Security
Select this choice to set, change, or clear passwords. See “Passwords” for more
information.
This choice is on the full and limited Setup utility menu.
– Set Power-on Password
Select this choice to set or change a power-on password. For more
information, see “Power-on password” on page 498.
– Clear Power-on Password
Select this choice to clear a power-on password. For more information, see
“Power-on password” on page 498.
– Set Administrator Password
Select this choice to set or change an administrator password. An
administrator password is intended to be used by a system administrator; it
limits access to the full Setup utility menu. If an administrator password is
set, the full Setup utility menu is available only if you type the administrator
password at the password prompt. For more information, see “Administrator
password” on page 499.
– Clear Administrator Password
Select this choice to clear an administrator password. For more information,
see “Administrator password” on page 499.
v Save Settings
Select this choice to save the changes that you have made in the settings.
v Restore Settings
Select this choice to cancel the changes that you have made in the settings and
restore the previous settings.
v Load Default Settings
Select this choice to cancel the changes that you have made in the settings and
restore the factory settings.
v Exit Setup
Select this choice to exit from the Setup utility. If you have not saved the
changes that you have made in the settings, you are asked whether you want to
save the changes or exit without saving them.
Passwords
From the User Security menu choice, you can set, change, and delete a power-on
password and an administrator password. The User Security choice is on the full
Setup utility menu only.
If you set only a power-on password, you must type the power-on password to
complete the system startup and to have access to the full Setup utility menu.
If you set a power-on password for a user and an administrator password for a
system administrator, you must type the power-on password to complete the
system startup. A system administrator who types the administrator password has
access to the full Setup utility menu; the system administrator can give the user
Power-on password:
If a power-on password is set, when you turn on the server, the system startup
will not be completed until you type the power-on password. You can use any
combination of between six and 20 printable ASCII characters for the password.
When a power-on password is set, you can enable the Unattended Start mode, in
which the keyboard and mouse remain locked but the operating system can start.
You can unlock the keyboard and mouse by typing the power-on password.
If you forget the power-on password, you can regain access to the server in any of
the following ways:
v If an administrator password is set, type the administrator password at the
password prompt. Start the Setup utility and reset the power-on password.
v Remove the battery from the server and then reinstall it. See the Problem
Determination and Service Guide on the IBM Documentation CD for instructions for
removing the battery.
v Change the position of the power-on password switch (enable switch 1 of the
system board switch block (SW4) to bypass the power-on password check (see
“System-board switches and jumpers” on page 18 for more information).
Attention: Before you change any switch settings or move any jumpers, turn
off the server; then, disconnect all power cords and external cables. See the
safety information that begins on page “Safety” on page vii. Do not change
settings or move jumpers on any system-board switch or jumper blocks that are
not shown in this document.
The default for all of the switches on switch block (SW4) is Off.
While the server is turned off, move switch 1 of the switch block (SW4) to the
On position to enable the power-on password override. You can then start the
Setup utility and reset the power-on password. You do not have to return the
switch to the previous position.
The power-on password override switch does not affect the administrator
password.
498 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Administrator password:
If an administrator password is set, you must type the administrator password for
access to the full Setup utility menu. You can use any combination of between six
and 20 printable ASCII characters for the password.
Attention: If you set an administrator password and then forget it, there is no
way to change, override, or remove it. You must replace the system board.
To use the Boot Selection Menu program, complete the following steps:
Procedure
1. Turn off the server.
2. Restart the server.
3. Press F12 (Select Boot Device). If a bootable USB mass storage device is
installed, a submenu item (USB Key/Disk) is displayed.
4. Use the Up Arrow and Down Arrow keys to select an item from the Boot
Selection Menu and press Enter.
Results
The next time the server starts, it returns to the startup sequence that is set in the
Setup utility.
To force the server to start from the backup copy, turn off the server; then, place
the UEFI boot recovery J29 jumper in the backup position (pins 2 and 3).
Use the backup copy of the server firmware until the primary copy is restored.
After the primary copy is restored, turn off the server; then, move the UEFI boot
recovery J29 jumper back to the primary position (pins 1 and 2).
You can download a free image of the ServerGuide Setup and Installation CD or
purchase the CD from the ServerGuide fulfillment Web site at
http://www.ibm.com/systems/management/serverguide/sub.html. To download
the free image, click IBM Service and Support Site.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
ServerGuide features
Features and functions can vary slightly with different versions of the ServerGuide
program. To learn more about the version that you have, start the ServerGuide Setup
and Installation CD and view the online overview. Not all features are supported on
all server models.
500 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Setup and configuration overview
When you use the ServerGuide Setup and Installation CD, you do not need setup
diskettes. You can use the CD to configure any supported IBM server model. The
setup program provides a list of tasks that are required to set up your server
model. On a server with a ServeRAID adapter or integrated SCSI controller with
RAID capabilities, you can run the SCSI RAID configuration program to create
logical drives.
Note: Features and functions can vary slightly with different versions of the
ServerGuide program.
When you start the ServerGuide Setup and Installation CD, the program prompts you
to complete the following tasks:
v Select your language.
v Select your keyboard layout and country.
v View the overview to learn about ServerGuide features.
v View the readme file to review installation tips for your operating system and
adapter.
v Start the operating-system installation. You will need your operating-system CD.
Note: Features and functions can vary slightly with different versions of the
ServerGuide program.
1. After you have completed the setup process, the operating-system installation
program starts. (You will need your operating-system CD to complete the
installation.)
2. The ServerGuide program stores information about the server model, service
processor, hard disk drive controllers, and network adapters. Then, the
program checks the CD for newer device drivers. This information is stored
and then passed to the operating-system installation program.
3. The ServerGuide program presents operating-system partition options that are
based on your operating-system selection and the installed hard disk drives.
4. The ServerGuide program prompts you to insert your operating-system CD
and restart the server. At this point, the installation program for the operating
system takes control to complete the installation.
If you have already configured the server hardware and you are not using the
ServerGuide program to install your operating system, complete the following
steps to download the latest operating-system installation instructions from the
IBM Web site.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. From the menu on the left side of the page, click System x support search.
4. From the Task menu, select Install.
5. From the Product family menu, select System x3650 M3.
6. From the Operating system menu, select your operating system, and then click
Search to display the available installation documents.
The IMM also provides the following remote server management capabilities
through the OSA SMBridge management utility program:
v Command-line interface (IPMI Shell)
The command-line interface provides direct access to server management
functions through the IPMI 2.0 protocol. Use the command-line interface to issue
commands to control the server power, view system information, and identify
the server. You can also save one or more commands as a text file and run the
file as a script.
v Serial over LAN
Establish a Serial over LAN (SOL) connection to manage servers from a remote
location. You can remotely view and change the UEFI settings, restart the server,
identify the server, and perform other management functions. Any standard
Telnet client application can access the SOL connection.
To start using the embedded hypervisor functions, you must add the USB memory
key to the startup sequence in the Setup utility.
To add the USB hypervisor memory key to the boot order, complete the following
steps:
Results
If the embedded hypervisor image becomes corrupt, you can use the VMware
Recovery CD that comes with the server to recover the image. To recover the flash
device image, complete the following steps:
1. Turn on the server.
For additional information and instructions, see the VMware ESXi Server 3i
Embedded Setup Guide at http://www.vmware.com/pdf/vi3_35/esx_3i_e/r35/
vi3_35_25_3i_setup.pdf/.
The virtual media key has an LED. When this LED is lit and green, it indicates that
the key is installed and functioning correctly.
504 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
v Mapping the CD or DVD drive, diskette drive, and USB flash drive on a remote
client, and mapping ISO and diskette image files as virtual drives that are
available for use by the server
v Uploading a diskette image to the IMM memory and mapping it to the server as
a virtual drive
The blue-screen capture feature captures the video display contents before the IMM
restarts the server when the IMM detects an operating-system hang condition. A
system administrator can use the blue-screen capture to assist in determining the
cause of the hang condition.
Procedure
1. Install the virtual media key into the dedicated slot on the system board (see
“Replacing an IBM virtual media key” on page 411).
2. Turn on the server.
Procedure
1. Turn on the server.
To log on to the Web interface to use the remote presence functions, complete the
following steps:
Procedure
1. Open a Web browser on a computer that connects to the server and in the
address or URL field, type the IP address or host name of the IMM to which
you want to connect.
Note:
a. If you are logging in to the IMM for the first time after installation, the
IMM defaults to DHCP. If a DHCP host is not available, the IMM uses the
default static IP address 192.168.70.125.
b. You can obtain the DHCP-assigned IP address or the static IP address from
the server UEFI or from your network administrator.
The Login page is displayed.
2. Type the user name and password. If you are using the IMM for the first time,
you can obtain the user name and password from your system administrator.
All login attempts are documented in the event log. A welcome page opens in
the browser.
Note: The IMM is set initially with a user name of USERID and password of
PASSW0RD (passw0rd with a zero, not the letter O). You have read/write
access. For enhanced security, change this default password during the initial
configuration.
3. On the Welcome page, type a timeout value (in minutes) in the field that is
provided. The IMM will log you off the Web interface if your browser is
inactive for the number of minutes that you entered for the timeout value.
4. Click Continue to start the session. The browser opens the System Status page,
which displays the server status and the server health summary.
506 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Configuring the Gigabit Ethernet controller
The Ethernet controllers are integrated on the system board. They provide an
interface for connecting to a 10 Mbps, 100 Mbps, or 1 Gbps network and provide
full-duplex (FDX) capability, which enables simultaneous transmission and
reception of data on the network. If the Ethernet ports in the server support
auto-negotiation, the controllers detect the data-transfer rate (10BASE-T,
100BASE-TX, or 1000BASE-T) and duplex mode (full-duplex or half-duplex) of the
network and automatically operate at that rate and mode.
You do not have to set any jumpers or configure the controllers. However, you
must install a device driver to enable the operating system to address the
controllers. To find updated information about configuring the controllers,
complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
Procedure
1. Go to http://www.ibm.com/systems/support/.
2. Under Product support, click System x.
3. Under Popular links, click Software and device drivers.
4. From the Product family menu, select System x3650 M3 and click Go.
The integrated SAS/SATA controller with RAID capabilities supports RAID arrays.
You can use the LSI Configuration Utility program to configure RAID 1 (IM),
RAID 1E (IME), and RAID 0 (IS) for a single pair of attached devices. If you install
a different type of RAID adapter, follow the instructions in the documentation that
comes with the adapter to view or change settings for attached devices.
When you are using the LSI Configuration Utility program to configure and
manage arrays, consider the following information:
v The integrated SAS/SATA controller with RAID capabilities supports the
following features:
– Integrated Mirroring (IM) with hot-spare support (also known as RAID 1)
Use this option to create an integrated array of two disks plus up to two
optional hot spares. All data on the primary disk can be migrated.
To start the LSI Configuration Utility program, complete the following steps:
Procedure
1. Turn on the server.
Results
When you have finished changing settings, press Esc to exit from the program;
select Save to save the settings that you have changed.
508 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Formatting a hard disk drive
Low-level formatting removes all data from the hard disk. If there is data on the
disk that you want to save, back up the hard disk before you perform this
procedure.
Note: Before you format a hard disk, make sure that the disk is not part of a
mirrored pair.
To create a RAID array of hard disk drives, complete the following steps:
1. From the list of adapters, select the controller (channel) for which you want to
create an array.
2. Select RAID Properties.
3. Select the type of array that you want to create.
4. In the RAID Disk column, use the Spacebar or Minus (-) key to select [Yes]
(select) or [No] (deselect) to select or deselect a drive from a RAID disk.
5. Continue to select drives, using the Spacebar or Minus (-) key, until you have
selected all the drives for your array.
6. Press C to create the disk array.
7. Select Save changes then exit this menu to create the array.
8. Exit the Setup utility.
You can also use the ASU program to configure the optional remote presence
features or other IMM settings. The remote presence features provide enhanced
systems-management capabilities.
In addition, the ASU program provides limited settings for configuring the IPMI
function in the IMM through the command-line interface.
Use the command-line interface to issue setup commands. You can save any of the
settings as a file and run the file as a script. The ASU program supports scripting
environments through a batch-processing mode.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
To locate and install a newer version of IBM Systems Director, complete the
following steps:
Procedure
1. Check for the latest version of IBM Systems Director:
a. Go to http://www.ibm.com/systems/management/director/
downloads.html.
b. If a newer version of IBM Systems Director than what comes with the
server is shown in the drop-down list, follow the instructions on the Web
page to download the latest version.
2. Install the IBM Systems Director program.
Results
510 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
If your management server is not connected to the Internet, to locate and install
updates and interim fixes, complete the following steps:
1. Make sure that you have run the Discovery and Inventory collection tasks.
2. On a system that is connected to the Internet, go to http://www.ibm.com/
support/fixcentral/.
3. From the Product family list, select IBM Systems Director.
4. From the Product list, select IBM Systems Director.
5. From the Installed version list, select the latest version, and click Continue.
6. Download the available updates.
7. Copy the downloaded files to the management server.
8. On the management server, on the Welcome page of the IBM Systems Director
Web interface, click the Manage tab, and click Update Manager.
9. Click Import updates and specify the location of the downloaded files that
you copied to the management server.
10. Return to the Welcome page of the Web interface, and click View updates.
11. Select the updates that you want to install, and click Install to start the
installation wizard.
To download the ASU and update the UUID, complete the following steps.:
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
Procedure
1. Download the Advanced Settings Utility (ASU):
a. Go to http://www.ibm.com/systems/support/.
b. Under Product support, select System x.
c. Under Popular links, select Tools and utilities.
d. In the left pane, click System x and BladeCenter Tools Center.
e. Scroll down and click Tools reference.
f. Scroll down and click the plus-sign (+) for Configuration tools to expand the
list; then, select Advanced Settings Utility (ASU).
g. In the next window under Related Information, click the Advanced Settings
Utility link and download the ASU version for your operating system.
2. ASU sets the UUID in the Integrated Management Module (IMM). Select one of
the following methods to access the Integrated Management Module (IMM) to
set the UUID:
v Online from the target system (LAN or keyboard console style (KCS) access)
v Remote access to the target system (LAN based)
Note: If you do not specify any of these parameters, ASU will use the
default values. When the default values are used and ASU is unable to access
the IMM using the online authenticated LAN access method, ASU will
automatically use the unauthenticated KCS access method.
The following commands are examples of using the userid and password
default values and not using the default values:
Example that does not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SYsInfoUUID <uuid_value> user <user_id>
password <password>
Example that does use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value>
v Online KCS access (unauthenticated and user restricted):
You do not need to specify a value for access_method when you use this
access method.
Example:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value>
512 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
The KCS access method uses the IPMI/KCS interface. This method requires
that the IPMI driver be installed. Some operating systems have the IPMI
driver installed by default. ASU provides the corresponding mapping layer.
See the Advanced Settings Utility Users Guide for more details. You can access
the ASU Users Guide from the IBM Web site.
Note: Changes are made periodically to the IBM Web site. The actual
procedure might vary slightly from what is described in this document.
a. Go to http://www.ibm.com/systems/support/.
b. Under Product support, select System x.
c. Under Popular links, select Tools and utilities.
d. In the left pane, click System x and BladeCenter Tools Center.
e. Scroll down and click Tools reference.
f. Scroll down and click the plus-sign (+) for Configuration tools to expand
the list; then, select Advanced Settings Utility (ASU).
g. In the next window under Related Information, click the Advanced
Settings Utility link.
v Remote LAN access, type the command:
Note: When using the remote LAN access method to access IMM using the
LAN from a client, the host and the imm_external_ip address are required
parameters.
host <imm_external_ip> [user <imm_user_id>[[password <imm_password>]
Where:
imm_external_ip
The external IMM LAN IP address. There is no default value. This
parameter is required.
imm_user_id
The IMM account (1 of 12 accounts). The default value is USERID.
imm_password
The IMM account password (1 of 12 accounts). The default value is
PASSW0RD (with a zero 0 not an O).
The following commands are examples of using the userid and password
default values and not using the default values:
Example that does not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SYsInfoUUID <uuid_value> host <imm_ip>
user <user_id> password <password>
Example that does use the userid and password default values:
asu set SYSTEM_PROD_DATA.SysInfoUUID <uuid_value> host <imm_ip>
v Bootable media:
You can also build a bootable media using the applications available through
the Tools Center Web site at http://publib.boulder.ibm.com/infocenter/
toolsctr/v1r0/index.jsp. From the left pane, click IBM System x and
BladeCenter Tools Center, then click Tool reference for the available tools.
5. Restart the server.
Make sure that you download the version for your operating system. You can
download the ASU from the IBM Web site. To download the ASU and update the
DMI, complete the following steps.
Note: Changes are made periodically to the IBM Web site. The actual procedure
might vary slightly from what is described in this document.
1. Download the Advanced Settings Utility (ASU):
a. Go to http://www.ibm.com/systems/support/.
b. Under Product support, select System x.
c. Under Popular links, select Tools and utilities.
d. In the left pane, click System x and BladeCenter Tools Center.
e. Scroll down and click Tools reference.
f. Scroll down and click the plus-sign (+) for Configuration tools to expand the
list; then, select Advanced Settings Utility (ASU).
g. In the next window under Related Information, click the Advanced Settings
Utility link and download the ASU version for your operating system.
2. ASU sets the DMI in the Integrated Management Module (IMM). Select one of
the following methods to access the Integrated Management Module (IMM) to
set the DMI:
v Online from the target system (LAN or keyboard console style (KCS) access)
v Remote access to the target system (LAN based)
v Bootable media containing ASU (LAN or KCS, depending upon the bootable
media)
3. Copy and unpack the ASU package, which also includes other required files, to
the server. Make sure that you unpack the ASU and the required files to the
same directory. In addition to the application executable (asu or asu64), the
following files are required:
v For Windows based operating systems:
– ibm_rndis_server_os.inf
– device.cat
v For Linux based operating systems:
– cdc_interface.sh
4. After you install ASU, type the following commands to set the DMI:
514 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
<system model>
The system model. Type system yyyyyyy, where yyyyyyy is the product
identifier such as x3650M3.
<s/n> The serial number on the server. Type sn zzzzzzz, where zzzzzzz is the
serial number.
<asset_method>
The server asset tag number. Type asset
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa, where
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa is the asset tag number.
[access_method]
The access method that you select to use from the following methods:
v Online authenticated LAN access, type the command:
[host <imm_internal_ip>] [user <imm_user_id>][password <imm_password>]
Where:
imm_internal_ip
The IMM internal LAN/USB IP address. The default value is
169.254.95.118.
imm_user_id
The IMM account (1 of 12 accounts). The default value is USERID.
imm_password
The IMM account password (1 of 12 accounts). The default value is
PASSW0RD (with a zero 0 not an O).
Note: If you do not specify any of these parameters, ASU will use the
default values. When the default values are used and ASU is unable to access
the IMM using the online authenticated LAN access method, ASU will
automatically use the following unauthenticated KCS access method.
The following commands are examples of using the userid and password
default values and not using the default values:
Examples that do not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SYsInfoProdName <m/t_model> user
<imm_user_id> password <imm_password>
asu set SYSTEM_PROD_DATA.SYsInfoProdIdentifier <system model> user
<imm_user_id> password <imm_password>
asu set SYSTEM_PROD_DATA.SYsInfoSerialNum <s/n> user
<imm_user_id> password <imm_password>
asu set SYSTEM_PROD_DATA.SYsEncloseAssetTag <asset_tag> user
<imm_user_id> password <imm_password>
Examples that do not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SYsInfoProdName <m/t_model>
asu set SYSTEM_PROD_DATA.SYsInfoProdIdentifier <system model>
asu set SYSTEM_PROD_DATA.SYsInfoSerialNum <s/n>
asu set SYSTEM_PROD_DATA.SYsEncloseAssetTag <asset_tag>
v Remote LAN access, type the command:
Note: When using the remote LAN access method to access IMM using the
LAN from a client, the host and the imm_external_ip address are required
parameters.
host <imm_external_ip> [user <imm_user_id>[[password <imm_password>]
Where:
imm_external_ip
The external IMM LAN IP address. There is no default value. This
parameter is required.
imm_user_id
The IMM account (1 of 12 accounts). The default value is USERID.
imm_password
The IMM account password (1 of 12 accounts). The default value is
PASSW0RD (with a zero 0 not an O).
The following commands are examples of using the userid and password
default values and not using the default values:
Examples that do not use the userid and password default values:
asu set SYSTEM_PROD_DATA.SYsInfoProdName <m/t_model> host <imm_ip>
user <imm_user_id> password <imm_password>
asu set SYSTEM_PROD_DATA.SYsInfoProdIdentifier <system model> host <imm_ip>
user <imm_user_id> password <imm_password>
asu set SYSTEM_PROD_DATA.SYsInfoSerialNum <s/n> host <imm_ip>
user <imm_user_id> password <imm_password>
asu set SYSTEM_PROD_DATA.SYsEncloseAssetTag <asset_tag> host <imm_ip>
user <imm_user_id> password <imm_password>
516 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Appendix A. Getting help and technical assistance
If you need help, service, or technical assistance or just want more information
about IBM products, you will find a wide variety of sources available from IBM to
assist you.
Use this information to obtain additional information about IBM and IBM
products, determine what to do if you experience a problem with your IBM system
or optional device, and determine whom to call for service, if it is necessary.
If you believe that you require IBM to perform warranty service on your IBM
product, the IBM service technicians will be able to assist you more efficiently if
you prepare before you call.
v Check all cables to make sure that they are connected.
v Check the power switches to make sure that the system and any optional
devices are turned on.
v Check for updated software, firmware, and operating-system device drivers for
your IBM product. The IBM Warranty terms and conditions state that you, the
owner of the IBM product, are responsible for maintaining and updating all
software and firmware for the product (unless it is covered by an additional
maintenance contract). Your IBM service technician will request that you
upgrade your software and firmware if the problem has a documented solution
within a software upgrade.
v If you have installed new hardware or software in your environment, check
http://www.ibm.com/systems/info/x86servers/serverproven/compat/us/ to
make sure that the hardware and software is supported by your IBM product.
v Go to http://www.ibm.com/supportportal to check for information to help you
solve the problem.
v Gather the following information to provide to IBM Support. This data will help
IBM Support quickly provide a solution to your problem and ensure that you
receive the level of service for which you might have contracted.
– Hardware and Software Maintenance agreement contract numbers, if
applicable
– Machine type number (IBM 4-digit machine identifier)
– Model number
– Serial number
– Current system UEFI and firmware levels
– Other pertinent information such as error messages and logs
v Go to http://www.ibm.com/support/entry/portal/Open_service_request to
submit an Electronic Service Request. Submitting an Electronic Service Request
will start the process of determining a solution to your problem by making the
pertinent information available to IBM Support quickly and efficiently. IBM
service technicians can start working on your solution as soon as you have
completed and submitted an Electronic Service Request.
On the World Wide Web, up-to-date information about IBM systems, optional
devices, services, and support is available at http://www.ibm.com/supportportal.
IBM System x information is at http://www.ibm.com/systems/x/. IBM
BladeCenter information is at http://www.ibm.com/systems/bladecenter. IBM
IntelliStation information is at http://www.ibm.com/systems/intellistation.
Before you send diagnostic data to IBM, read the terms of use at
http://www.ibm.com/de/support/ecurep/terms.html.
You can use any of the following methods to send diagnostic data to IBM:
v Standard upload: http://www.ibm.com/de/support/ecurep/send_http.html
v Standard upload with the system serial number: http://www.ecurep.ibm.com/
app/upload_hw
v Secure upload: http://www.ibm.com/de/support/ecurep/
send_http.html#secure
v Secure upload with the system serial number: https://www.ecurep.ibm.com/
app/upload_hw
518 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Creating a personalized support web page
You can create a personalized support web page by identifying IBM products that
are of interest to you.
For more information about Support Line and other IBM services, see
http://www.ibm.com/services or see http://www.ibm.com/planetwide for
support telephone numbers. In the U.S. and Canada, call 1-800-IBM-SERV
(1-800-426-7378).
In the U.S. and Canada, hardware service and support is available 24 hours a day,
7 days a week. In the U.K., these services are available Monday through Friday,
from 9 a.m. to 6 p.m.
IBM may not offer the products, services, or features discussed in this document in
other countries. Consult your local IBM representative for information on the
products and services currently available in your area. Any reference to an IBM
product, program, or service is not intended to state or imply that only that IBM
product, program, or service may be used. Any functionally equivalent product,
program, or service that does not infringe any IBM intellectual property right may
be used instead. However, it is the user's responsibility to evaluate and verify the
operation of any non-IBM product, program, or service.
IBM may have patents or pending patent applications covering subject matter
described in this document. The furnishing of this document does not give you
any license to these patents. You can send license inquiries, in writing, to:
IBM Director of Licensing
IBM Corporation
North Castle Drive
Armonk, NY 10504-1785
U.S.A.
IBM may use or distribute any of the information you supply in any way it
believes appropriate without incurring any obligation to you.
Intel, Intel Xeon, Itanium, and Pentium are trademarks or registered trademarks of
Intel Corporation or its subsidiaries in the United States and other countries.
Java and all Java-based trademarks and logos are trademarks or registered
trademarks of Oracle and/or its affiliates.
UNIX is a registered trademark of The Open Group in the United States and other
countries.
Important notes
Processor speed indicates the internal clock speed of the microprocessor; other
factors also affect application performance.
CD or DVD drive speed is the variable read rate. Actual speeds vary and are often
less than the possible maximum.
When referring to processor storage, real and virtual storage, or channel volume,
KB stands for 1024 bytes, MB stands for 1,048,576 bytes, and GB stands for
1,073,741,824 bytes.
Maximum internal hard disk drive capacities assume the replacement of any
standard hard disk drives and population of all hard disk drive bays with the
largest currently supported drives that are available from IBM.
Each solid-state memory cell has an intrinsic, finite number of write cycles that the
cell can incur. Therefore, a solid-state device has a maximum number of write
cycles that it can be subjected to, expressed as total bytes written (TBW). A
522 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
device that has exceeded this limit might fail to respond to system-generated
commands or might be incapable of being written to. IBM is not responsible for
replacement of a device that has exceeded its maximum guaranteed number of
program/erase cycles, as documented in the Official Published Specifications for
the device.
Some software might differ from its retail version (if available) and might not
include user manuals or all program functionality.
Particulate contamination
Attention: Airborne particulates (including metal flakes or particles) and reactive
gases acting alone or in combination with other environmental factors such as
humidity or temperature might pose a risk to the device that is described in this
document.
Documentation format
The publications for this product are in Adobe Portable Document Format (PDF)
and should be compliant with accessibility standards. If you experience difficulties
when you use the PDF files and want to request a web-based format or accessible
PDF document for a publication, direct your mail to the following address:
Information Development
IBM Corporation
205/A015
3039 E. Cornwallis Road
P.O. Box 12195
Research Triangle Park, North Carolina 27709-2195
U.S.A.
In the request, be sure to include the publication part number and title.
When you send information to IBM, you grant IBM a nonexclusive right to use or
distribute the information in any way it believes appropriate without incurring any
obligation to you.
524 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Electronic emission notices
When you attach a monitor to the equipment, you must use the designated
monitor cable and any interference suppression devices that are supplied with the
monitor.
Properly shielded and grounded cables and connectors must be used in order to
meet FCC emission limits. IBM is not responsible for any radio or television
interference caused by using other than recommended cables and connectors or by
unauthorized changes or modifications to this equipment. Unauthorized changes
or modifications could void the user's authority to operate the equipment.
This device complies with Part 15 of the FCC Rules. Operation is subject to the
following two conditions: (1) this device may not cause harmful interference, and
(2) this device must accept any interference received, including interference that
might cause undesired operation.
Responsible manufacturer:
526 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Zulassungsbescheinigung laut dem Deutschen Gesetz über die
elektromagnetische Verträglichkeit von Geräten (EMVG) (bzw. der
EMC EG Richtlinie 2004/108/EG) für Geräte der Klasse A
Dieses Gerät ist berechtigt, in Übereinstimmung mit dem Deutschen EMVG das
EG-Konformitätszeichen - CE - zu führen.
Verantwortlich für die Einhaltung der EMV Vorschriften ist der Hersteller:
Generelle Informationen:
Das Gerät erfüllt die Schutzanforderungen nach EN 55024 und EN 55022 Klasse
A.
This is a Class A product based on the standard of the Voluntary Control Council
for Interference (VCCI). If this equipment is used in a domestic environment, radio
interference may occur, in which case the user may be required to take corrective
actions.
This is electromagnetic wave compatibility equipment for business (Type A). Sellers
and users need to pay attention to it. This is for any areas other than home.
528 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Taiwan Class A compliance statement
532 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
L microprocessor
applying thermal grease 477
POST (continued)
error codes 274
LED errors heat sink 477 event log 28
dc power supply 367 problems 349 Event Viewer 349
LEDs 11, 17 removing 474 power
ac power 14 replacing 477 supply 7
enclosure manager heartbeat 23 specifications 7 power cords 391
Ethernet activity 12, 14 mirroring mode 453 power problems 353, 376
Ethernet icon LED 12 monitor problems 349 power supply
Ethernet link 14 mouse problems 346 installing 460
Ethernet-link status 12 LED errors 367
IMM heartbeat 23 operating requirements 460
IN OK power 14
information 12 N ServerProven 460
power-control button 12
LEDs New Zealand Class A statement 525
power-cord connector 14
Ethernet icon 12 NMI button 13
power-on LED
light path diagnostics 363 notes 6
front 12
locator 12, 14 notes, important 522
rear 14
OUT OK power 14 notices 521
power-on password
power supply 367 electronic emission 525
setting 494
power-channel error 376 FCC, Class A 525
power-on password override switch 18
power-on 12, 14 notices and statements 6
problem determination tips 379
riser-card assembly 26 Nx boot failure 375
problem isolation tables 340
system board 22 problems
system error 12 configuration
system pulse 23 O minimum 378
system-error 14 obtaining IP address for Web DVD-ROM drive 341
LEDs and controls interface 505 Ethernet controller 377
front view 11 online documentation 5 intermittent 345
on light path diagnostics panel 13 online-spare mode 455 keyboard 346
operator information panel 12 operating system installation memory 347
rear view 14 with ServerGuide 501 microprocessor 349
legacy operating system without ServerGuide 502 minimum configuration 378
requirement 501 operator information panel monitor 349
Licenses and Attributions Documents 5 CD/DVD-eject button 11 optional devices 352
light path diagnostics operator information panel assembly, POST 274
description 360 replacing 466 power 353, 376
LEDs 363 optional device connectors serial port 357
panel 360 on the system board 24 ServerGuide 358
light path diagnostics panel optional device problems 352 software 359
accessing 13 ordering consumable parts 381 undetermined 378
Linux license agreement 5 OUT OK LED 367 USB port 359
locator LED 12, 14 OUT OK power LED 14 video 349, 360
logs product service, IBM Taiwan 519
event 28 publications 5
system event message 31
viewing test 288 P
LSI Configuration Utility particulate contamination 7, 523
overview 507 parts listing 381 R
starting 508 password RAID array, creating 509
administrator 499 recovering server firmware 371
power-on 498 recovery CDs 390
M password, power-on
switch on system board 498
recovery, automatic boot failure
(ABR) 375
memory PCI remind button 13, 363
two-DIMM-per-channel (2DPC) 450 expansion slots 7 remote presence feature
memory mirroring PCI adapter enabling 505
description 453 installing 418 using 504
DIMM population sequence 453, 455 removing 417 removing
memory module PCI riser card 240 VA safety cover 488
removing 448 installing 415 battery 463
specifications 7 removing 414 bezel 468
memory online-spare People's Republic of China Class A CD-RW/DVD drive 444
description 455 electronic emission statement 528 cover 402
memory problems 347 pointing device problems 346 DIMM 448
menu choices in Setup utility 494 port connectors 18 DIMM air baffle 406
messages, diagnostic 287 POST Ethernet adapter 420
description 274 fan 457
Index 533
removing (continued) SAS system board (continued)
fan bracket 408 riser-card and controller assembly, connectors (continued)
hard disk drive 440, 442 installing 426 internal 17
heat sink 474 riser-card and controller assembly, installing 486
heat-sink retention module 483 removing 424 LEDs 22
microprocessor 474 SAS connector, internal 17 power-on password switch 498
microprocessor air baffle 404 SAS hard disk drive backplane removing 484
operator information panel installing 471 switch block 18
assembly 466 SAS hard disk drive backplate system board optional devices
PCI adapter 417 installing 473 connectors 24
PCI riser card 414 sending diagnostic data to IBM 518 system event message log 31
SAS hard disk drive backplane 469 serial connector 14 system pulse LEDs 23
SAS hard disk drive backplate 472 serial port problems 357 system reliability guidelines 397
SAS riser-card and controller server configuration, updating 492 system-error LED
assembly 424 Server controls 11 front 12
server components 395 Server controls, LEDs, and power 11 rear 14
ServeRAID adapter advanced feature server firmware, recovering 371 system-event log 28
key 433 server firmware, starting backup 499 system-locator LED 12, 14
ServeRAID SAS controller 430 server replaceable units 381 Systems Director, updating 510
ServeRAID SAS controller ServeRAID adapter advanced feature key
battery 436 hypervisor 433
system board 484
tape drive 446
installing 434
ServeRAID SAS controller
T
Taiwan Class A electronic emission
USB hypervisor memory key 412 installing 431
statement 529
virtual media key 410 removing 430
tape alert flags 370
removing and replacing ServeRAID SAS controller battery
tape drive
FRUs 474 installing 438
installing 447
replacement parts 381 removing 436
removing 446
replacing ServerGuide
telecommunication regulatory
battery 465 features 500
statement 524
CD-RW/DVD drive 445 problems 358
telephone numbers 519
cover 403 using 500
temperature 7
DIMM air baffle 407 using to install operating system 501
test log, viewing 288
Ethernet adapter 421 service and support
thermal grease 482
fan bracket 409 before you call 517
thermal material, heat sink 477
hard disk drive 440 hardware 519
Tier 1 parts, removing and replacing 402
microprocessor 477 software 519
tools, diagnostic 27
microprocessor air baffle 405 servicing electrical equipment x
trademarks 522
operator information panel setup and configuration with
trained service technicians, guidelines ix
assembly 467 ServerGuide 501
troubleshooting tables 340
PCI adapter 418 Setup utility 29
two-DIMM-per-channel (2DPC)
PCI riser card 415 menu choices 494
requirements 450
SAS hard disk drive backplane 471 starting 494
SAS hard disk drive backplate 473 using 493
server components 395 simple-swap
ServeRAID SAS controller 431 hard disk drive 442 U
simple-swap hard disk drive 443 size 7 undetermined problems 378
tape drive 447 software problems 359 undocumented problems 3
USB hypervisor memory key 413 software service and support telephone United States FCC Class A notice 525
virtual media key 411 numbers 519 Universal Serial Bus (USB) problems 359
reset button 13 specifications 7 universal unique identifier,
returning components 398 starting updating 511
riser card backup server firmware 499 unsafe conditions, inspecting for ix
installing 415 LSI Configuration Utility 508 updating
removing 414 Setup utility 494 DMI/SMBIOS 514
riser-card assembly statements and notices 6 firmware 491
LEDs 26 static-sensitive devices, handling 398 IBM Systems Director 510
location 417 support web page, custom 519 server configuration 492
Russia Class A electronic emission switch universal unique identifier 511
statement 528 functions 18 USB connector 11, 14
power-on password override 18 USB hypervisor memory key
system board location 18 installing 413
S switch block
system board 18
removing 412
using 503
safety vii
system board using
safety statements vii, xi
connectors 17 boot selection menu program 499
external port 18 embedded hypervisor 503
534 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
using (continued)
LSI Configuration Utility 507
remote presence feature 504
ServerGuide 500
Setup utility 493
V
video
adapter 418
problems 349
video connector
front 11
rear 14
viewing event logs 29
Setup utility 29
virtual media key
installing 411
removing 410
W
Web interface
logging on to 506
obtaining IP address 505
web site
ServerGuide 500
weight 7
Index 535
536 System x3650 M3 Types 4255, 7945, and 7949: Problem Determination and Service Guide
Printed in USA