NVIDIA DGX H100 Service Manual
10.2. Identifying the Failed DIMM
1. From the console, run the following nvsm command to identify memory alerts:
sudo nvsm show health
2. Determine the DIMM manufacturer:
sudo nvsm show memory
3. Request the replacement DIMM from NVIDIA Enterprise Support, specifying the manufacturer.
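The output of the two commands above can be long, so it may help to narrow it to memory-related lines before contacting support. The following is a minimal sketch, not part of nvsm: the helper name filter_memory_lines and the keywords it matches are assumptions, and the real output format of your nvsm version may call for different patterns.

```shell
#!/bin/sh
# Hypothetical helper (not provided by nvsm): reads text on standard input
# and keeps only lines that mention "dimm" or "memory", ignoring case.
# The keywords are an assumption about how alerts are worded; adjust them
# to match the output you actually see.
filter_memory_lines() {
    grep -i -E 'dimm|memory'
}

# Usage on the DGX console (commented out so the sketch has no side effects):
#   sudo nvsm show health | filter_memory_lines
#   sudo nvsm show memory | filter_memory_lines
```

If the filter prints nothing, review the full, unfiltered output rather than assuming there are no alerts, because the keywords may simply not match.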
10.3. Replacing the DIMM
1. Power off the system.
2. Remove the motherboard tray. Refer to Motherboard Tray - Removal and Installation for more
information.
3. Pull the motherboard out of the system and place it on a solid, flat surface, then remove the lid
and air baffles to expose the DIMMs.
4. Identify the failed DIMM on the motherboard. Use the label on the lid to identify the position of
the DIMM to be replaced. The names of the DIMMs also include the CPU numbering for easier
identification.