Specification | Value |
---|---|
GPU Memory | 512 GB total (32 GB per GPU) |
RAM | 1.5 TB |
GPU | 16 x NVIDIA Tesla V100 |
CPU | 2 x Intel Xeon Platinum 8168 |
Storage | 30 TB SSD |
Interconnect | NVSwitch |
Form Factor | Server |
Operating System | Ubuntu Server |

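
The GPU memory total in the table follows directly from the GPU count and the per-GPU figure. A minimal sketch of that arithmetic in Python, using only the values listed above; the function and constant names are illustrative and do not come from any NVIDIA tool:

```python
# Values taken from the specification table; names are hypothetical.
GPU_COUNT = 16          # 16 x NVIDIA Tesla V100
MEMORY_PER_GPU_GB = 32  # 32 GB per GPU


def total_gpu_memory_gb(gpu_count: int, memory_per_gpu_gb: int) -> int:
    """Return the aggregate GPU memory in GB across all GPUs."""
    return gpu_count * memory_per_gpu_gb


if __name__ == "__main__":
    # 16 GPUs x 32 GB each gives the 512 GB total listed in the table.
    print(total_gpu_memory_gb(GPU_COUNT, MEMORY_PER_GPU_GB))
```
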
Provides an overview of the DGX-2 System structure and content.
Details the major components and specifications of the DGX-2 System.
Lists additional components not shown in the main exploded view.
Details the physical dimensions and form factor of the DGX-2 System.
Outlines the power requirements and supply details.
Information on operating temperature, humidity, and airflow.
Identifies and explains the front panel interfaces and buttons.
Details the rear panel interfaces and connectors.
Describes ports and controls on the motherboard tray.
Describes the available network ports and their purposes.
Explains how to use specific ports to connect external storage.
Outlines the operating system and included software packages.
Lists other NVIDIA documentation for DGX systems.
Provides contact information for NVIDIA Enterprise Support.
Instructions for connecting a display and keyboard directly.
Steps for accessing the console via the Baseboard Management Controller.
How to establish an SSH connection for console access.
Basic requirements and steps for initial system setup.
Overview of the installation and configuration process.
Guide to setting up an NGC account for GPU-accelerated tools.
Steps to obtain an NGC API key and select container tags for the examples.
Procedures to check system health and software installation.
Recommendations for securing BMC management access.
Steps to configure proxy settings for OS and applications.
How to set up distinct subnets for Docker containers.
Required firewall ports for DGX-2 System access.
Network URLs and verification for NGC container access.
Methods to set a static IP address for the BMC.
Steps to configure static IP addresses for network interfaces.
How to reconfigure network ports between InfiniBand and Ethernet.
Verifying network connectivity for OS software updates.
Step-by-step guide for updating the DGX OS software.
Best practices to prevent firmware corruption during updates.
How to get the container for firmware updates.
How to display firmware components qualified by NVIDIA.
Checking current firmware versions against the manifest.
Procedures for updating system and BMC firmware.
Explains options like forcing firmware updates.
A quick reference for firmware update commands.
Instructions for removing firmware update containers.
Steps to access the Baseboard Management Controller.
Explains the primary controls available on the BMC dashboard.
Introduction to NVIDIA KVM for GPU multi-tenancy.
Steps to install DGX KVM software and prepare the system.
Guide to creating and managing virtual machines.
Procedures for managing the lifecycle of virtual machines.
Methods for establishing connections to virtual machines.
Instructions for installing and managing KVM images.
Explains how OS and data drives are generated for VMs.
How to update DGX OS software for host and guest VMs.
Details on resource allocations and management in KVM.
Notes on security policies and firewall settings for KVM.
Using VMs when GPUs may have failed.
Steps to restart VMs after system or VM failures.
How to return the system to normal operation after GPU failures.
Methods for updating software on isolated systems.
Procedures for re-imaging isolated DGX-2 systems.
Setting up private repositories for software updates.
Method for installing containers from NGC on air-gapped systems.
General safety guidelines for handling the server.
Explains safety symbols and associated hazards.
Specifies the product's intended use as Information Technology Equipment (ITE).
Criteria for choosing a suitable installation site.
Guidelines for safe movement and lifting of equipment.
Warnings and precautions related to power and electrical safety.
Safety instructions for accessing the internal components.
Guidelines for safely installing the system in a rack.
Precautions to prevent damage from static electricity.
Information on chemical hazards and battery replacement.
FCC compliance information for the US.
cULus listing mark and US/Canada compliance.
Canadian ISED compliance and interference regulations.
European Conformity directives and Class A product notice.
VCCI compliance and Japan RoHS material content declaration.
ACMA EMC requirements for Class A equipment.
China RoHS material content declaration and hazardous substance limits.
SII compliance information for Israel.
Eurasian Customs Union compliance.
ICT compliance for Vietnam.
KATS compliance and Korea RoHS material content declaration.
BSMI compliance and Taiwan RoHS material content declaration.