You are concerned about potential security threats and unexpected downtime in your InfiniBand data center.
Which UFM platform uses analytics to detect security threats, operational issues, and predict network failures in InfiniBand data centers?
What is the purpose of configuring NVUE to ignore Linux files?
Which component of the Spectrum-X platform is responsible for reordering out-of-order packets?
Which tool would you use to gather telemetry data in a Spectrum-X network?
Which of the following NCCL environment variables enable SHARP aggregation with NCCL when using the NCCL-SHARP plugin?
Pick the 2 correct responses below.
In order to configure RoCE on a Cumulus switch, which command should be used?
In Cumulus Linux, which technology provides active-active redundancy to servers without the need for direct inter-switch links?
When upgrading Cumulus Linux to a new version, which configuration files should be migrated from the old installation?
Pick the 2 correct responses below.
You are optimizing a multi-node AI training cluster using InfiniBand networking and NVIDIA GPUs. You need to implement efficient collective communication operations across the nodes.
Which feature of NVIDIA Collective Communications Library (NCCL) allows for optimized performance in multi-subnet InfiniBand environments?
How does Spectrum-X achieve network isolation for multiple tenants?
A fabric administrator added new servers to a 40-port edge switch. The administrator now needs to gather and map the newly added ports’ LIDs and link speeds. Which of the following commands can be used for that purpose?
Which of the following statements are true about AI workloads and adaptive routing?
Pick the 2 correct responses below.
A financial services company is planning to implement an AI infrastructure to support real-time fraud detection and risk assessment. They need a solution that can handle both training and inference workloads while maintaining data privacy and security.
Which NVIDIA reference architecture component would be most appropriate to address the data privacy and security concerns in this AI networking setup?
A user has requested confirmation that the InfiniBand network is performing optimally and is not limiting the speed of a training run. To verify this, you would like to measure the RDMA throughput rate between two endpoints.
Which tool should be used?
In an AI cluster using NVIDIA GPUs, which configuration parameter in the NicClusterPolicy custom resource is crucial for enabling high-speed GPU-to-GPU communication across nodes?
What is the basic functionality of an IB Router?
You are tasked with troubleshooting a link flapping issue in an InfiniBand AI fabric. You would like to start troubleshooting from the physical layer.
What is the right NVIDIA tool to be used for this task?
A cloud service provider is deploying the NVIDIA Spectrum-X Ethernet platform in a multi-tenant environment. To ensure the security and isolation of each tenant's AI workload, the provider wants to implement a feature that prevents unauthorized access to the network.
Which of the following features of the Spectrum-X platform should the provider implement?
You are troubleshooting InfiniBand connectivity issues in a cluster managed by the NVIDIA Network Operator. You need to verify the status of the InfiniBand interfaces. Which command should you use to check the state and link layer of InfiniBand interfaces on a node?
Which service on Cumulus switches can monitor layer 1, layer 2, layer 3, tunnel, buffer, and ACL related issues?
You're troubleshooting a Spectrum-X network and notice that the System Status LED on a switch is blinking for more than 5 minutes. What is the most likely cause of this issue?