Loading

Sorry

Your web browser doesn't support some required capabilities.

This interactive simulation works best with the latest version of Chrome, Firefox, or Safari.

Sorry

An error occurred. Please visit Hands-on Labs Support

to get help or report this issue.

Sorry

Sorry

Unable to initialize the simulation player:

Please visit Hands-on Labs Support

to get help or report this issue.

X
↩ Return to the lab
HOL-1947-01: Using Bitfustion GPU Virtualization in vSphere

This is an interactive demo

Drive it with your mouse, your finger, or just use the arrow keys.

Use Learn mode to learn the demo. The orange boxes show where to click.

Use Present mode to hide the orange boxes and notes.

Click a Shortcut to jump to a specific part of the demo.

X

This part of the lab is presented as a Hands-on Labs Interactive Simulation. This will allow you to experience steps which are too time-consuming or resource intensive to do live in the lab environment. In this simulation, you can use the software interface as if you are interacting with a live environment.

The orange boxes show where to click, and the left and right arrow keys can also be used to move through the simulation in either direction.

In this lab there will be a lot of typing in a putty window.  Each command that needs to be type will be highlighted in blue.

This iSim module walks you through the steps of running a TensorFlow job on a VM without GPU  to use GPU on another VM. With Bitfusion FlexDirect resource scheduler running on a GPU server VM, client VM can run a TensorFlow job to use the GPU on the remote GPU server VM. The GPU VM is configured with NVIDIA P100 in Passthrough mode (for more information of configuring GPU in Passthrough mode, please refer to module 3: Using GPUs in Passthrough Mode)

Start Bitfusion FlexDirect Resource Scheduler on a GPU VM

  1. Notice bf-gpuvm-01 VM's IP address, 172.16.31.181
  2. Click on the Putty icon in the menu bar at the bottom of the screen
  3. Type the VM's IP address, 172.16.31.181 in the Host Name box
  4. Click on Open
  5. Log on to the VM with:
  6. Username = root
  7. Password = no password press Enter

Note: if the text stops, hit the tab key for command autocompletion

We will verify that the NVIDIA GPU is enabled on this VM

Type nvidia-smi and press Enter

We now need to start the FlexDirect Resource Scheduler on this VM

Type nohub flexdirect resource_scheduler &> /dev/null & and press Enter

  1. Click on X to close the putty window
  2. Click OK to end your putty session

Configure the CPU VM to point to the GPU VM

  1. Click on the scrolling bar, to find the bf-cpuvm-01 VM in the Bitfusion-CPU-VMs resource pool
  2. Click on bf-cpuvm-01
  3. Click on ACTIONS
  4. Click on Power
  5. Click on Power On
  6. Take note of this VM's IP address, 172.16.31.185
  7. Click on the Putty icon in the menu bar at the bottom of the screen
  8. Type the VM's IP address, 172.16.31.185 in the Host Name box
  9. Click on Open
  10. Log on to the VM with:
    • Username = root
    • Password = no password press Enter

Note: if the text stops, hit the tab key for command autocompletion

Verify that on the CPU VM, it can see the GPU on the remote GPU VM

Type nvidia-smi and press Enter, to show there is no NVIDIA controller installed on this VM

Type echo '172.16.31.181' > /etc/bitfusionio/servers.conf and press Enter, to add the GPU VM to the servers.conf file so it can be by the CPU VM

Type cat /etc/bitfusionio/servers.conf and press Enter, to verifies the CPU VM is listed in the server configuration

Run a TensorFlow ML Workload on the CPU VM

Type flexdirect list_gpus and press Enter, to confirm the CPU VM can see the GPU on the GPU VM

Type flexdirect run -n 1 nvidia-smi and press Enter, to verify the CPU VM can use the GPU on the CPU VM

Type flexdirect run -n 1 -- python /m1_data/nzhang/benchmarks/scripts/tf_cnn_benchmarks.py and press Enter, to execute the TensorFlow ML Workload

To return to the lab, click the link in the top right corner or close this browser tab.