Fine-tune models like LLaMA 2
Optimize Transformers Models and LLMs through efficient processes, and accelerate the training of larger models with the cutting-edge Tensor Cores 4th generation technology and the latest 8-bit data format.
Accelerate your model training and inference with one of the most powerful AI chips on the market!
Accelerate your model serving workloads with the Transformer Engine, delivering up to 30x faster AI inference thanks to new data formats.
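To illustrate how the 8-bit FP8 data format (E4M3 variant) trades precision for speed, here is a minimal pure-Python sketch that rounds a float to the nearest E4M3-representable value. The function name and the simplified handling of edge cases are our own illustration, not part of any NVIDIA API; real Transformer Engine kernels do this in hardware with per-tensor scaling.

```python
import math

def quantize_e4m3(x):
    """Round x to the nearest value representable in FP8 E4M3.

    Illustrative sketch only: E4M3 has 1 sign bit, 4 exponent bits,
    3 mantissa bits, a max finite value of 448, and a min normal
    exponent of -6. NaN handling is omitted for brevity.
    """
    if x == 0:
        return 0.0
    sign = math.copysign(1.0, x)
    x = min(abs(x), 448.0)          # clamp to E4M3's max finite value
    m, e = math.frexp(x)            # x = m * 2**e with 0.5 <= m < 1
    if e - 1 < -6:
        # Subnormal range: fixed exponent -6, 3 fraction bits.
        step = 2.0 ** (-6 - 3)
        return sign * round(x / step) * step
    # Normal range: keep 1 implicit + 3 explicit mantissa bits,
    # i.e. round m (in [0.5, 1)) to a multiple of 1/16.
    q = round(m * 16) / 16
    return sign * math.ldexp(q, e)

print(quantize_e4m3(0.3))    # 0.3 is not representable; snaps to 0.3125
print(quantize_e4m3(500.0))  # overflows the format; clamps to 448.0
```

In practice the coarse mantissa is why FP8 training pairs the format with loss scaling, but the halved memory traffic versus FP16 is where the throughput gains come from.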
With 2nd-generation secure MIG (Multi-Instance GPU), partition the GPU into isolated, right-sized instances to maximize utilization, from the smallest jobs to the biggest multi-GPU workloads.
GPU: NVIDIA H100 PCIe Tensor Core
GPU memory: 80 GB HBM2e
Processor: 24 vCPUs, AMD EPYC Zen 4
Processor frequency: 2.7 GHz
Memory: 240 GB of RAM
Memory type: DDR5
Bandwidth: 10 Gbps
Storage: Block Storage for boot and 3 TB of scratch NVMe storage
Understands, interprets, and generates human language in a way that is both meaningful and contextually relevant.
Thanks to models and algorithms specialized in this domain.
Converts spoken language into written text, facilitating the translation of verbal communication into machine-readable data.
Thanks to models and algorithms specialized in this domain.
Generates new content such as images, text, audio, and code. It autonomously produces novel and coherent outputs, expanding the realm of AI-generated content beyond replication or prediction.
With models and algorithms specialized in this domain.
Enables machines to interpret and understand visual information from the world, much like human vision.
Thanks to models and algorithms specialized in this domain.
Predicts and suggests items of interest to users based on their preferences and behaviors, enhancing personalized recommendations and decision-making in various applications.
Using, for example, the Deep Learning Recommendation Model v2 (DLRMv2), which employs DCNv2 cross layers and a multi-hot dataset synthesized from the Criteo dataset.
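The DCNv2 cross layer mentioned above models explicit feature interactions: each layer computes x_next = x0 * (W @ x + b) + x, i.e. an element-wise product of the original input features with a learned linear map of the current layer, plus a residual connection. A minimal pure-Python sketch (variable and function names are illustrative, not taken from the DLRMv2 code base):

```python
def cross_layer(x0, xl, W, b):
    """One DCNv2 cross layer over plain Python lists.

    x0: original input feature vector (length n)
    xl: output of the previous layer (length n)
    W:  learned n x n weight matrix, b: learned bias (length n)
    Returns x0 * (W @ xl + b) + xl, element-wise.
    """
    n = len(x0)
    # Linear map of the current layer: W @ xl + b
    wx = [sum(W[i][j] * xl[j] for j in range(n)) + b[i] for i in range(n)]
    # Element-wise interaction with x0, plus the residual xl
    return [x0[i] * wx[i] + xl[i] for i in range(n)]

# Tiny demo: stack two cross layers over a 2-dimensional feature vector.
x0 = [2.0, 3.0]
x = x0
for W, b in [([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]),
             ([[0.5, 0.5], [0.5, 0.5]], [0.0, 0.0])]:
    x = cross_layer(x0, x, W, b)
```

Stacking L such layers captures feature crosses up to order L+1 at a cost that stays linear in the number of layers, which is why DLRMv2 uses them in place of a plain MLP for the interaction part of the model.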
Benefit from a ready-to-use Ubuntu image to launch your favorite deep learning containers (pre-installed NVIDIA driver and Docker environment).
Easily launch your favorite JupyterLab or notebook environment thanks to the pre-installed Docker environment.
Access multiple container registries: your own built containers, Scaleway AI containers, the NVIDIA NGC registry, or any other registry.
Access hundreds of AI software packages optimized by NVIDIA to maximize the efficiency of your GPUs and boost your productivity. Among the hundreds of packages developed by NVIDIA and tested by industry leaders, harness the efficiency of