I tried launching a GPU instance with NVIDIA Brev CLI

I tried launching a GPU instance with NVIDIA Brev CLI

2026.07.03

This page has been translated by machine translation. View original

Introduction

There are many things you want to try using a GPU, like local LLMs and image generation, right?
...But you can't just buy a dedicated machine right away, and shared machines are fully booked.
In that case, let's easily try out a GPU environment in the cloud!

This time, I'd like to get started with the NVIDIA Brev CLI.
Referencing the official Quickstart, I'll try creating a GPU instance and getting to the point where I can see the GPU with nvidia-smi.
https://docs.nvidia.com/brev/getting-started/quickstart

Environment

Item Details
OS macOS 26.5.1
Package Manager Homebrew
Brev CLI v0.6.328

Steps

1. Sign up for Brev

Create a Brev account from the following link.
https://brev.nvidia.com/

2. Install the Brev CLI

Install the Brev CLI with the following command.

brew install brevdev/homebrew-brev/brev

Let's check the version.

% brev --version                                   
Current Version: v0.6.328

Log in.

% brev login

   ▸     Starting Login

You can log in by entering the email address registered with your account.

3. Search for available GPU instance types

With Brev, you can search for available GPU instance types using the brev search command.
This time, since I want to use something with a short startup time and low cost, I'll specify the following options.

brev search --max-boot-time 3 --sort price

--max-boot-time 3 → Narrow down to instances with an estimated boot time of 3 minutes or less
--sort price → Sort the search results by price

Let's look at the search results.

% brev search --max-boot-time 3 --sort price       

 TYPE                             PROVIDER                 GPU         COUNT  VRAM/GPU  TOTAL VRAM  CAPABILITY  DISK   $/GB/MO  BOOT   FEATURES  VCPUS  $/HR   
 verda_V100                       verda:shadeform          V100            1  16 GB     16 GB       7.0         250GB  -        2m30s  -             6  $0.47  
 hyperstack_A6000                 hyperstack:shadeform     A6000           1  48 GB     48 GB       8.6         100GB  -        3m     -            28  $0.60  
 vultr_A16                        vultr:shadeform          A16             1  16 GB     16 GB       8.6         350GB  -        3m     -             6  $0.61  
 massedcompute_A6000_plus         massedcompute:shadeform  A6000           1  48 GB     48 GB       8.6         256GB  -        2m30s  -            12  $0.68  
 massedcompute_A6000              massedcompute:shadeform  A6000           1  48 GB     48 GB       8.6         256GB  -        3m     -             6  $0.68  
 excesssupply_RTX4090             excesssupply:shadeform   RTX4090         1  24 GB     24 GB       8.9         850GB  -        2m30s  -            12  $0.72  
 excesssupply_RTX5090             excesssupply:shadeform   RTX5090         1  32 GB     32 GB       10.0        900GB  -        2m     -            12  $0.78  
 massedcompute_L40S               massedcompute:shadeform  L40S            1  48 GB     48 GB       8.9         625GB  -        2m30s  -            12  $1.06  
 scaleway_L4                      scaleway:shadeform       L4              1  24 GB     24 GB       8.9         500GB  -        2m     -             8  $1.14  
 massedcompute_RTX6000Ada         massedcompute:shadeform  RTX6000Ada      1  48 GB     48 GB       8.9         350GB  -        2m30s  -            12  $1.16  
 hyperstack_A6000x2               hyperstack:shadeform     A6000           2  48 GB     96 GB       8.6         300GB  -        3m     -            60  $1.20  
 verda_A6000                      verda:shadeform          A6000           1  48 GB     48 GB       8.6         250GB  -        3m     -            10  $1.30  

Since I'm only confirming that the GPU instance starts up this time, I'd like to launch the low-cost, short-boot-time TYPE verda_V100.

4. Create and launch a GPU instance

The Quickstart introduces the % brev create my-instance --gpu "nebius.l40sx1.pcie" command, but the --gpu option was not available in the local Brev CLI v0.6.328.

% brev create my-instance --gpu "verda_V100"
unknown flag: --gpu

So, I'll check brev search --help and try using the --type option.

% brev create my-instance --type "verda_V100"      
Attempting to create 1 instance(s) with 1 parallel attempts
Instance types to try: verda_V100

Trying verda_V100 for 1 instance(s)...
[Worker 1] Trying verda_V100 for instance 'my-instance'...
[Worker 1] verda_V100 Success! Created instance 'my-instance'

Waiting for instance(s) to be ready...
You can safely ctrl+c to exit
  my-instance: Ready

Successfully created 1 instance(s)!

Instance: my-instance
  ID: nrn519qmd
  Type: verda_V100
  Connect:
    brev open my-instance
    brev shell my-instance

Although the BOOT column showed 2 minutes 30 seconds, it actually completed in about 3 minutes 30 seconds.

5. Connect to the GPU instance

Let's connect to the launched instance using the shell command.

% brev shell my-instance
⣟ waiting for SSH connection to be available Agent pid 18795
shadeform@brev-nrn519qmd:~$ nvidia-smi
Fri Jul  3 05:12:52 2026       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 580.126.09             Driver Version: 580.126.09     CUDA Version: 13.0     |
+-----------------------------------------+------------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id          Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |           Memory-Usage | GPU-Util  Compute M. |
|                                         |                        |               MIG M. |
|=========================================+========================+======================|
|   0  Tesla V100-SXM2-16GB           On  |   00000000:05:00.0 Off |                  Off |
| N/A   30C    P0             19W /  300W |       0MiB /  16384MiB |      0%      Default |
|                                         |                        |                  N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes:                                                                              |
|  GPU   GI   CI              PID   Type   Process name                        GPU Memory |
|        ID   ID                                                               Usage      |
|=========================================================================================|
|  No running processes found                                                             |
+-----------------------------------------------------------------------------------------+

I was able to connect!
Now that I've confirmed it, I'll disconnect using the exit command.

$ exit
logout
Shared connection to xxx.xxx.xxx.xxx closed.

You can use the ls command to check running instances.

% brev ls
You have 1 instances in Org classmethod
 NAME         STATUS   BUILD      SHELL  ID         MACHINE     GPU 
 my-instance  RUNNING  COMPLETED  READY  nrn519qmd  verda_V100  -  

6. Cleanup

Once an instance is created, you will be billed even if you are not connected to it.
When you are done with what you wanted to do, make sure to stop or delete it.
Since the instance I selected does not support brev stop, I will delete it.

% brev stop my-instance
1 error occurred:
	* instance "my-instance" does not support stop.

% brev delete my-instance
Deleting instance my-instance. This can take a few minutes. Run 'brev ls' to check status

The instance was deleted in about 3 minutes.

 % brev ls
No instances in org <ORG_NAME>

See teammates' instances:
	brev ls --all

Impressions

Since you can easily try out GPU instances, I think it's useful for research and verification before purchasing a device.
Next, I'd like to try connecting via SSH from VSCode and experimenting with the GPU environment.


製造業のクラウド活用とデジタル化を支援します

クラスメソッドの専門家による包括的なクラウド導入とデジタル化支援で、製造業の業務効率を最大化しましょう。AWSの導入から運用、最適化まで、最新技術と豊富な知見であらゆる課題に対応します。生産ラインのデジタル化やデータ活用、IoTの導入事例もございます。ぜひ、弊社の実績をご覧ください。

製造業界での支援内容を見る

Share this article