**NOTE:** SSH access requires that your deployment use an SSH-compatible image and that the provider supports SSH connections. Some Spheron-supported images with SSH access include:
**NOTE:** For examples of deployment configurations:
@@ -74,8 +75,6 @@ When configuring port exposure for your services, keep these points in mind:
     as: 80
     to:
       - global: true
-    accept:
-      - www.yoursite.com
 ```
 
 ### Port Range
@@ -439,6 +438,42 @@ gpu:
 </Callout>
 
 
+#### Configuring VRAM Requirements
+
+You can specify the minimum VRAM requirement using the `req_vram` attribute in your GPU configuration. Here's an example:
+
+```yaml
+profiles:
+  compute:
+    gpu-intensive:
+      resources:
+        cpu:
+          units: 4.0
+        memory:
+          size: 8Gi
+        storage:
+          size: 20Gi
+        gpu:
+          units: 1
+          attributes:
+            vendor:
+              nvidia:
+                - model: rtx3090
+                  req_vram: ">=80"
+```
+
+The `req_vram` attribute accepts a string value consisting of a comparison operator followed by the required VRAM amount as a percentage. The supported comparison operators are:
+
+- `>=` - Greater than or equal to
+- `>` - Greater than
+- `<=` - Less than or equal to
+- `<` - Less than
+
+Examples:
+- `req_vram: ">=80"` - Requires at least 80% of VRAM
+- `req_vram: ">50"` - Requires more than 50% of VRAM
+- `req_vram: "<=20"` - Requires 20% of VRAM or less
+- `req_vram: "<50"` - Requires less than 50% of VRAM
 
 ## 7. Private Container Registry Integration
 
@@ -629,9 +664,17 @@ attributes:
   region: us-east
 ```
 
+You can also specify multiple regions using a semicolon-separated list:
+
+```yaml
+attributes:
+  region: us-east;ap-south;eu-west
+```
+
 This is particularly useful when you need to:
 - Minimize latency for users in specific geographic areas
 - Ensure your application runs in a specific region
+- Deploy your workload across multiple regions for redundancy
 
 <Callout type="info">
 The region must match one of Spheron's [supported region codes](/rent-gpu/supports#supported-regions). If not specified, your application can be deployed in any available region.
@@ -645,10 +688,18 @@ attributes:
   exclude_region: us-east
 ```
 
+You can also exclude multiple regions using a semicolon-separated list:
+
+```yaml
+attributes:
+  exclude_region: us-east;ap-south;eu-west
+```
+
 This is useful when you:
 - Want to exclude a specific region from deployment
 - Need to ensure your application does not run in a specific region
 - Are testing region-specific functionality
+- Have compliance requirements that prohibit deployment in certain regions
 
 #### 3. Provider Selection
 The `desired_provider` attribute lets you specify a particular provider using their blockchain address:
@@ -658,13 +709,21 @@ attributes:
   desired_provider: "0x1234...5678" # Replace with actual provider address
 ```
 
+You can also specify multiple providers as a semicolon-separated list:
-- Need to maintain application state on a particular node
+- Want to deploy to specific nodes you trust
+- Need to maintain application state on particular nodes
 - Are testing node-specific functionality
+- Want to distribute your workload across multiple preferred fizz nodes
 
 #### 4. Bandwidth Selection
 <Callout type="info">
@@ -707,6 +774,7 @@ profiles:
         region: us-east # Geographic region
         desired_provider: "0x1234...5678" # Specific provider
         desired_fizz: "0xabcd...ef12" # Specific fizz node
+        bandwidth: 100mbps # Minimum bandwidth
       pricing:
         web:
           token: uSPON # uSPON is the only supported token for Spheron deployments
@@ -776,3 +844,62 @@ Mac deployments are particularly beneficial for:
 <Callout type="info">
 **Note:** The availability of specific Mac models depends on what's currently available in the Spheron network. Please check the supported models before configuring your deployment.
 </Callout>
+
+## 13. GPU VRAM Minimum Requirements
+
+<Callout type="warning">
+**Important:** This feature applies only to fizz mode deployments, not to provider deployments.
+</Callout>
+
+When deploying GPU workloads to fizz nodes, you can specify minimum VRAM (Video RAM) requirements to ensure you get nodes with sufficient available memory for your application. This helps maximize your workload efficiency and prevents deployment to nodes with insufficient VRAM.
+
+### Configuring VRAM Requirements
+
+You can specify the minimum VRAM requirement using the `req_vram` attribute in your GPU configuration. Here's an example:
+
+```yaml
+profiles:
+  compute:
+    gpu-intensive:
+      resources:
+        cpu:
+          units: 4.0
+        memory:
+          size: 8Gi
+        storage:
+          size: 20Gi
+        gpu:
+          units: 1
+          attributes:
+            vendor:
+              nvidia:
+                - model: rtx3090
+                  req_vram: ">=80"
+```
+
+### Using Comparison Operators
+
+The `req_vram` attribute accepts a string value consisting of a comparison operator followed by the required VRAM amount as a percentage. The supported comparison operators are:
+
+- `>=` - Greater than or equal to
+- `>` - Greater than
+- `<=` - Less than or equal to
+- `<` - Less than
+
+Examples:
+- `req_vram: ">=80"` - Requires at least 80% of VRAM
+- `req_vram: ">50"` - Requires more than 50% of VRAM
+- `req_vram: "<=20"` - Requires 20% of VRAM or less
+- `req_vram: "<50"` - Requires less than 50% of VRAM
+
+### Benefits
+
+Specifying VRAM requirements provides several advantages:
+
+1. **Resource Optimization**: Ensures your application gets the GPU memory it needs to run efficiently.
+2. **Deployment Reliability**: Prevents your workload from running on underpowered nodes that might crash or perform poorly.
+3. **Cost Efficiency**: Helps you match your workload to appropriate resources without overprovisioning.
+
+<Callout type="info">
+**Note:** When using the `req_vram` attribute, make sure the value you specify is between 0% and 100% of the GPU's total VRAM. Values above 100% or below 0% should not be requested.
+</Callout>
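
The `req_vram` rules described in this change (one of four comparison operators, followed by a percentage between 0 and 100) can be checked locally before a deployment file is submitted. The sketch below is only an illustration of those rules: `parse_req_vram`, `satisfies`, and `VramRequirement` are hypothetical names, not part of any Spheron tooling, and the assumption that the percentage is compared against a node's available VRAM is inferred from the wording above rather than confirmed by the source.

```python
import re
from typing import NamedTuple

# Hypothetical client-side check; not Spheron's actual matching logic.
# Operator alternatives are ordered so ">=" and "<=" are tried before ">" and "<".
_REQ_VRAM = re.compile(r"^(>=|<=|>|<)(\d+(?:\.\d+)?)$")


class VramRequirement(NamedTuple):
    operator: str
    percent: float


def parse_req_vram(value: str) -> VramRequirement:
    """Parse a string such as '>=80' into (operator, percent)."""
    match = _REQ_VRAM.match(value.strip())
    if match is None:
        raise ValueError(f"unsupported req_vram value: {value!r}")
    percent = float(match.group(2))
    # The documentation asks for values between 0% and 100%.
    if not 0 <= percent <= 100:
        raise ValueError(f"req_vram percentage out of range: {percent}")
    return VramRequirement(match.group(1), percent)


def satisfies(available_percent: float, requirement: VramRequirement) -> bool:
    """Assumption: the requirement is compared with a node's available VRAM percentage."""
    checks = {
        ">=": available_percent >= requirement.percent,
        ">": available_percent > requirement.percent,
        "<=": available_percent <= requirement.percent,
        "<": available_percent < requirement.percent,
    }
    return checks[requirement.operator]
```

For instance, `satisfies(85.0, parse_req_vram(">=80"))` evaluates to `True`, while `parse_req_vram(">=150")` raises `ValueError` because the value falls outside the 0 to 100 range.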