Dedicated Endpoints
Pricing
Billing based on the GPU type, number of instances and the duration of instance services.
Get inference flavor
Example Request:
curl --location 'https://api.netmind.ai/v1/inference-service/flavor' \
--header 'Authorization: Bearer {{API_TOKEN}}'
import requests
url = "https://api.netmind.ai/v1/inference-service/flavor"
payload = {}
headers = {
'Authorization': 'Bearer {{API_TOKEN}}'
}
response = requests.request("GET", url, headers=headers, data=payload)
print(response.text)
Example Response:
{
"flavor_list": [
{
"flavor_id": "69475e82e81c4dd6be3467e2ca374e0c",
"display_name": "NVIDIA_GeForce_RTX_4090",
"cluster_id": "1",
"cluster_flavor_id": "US_01_4090",
"meta_info": {
"cuda": "12.0",
"region": "cn"
},
"billing": {
"cny_price_unit": 2.2,
"usd_price_unit": 0.3,
"nmt_price_unit": 0.147164
},
"created_at": "2024-11-13 09:00:19",
"updated_at": "2024-11-13 09:00:19",
"deleted_at": null,
"is_deleted": false,
"available_num": 1,
"node_max_gpu": 2
},
{
"flavor_id": "7b2f36d30e0743debc6c60d5017e2d16",
"display_name": "NVIDIA_GeForce_RTX_4090",
"cluster_id": "1",
"cluster_flavor_id": "US_01_4090",
"meta_info": {
"cuda": "12.0",
"region": "other"
},
"billing": {
"cny_price_unit": 2.2,
"usd_price_unit": 0.3,
"nmt_price_unit": 0.147164
},
"created_at": "2024-11-13 09:00:19",
"updated_at": "2024-11-13 09:00:19",
"deleted_at": null,
"is_deleted": false,
"available_num": 1,
"node_max_gpu": 2
}
]
}Create endpoint
Example Request:
Example Response:
Get endpoint
Example Request:
Example Response:
Update endpoint
Example Request:
Example Response:
Delete endpoint
Example Request:
Example Response:
Last updated
Was this helpful?