Overview
Read about how Serverless AI works and how to choose between endpoints and jobs
Getting started with jobs
Create your first job, which runs nvidia-smi and prints information about the GPUs in use
Getting started with endpoints
Launch a simple endpoint and send authenticated requests to it
Monitoring
Track resource utilization to schedule quota increases and quickly identify anomalies
Pricing and quotas
Learn which other services Serverless AI uses and how they affect pricing and quotas