Skip to main content
llm.info

Modal

API

Serverless GPU infrastructure for AI workloads

About

Modal provides elastic GPU capacity with serverless compute for AI, ML, and data teams. Spin up GPU-enabled containers in as little as 1 second, autoscale to hundreds of GPUs and back to zero without managing infrastructure. Features pay-per-use pricing with no quotas or reservations. Custom infrastructure enables 2-4 second cold starts. Used for ML inference, fine-tuning, batch jobs, and GPU-accelerated tasks. Raised $80M in 2025. Most flexible serverless GPU provider with ergonomic Python SDK for arbitrary code execution.

Compatibility

Supported Languages

python

Details

Category
API

Resources

No description available