Providers
Where to run open-source models when local hardware isn't enough. Hosted inference APIs and GPU clouds, compared.
Inference APIs
Send prompts, get completions back. No GPU management needed. Most of these providers expose an OpenAI-compatible endpoint, so a single client works across them (see the sketch after the list).
Together AI: widest OSS model selection. Serverless and dedicated endpoints.
Fireworks AI: fast inference, competitive pricing, good developer experience.
Groq: fastest inference, on custom LPU silicon. Limited model selection.
Cerebras: wafer-scale inference. Extremely fast on the models it supports.
Replicate: pay-per-second billing. Easy API, broad model variety including image and video.
DeepInfra: cost-effective inference. Good selection of popular OSS models.
Modal: Pythonic API, built-in fine-tuning, serverless GPUs.
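
Because most of these APIs speak the OpenAI chat-completions protocol, one client covers them. A minimal sketch; the base URL, API-key variable, and model id below are placeholders, to be swapped for the values in your provider's docs:

```python
# Minimal chat-completion call against an OpenAI-compatible endpoint.
# base_url, PROVIDER_API_KEY, and the model id are placeholders; each
# provider documents its own values.
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://api.example-provider.com/v1",  # placeholder endpoint
    api_key=os.environ["PROVIDER_API_KEY"],          # placeholder env var
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # placeholder model id
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
    max_tokens=64,
)
print(response.choices[0].message.content)
```

Switching vendors is usually just a matter of pointing base_url and the model id at a different provider; providers that lead with their own SDKs instead document their own clients.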
GPU Clouds
Rent GPUs by the hour. Bring your own model and run your own serving stack (a minimal sketch follows the list).
Vast.ai: GPU marketplace with the cheapest per-hour rates. Community and secure-cloud tiers.
RunPod: serverless GPU endpoints plus on-demand pods. Good for fine-tuning and batch inference.
Lambda: GPU cloud for training and inference. H100 clusters available.
CoreWeave: enterprise GPU cloud. Large-scale training infrastructure.
Paperspace: ML-focused cloud with notebooks and deployments.
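
On a rented GPU there is no managed endpoint; you run the serving stack yourself. A minimal sketch using vLLM as the engine, assuming a CUDA machine with vLLM installed and enough VRAM for the chosen model; the model id is a placeholder:

```python
# Minimal self-hosted inference on a rented GPU using vLLM.
# Assumes `pip install vllm` on a CUDA machine with enough VRAM;
# the model id is a placeholder.
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")  # placeholder model id
params = SamplingParams(temperature=0.7, max_tokens=128)

outputs = llm.generate(["Explain KV caching in two sentences."], params)
print(outputs[0].outputs[0].text)
```

To serve over the network instead, vLLM ships an OpenAI-compatible server (`vllm serve <model-id>`), so the client sketch from the Inference APIs section works unchanged against your own box.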