A dark server room corridor lit in red, representing a specialized GPU cloud data center.
CoreWeave rents fleets of NVIDIA GPUs wired together for one job: training and serving large AI models.

CoreWeave is a cloud provider built around one thing: renting NVIDIA GPUs for artificial intelligence work. General clouds like AWS or Azure serve every kind of workload, from email servers to databases. CoreWeave narrows the focus to GPU compute for training foundation models and running inference at scale. It calls itself an AI-native cloud. The industry calls this category a “neocloud”: a provider that specializes in GPU capacity instead of offering a broad menu of general services.

The problem it solves is supply and specialization. Large AI teams need thousands of the newest GPUs, connected by fast networking, available now, and tuned so a training run does not stall or crash halfway through. Hyperscalers offer GPUs too, but capacity is often scarce and the stack is general-purpose. CoreWeave concentrates its data centers, networking, and software on that narrow demand.

Where CoreWeave sits in the stack

Your workload
Model training Fine-tuning Inference serving What you run on the rented GPUs
Orchestration
CoreWeave Kubernetes Service Slurm on Kubernetes (SUNK) Mission Control Scheduling, observability, job management
Networking
InfiniBand fabric NVIDIA SHARP Low-latency links between GPU nodes at cluster scale
Compute and storage
NVIDIA GPUs (Blackwell, Hopper) Bare metal servers Object and file storage The physical hardware in CoreWeave data centers

CoreWeave sits at the infrastructure layer, below the model. You bring the model and the training code. CoreWeave provides the GPUs, the fast links between them, and the tools to schedule and watch the jobs.

Neocloud versus hyperscaler

A hyperscaler is a large general-purpose cloud: Amazon Web Services, Microsoft Azure, Google Cloud. It offers hundreds of services across compute, storage, databases, analytics, and more. GPUs are one product among many.

A neocloud specializes. It rents GPU compute and the networking and software that AI teams need around it, and little else. The bet is that focus produces better availability of new hardware, tighter tuning for large training runs, and lower prices per GPU-hour than a general cloud.

What CoreWeave rents

CoreWeave offers access to NVIDIA GPUs, including current Blackwell-generation parts (such as GB200 and HGX B200) and Hopper-generation parts (H100 and H200), alongside older data-center GPUs. It provides bare-metal servers, an InfiniBand networking fabric that connects GPUs across a cluster, and storage options including object storage and distributed file storage. On top of the hardware, it offers a managed Kubernetes service, a Slurm-on-Kubernetes scheduler for large training jobs, and Mission Control for observability.

How to access it and typical use

CoreWeave is a cloud service, so there is no local install. You access it as an account and provision GPU clusters through its console, Kubernetes service, or APIs.

Step 1 Contact and onboard Engage CoreWeave for an account and reserve capacity for your GPU class and cluster size.
Step 2 Provision a cluster Spin up GPU nodes through the CoreWeave Kubernetes Service, connected over InfiniBand.
Step 3 Run the job Schedule training or inference workloads with SUNK or Kubernetes; watch them in Mission Control.
Scale and serve Grow the cluster for larger runs, then serve the trained model for inference.

Typical users are AI labs, model builders, and enterprises with heavy AI workloads. CoreWeave has named customers including OpenAI, Mistral AI, IBM, and Google. The common thread is a need for large blocks of current GPUs, wired for distributed training, available on demand.

How it compares

CoreWeaveAWS / Azure / GCPLambda CloudNebius
TypeNeocloud, GPU-focusedHyperscaler, generalNeocloud, GPU-focusedNeocloud, GPU-focused
Primary offeringNVIDIA GPU clustersFull cloud menuNVIDIA GPU cloudNVIDIA GPU cloud
NetworkingInfiniBand fabricVaries by instanceInfiniBand on clustersInfiniBand on clusters
Managed AI stackKubernetes, SUNK, Mission ControlBroad managed servicesGPU-focused toolingGPU-focused tooling
Best forLarge-scale training and inferenceTeams wanting one cloud for everythingGPU rentals for research and startupsGPU capacity, Europe presence

CoreWeave is a public company. It listed on Nasdaq under the ticker CRWV on 2025-03-28, at an IPO price of 40 US dollars per share. It was founded in 2017 and is headquartered in Livingston, New Jersey. It began as a crypto-mining startup before pivoting to AI cloud infrastructure.

When not to use it

CoreWeave is a poor fit if you do not run heavy GPU workloads. If your app is a standard web service, a database, or a batch job on CPUs, a hyperscaler or a smaller host serves you better and cheaper. If you want a single cloud for everything, from your database to your queue to your GPUs, the specialization that makes CoreWeave attractive becomes a limitation, because you would still need another provider for the rest of the stack.

CoreWeave is also aimed at scale. If you want to call a model through a simple API and never manage hardware, a managed model service such as Amazon Bedrock fits better than renting and operating your own GPU cluster. Renting GPUs means you own the job of scheduling, monitoring, and keeping large runs healthy.

Further reading

Sources