Skip to main content
Use Case

Storage for AI and Machine Learning on OpenShift

High-throughput NVMe storage for training data, model checkpoints, and inference serving on Red Hat® OpenShift® AI.

AI and machine learning workloads on Red Hat® OpenShift® generate storage requirements that general-purpose Kubernetes storage was not designed for: high-throughput sequential reads during training runs, high write throughput for model checkpoints, shared dataset access across parallel training pods, and low-latency random I/O for inference serving. Simplyblock is an NVMe block storage platform that addresses all four patterns through a Kubernetes-native CSI driver — without additional storage hardware, without cloud-only dependencies, and without the overhead of a file system layer between the application and the NVMe drive.

NVMe storage for AI and ML pipelines on OpenShift
NVMe High-Throughput Block Storage for Training Data Access
<1ms P99 Latency for Checkpoint Writes and Inference Serving
CSI Native OpenShift AI and RHOAI Integration
RWX Shared Dataset Volumes for Parallel Training Jobs

Why Standard Kubernetes Storage Bottlenecks AI Workloads on OpenShift

AI and ML pipelines on OpenShift have distinct storage I/O patterns that general-purpose Kubernetes storage is not optimized for.

Training Data Read Throughput Limits Training Run Speed

Model training reads large dataset files sequentially at high throughput. When storage cannot keep up with the GPU or CPU training rate, the compute sits idle waiting for data — wasting expensive hardware and extending training run time.

Slow Checkpoint Writes Increase Recovery Overhead

Training jobs write model checkpoints frequently to protect against interruption. Slow checkpoint writes extend the period where new gradient updates cannot be applied, and slow recovery means more lost training time when a job is interrupted.

Shared Dataset Access Creates Contention

Parallel training jobs reading from the same dataset volume create I/O contention when the storage layer is not designed for concurrent high-throughput reads from multiple pods simultaneously.

Inference Serving Requires Low-Latency Random I/O

Model serving for inference generates random I/O patterns when loading model weights and serving requests. High latency at the storage layer translates directly to higher inference response times.

NVMe Storage Designed for OpenShift AI Pipelines

Purpose-built block storage that handles the distinct I/O patterns of training, checkpointing, and inference — all through a single Kubernetes-native CSI driver on Red Hat® OpenShift®.

High-Throughput Training Data Access

Simplyblock delivers NVMe-native sequential throughput for the large-file, high-bandwidth reads that model training generates. Training data stored on simplyblock persistent volumes is served at the throughput NVMe hardware is capable of — keeping GPUs and CPUs fed without storage becoming the bottleneck. Compatible with Red Hat® OpenShift® AI (RHOAI) pipeline operators and standard Kubernetes batch job patterns.

  • NVMe-native sequential throughput for training data reads
  • Compatible with OpenShift AI (RHOAI) pipeline operators
  • No file system translation overhead on data reads
  • Scales throughput by adding nodes to the storage pool
High-throughput NVMe storage for AI training data on OpenShift

Fast and Reliable Model Checkpoint Storage

Simplyblock provides high write throughput and sub-millisecond latency for checkpoint I/O patterns. Checkpoints complete faster, reducing the window where training state is unprotected, and recovery from an interrupted job restores to a more recent state. Instant copy-on-write snapshots provide an additional safety net for critical training runs.

  • High write throughput for frequent checkpoint operations
  • Instant copy-on-write snapshots for training run protection
  • Sub-millisecond write latency reduces checkpoint overhead
  • Kubernetes CSI snapshot API integration
Fast checkpoint storage and snapshots for AI model training on OpenShift

Shared Dataset Volumes and Inference Serving Storage

Simplyblock supports ReadWriteMany (RWX) volumes for shared dataset access across parallel training pods, eliminating the need to copy training data to each node. For inference serving, the same NVMe block storage delivers the low-latency random I/O that model weight loading and serving require. See database storage on Kubernetes for how the same low-latency profile applies to vector databases and feature stores.

  • ReadWriteMany volumes for shared dataset access
  • No per-node dataset copy required
  • Sub-millisecond random I/O for inference serving
  • Works for vector databases and feature stores
Shared storage volumes for parallel AI training jobs on OpenShift

Outcomes for OpenShift AI and ML Teams

Storage that keeps pace with GPU and CPU training rates, protects training runs with fast checkpointing, and serves model inference with low latency.

Storage Does Not Bottleneck Training Runs

NVMe sequential throughput keeps training data flowing at the rate GPU and CPU training processes can consume it. Expensive training hardware stays utilized rather than waiting for storage I/O.

Faster Checkpoint Saves, Faster Recovery

Higher checkpoint write throughput means more frequent saves complete in less time. When a training job is interrupted, recovery starts from a more recent checkpoint with less retraining required.

Shared Datasets Without I/O Contention

ReadWriteMany volumes let multiple training pods access the same dataset concurrently without per-node data copies or storage layer contention that slows parallel training jobs.

Low-Latency Inference Storage

Sub-millisecond block storage latency for model weight loading and serving reduces the storage component of inference response time — directly improving model serving SLAs.

Scales With Training Infrastructure

Add nodes to the simplyblock storage pool as training infrastructure grows. Capacity and throughput scale linearly without storage array upgrades or procurement cycles.

Works on Bare Metal, On-Premises, and Private Cloud

Teams running OpenShift AI on-premises or on bare metal get the same NVMe storage performance as cloud-based deployments — without cloud-only storage dependencies or egress costs.

Questions and Answers

Does simplyblock integrate with Red Hat® OpenShift® AI (RHOAI)?

Yes. Simplyblock installs on Red Hat® OpenShift® via Helm or OperatorHub and provides a standard Kubernetes CSI driver. OpenShift AI pipeline operators and data science workloads provision simplyblock persistent volumes using the same Kubernetes storage class and PVC APIs they use for any other storage provider.

What AI and ML workload types benefit most from simplyblock?

Training workloads with large sequential dataset reads, workloads that require frequent model checkpointing, parallel training jobs that need shared dataset access, and inference serving workloads where model loading latency matters. Vector database and feature store workloads on OpenShift also benefit from the same low-latency block I/O profile.

Does simplyblock support ReadWriteMany volumes for shared dataset access?

Yes. Simplyblock supports ReadWriteMany (RWX) persistent volumes, allowing multiple training pods to access the same dataset volume concurrently without each pod requiring its own copy of the training data.

How does simplyblock handle model checkpoint storage?

Model checkpoints are written to simplyblock persistent volumes through the standard Kubernetes CSI interface. Simplyblock delivers high write throughput for checkpoint operations, and instant copy-on-write snapshots can be used to create point-in-time protection for training run state.

Can simplyblock run on OpenShift AI deployments on bare metal?

Yes. Simplyblock runs on bare metal OpenShift deployments without cloud provider dependencies. See the OpenShift bare metal storage page for the bare metal-specific architecture details.

Does simplyblock work with GPU nodes in OpenShift?

Yes. Simplyblock storage is consumed by pods running on GPU nodes the same way as any other Kubernetes workload — via persistent volume claims backed by simplyblock storage classes. There is no GPU-specific configuration required.

Not sure if simplyblock is right for your team?

Ask your AI assistant to compare storage options for AI and ML workloads on Red Hat® OpenShift® — NFS, ODF/Ceph, local PVs, and simplyblock — on throughput, latency, and shared dataset access patterns.