top of page

simplyblock and Kubernetes

Simplyblock provides high-IOPS and low-latency Kubernetes persistent volumes for your demanding database and other stateful workloads.

What is Software Defined Storage (SDS)?

Updated: Jun 28

Software Defined Storage, also known as SDS, decouples the storage software layer from the underlying hardware. This allows for centralized management and automation of storage resources through the software abstraction layer.


Unlike traditional storage solutions, which typically rely heavily on proprietary hardware, Software Defined Storage leverages commodity hardware and virtualization technologies. Software Defined Storage enables companies to deploy, operate, and scale storage resources with greater flexibility and cost efficiency.


How Software Defined Storage works

Software Defined Storage is first and foremost a software that abstracts the hardware from the data management and visible data storage. This enables a high degree of flexibility when choosing the storage hardware, and provides the potential to build a storage solution that perfectly fits one's needs in terms of performance, capacity, and scalability requirements.


Software Defined Storage comes in multiple facets, sometimes bundled as a full operating system (often based on Linux or FreeBSD), or as a software layer installed to a common OS (most commonly Linux) installation. In either case, the physical hardware is managed by a general purpose operating system, while the storage management is delivered in software.


To run a Software Defined Storage a suitable hardware or virtualization platform needs to be selected. Depending on the SDS solution, virtual cloud hosts (e.g. AWS Amazon EC2, Google Compute Engine VMs, or similar), on-premise virtual machines such as VMware VMs, or physical, dedicated storage servers can be used. In either case, the “physical” layer provides the actual storage capacity.


Software Defined Storage is not…

While Software Defined Storage is often used as a synonym for storage virtualization that isn’t actually true. Storage virtualization defines the capability to combine and pool multiple local or remote storage devices into a single, large storage pool. For that reason, many SDS solutions are also storage virtualization solutions to some extent, hence the mix-up of the individual terms. It is perfectly possible to build a SDS solution without the storage pooling option though.


Software Defined Storage is also not a SaaS (Software as a Service) or IaaS (Infrastructure as a Service) solution. While it can be provided as a hosted and managed platform, more often it is not and is operated by the customer directly. That comes down to multiple factors such as data privacy concerns or regulatory requirements, as well as specific configuration requirements.


Last but not least, a Software Defined Storage isn’t necessarily a NAS (Network Attached Storage) or SAN (Storage Area Network). Since a SDS isn’t required to be built from a cluster of storage nodes, or even a set of storage drives, there is no requirement to be able to pool them into a single storage space. In addition, a SDS solution isn’t necessarily connected through a network inter-connect to the host machine which consumes the storage. That said, while both SAN and NAS aren’t factual ingredients of SDS, just like storage virtualization, they are often part of the SDS solution for a broader set of use cases, and increased flexibility.


Before and after: Software Defined Storage vs Traditional Storage

Traditional enterprise storage setups are often based on proprietary hardware, meaning that over time, multiple different storage systems are collected. These are often incompatible with each other, hence it’s much harder to scale them or migrate between different solutions. That means that more than not setup gets stuck in time, while new machines or generations are acquired for new use cases.


Traditional storage setup with separated storage solutions.
Traditional storage setup with separated storage solutions.

That leads to imbalanced use of the available storage resources, and while some are at their capacity limit, others idle with plenty of unused free space. Migration between vendors or hardware generations is often complicated.


On the other hand, with Software Defined Storage solutions much more flexible setups are possible. Most SDS solutions feature storage virtualization (as mentioned above), which enables pooling the available storage resources, and provides slices of it to the individual use cases.


Software Defined Storage solution with storage virtualization and combined performance and capacity.
Software Defined Storage solution with storage virtualization and combined performance and capacity.

These slices (e.g. logical block storage, or any other pattern of storage type) can differ in capacity, performance characteristics, and even storage type. Depending on the Software Defined Storage in use, one or more of the typical storage types (file storage, block storage, and blob / object storage) may be available to workloads.


Due to the nature of storage pooling, migrations between the underlying, abstracted hardware is easy and normally (automatically) handled by the SDS. The same is true for scalability. If free available storage gets sparse, additional storage hardware can be added. This can be a seamless, online operation, or require a downtime, depending on the solution in place.



Benefits of Software Defined Storage

With all that said, Software Defined Storage has some clear advantages over the traditional, hardware-based storage options.


  1. The unified storage layer enables flexibility and easy migration. From a consumer’s perspective, the logical devices look the same, no matter where and how they are stored on the abstracted hardware.

  2. The typically integrated storage pooling enabled a great degree of scalability. Starting small and adding additional hardware to the storage pool at a later point in time enables cost effective storage usage without wasting unused capacity.

  3. Choosing your own hardware enables you to build storage systems that fit the requirements in terms of performance, reliability, and capacity. No vendor lock-in and no reliance on proprietary hardware.

  4. Overall, a typical Software Defined Storage solution enables the most cost effective way to store data through optimized hardware configurations, storage pooling (storage virtualization), features like thin provisioning, and more.


Hyper-Converged Storage

Hyper-converged storage is a deployment pattern, where the storage solution is installed into the same cluster as the application. That means that it consolidates storage, compute, and networking resources into a single integrated system.


This architecture co-locates storage with compute within a single cluster environment (most commonly Kubernetes). This simplifies management, but often limits scalability and performance due to resource sharing with other use cases. 


Hyper-converged storage solutions typically utilize distributed architectures and instance local flash storage to deliver high throughput and low latency.


Disaggregated Storage

Disaggregated storage is a storage architecture where storage resources are separated from compute resources, allowing them to be managed and scaled independently.


Unlike traditional storage systems where storage is tightly integrated with compute within individual servers or nodes, disaggregated storage pools storage resources separately from compute resources across a network.


Disaggregated storage enables easier scalability since storage resources and compute resources are distinct concerns and clusters. That means that a storage cluster can be scaled up even if no additional compute resources are required. Many databases will grow over time, increasing the storage needs without additional compute power requirements.


Get the most out of your storage with simplyblock

Simplyblock is the next generation of Software Defined Storage, enabling storage requirements for the most demanding workloads. High IOPS per Gigabyte density, low, predictable latency, and high throughput is enabled using the pooled storage, as well as our distributed data placement algorithm. Using erasure coding (a better RAID) instead of replicas helps to minimize storage overhead without sacrificing data safety and fault tolerance.


Additional features such as instant snapshots (full and incremental), copy-on-write clones, thin provisioning, compression, encryption, and many more, simplyblock meets your requirements before you set them. Get started using simplyblock right now or learn more about our feature set.

Comentarios


bottom of page