NAS research: Which RAID type is right for me?
Guide

NAS research: Which RAID type is right for me?

Martin Jud
22.3.2019
Translation: machine translated

Twix used to be called RAIDer. And RAIDer always came with two hard discs. If you now operate your own cloud using RAID, it also requires at least two hard drives. But unlike the chocolate bar, you can't just insert the HDDs. First you have to decide on a RAID type.

As an attentive reader of the you will have noticed that I am buying a replacement for my ageing NAS. I have decided on the following configuration: Synology DS918+, 4 GB RAM expansion and four HDDs Seagate IronWolf 8 TB.
.
Why this configuration in particular, you can read here:

  • Background information

    More space on my NAS: which HDDs are worth it?

    by Martin Jud

I'm really looking forward to my new cloud. But before I set it up, how important is reliability to me? Depending on the RAID type, I get more storage space or more security.

What is a RAID?

The term RAID stands for "Redundant Array of Independent Disks", which translates as "redundant collection of independent hard disks". This is a combination of several physical hard drives to form a single logical drive. This can bring benefits in terms of performance, reliability and resilience.

Redundancy? No, that's not a backup!

In general, redundancy means the multiple presence of similar objects. In relation to RAID, it means that you use additional resources as a reserve. In other words, part of the available storage space is used as a fail-safe and therefore cannot be used as conventional network storage. Depending on the RAID type/level used, one or more hard disks can fail without the risk of definitive data loss. The data lost on the failed hard drive can be recovered from the remaining HDDs thanks to redundant copies. You replace the defective hard drive while the system is running and that's it.

If you are now thinking that redundancy is a backup, I have to disappoint you. Redundancy is merely a fail-safe measure that allows you to continue working despite a defective hard drive.

For the following reasons, your network storage is not a backup even with redundancy: If you catch a virus or incorrectly written data due to a software error, the problems will also be transferred to the redundantly stored data. There is also the risk of accidentally deleted files. These can only be restored with luck and some effort. In addition, a backup should be stored at a different location than the original data. As you can see, not having a backup because of a RAID system can be a bad idea.

Parity: The RAID error correction

Parity is generally a technique for error correction. It is necessary because data transfer is not a completely error-free process. Parity is used to check whether data has been lost or overwritten during copying, moving or transferring. This works by adding an additional binary number to a group of bits - the so-called parity bit. This can be used to determine whether the moved bits have arrived successfully.

The concept of parity checking exists for many applications - for example, in modem communication. Or also in an extended form (horizontal and vertical parity) for a RAID or certain RAID levels. RAID 5, for example, offers redundancy through the use of parity. The parity information enables the data to be rebuilt in the event of a drive failure. Depending on the RAID level, the parity is stored on a single drive or distributed across several drives.

RAID types and their properties

Whether you have purchased a NAS or a hardware RAID, want to create a software RAID or even a host RAID - the question of the RAID type/level always arises. Unless you are not interested and let the setup wizard of your NAS adopt the factory default setting.

Which RAID type you can or should use for your hard drive network depends on the number of hard drives. On the other hand, it also depends on the power of the processor/NAS, as certain RAID types cause more load due to the parity information. But see for yourself - the most important RAID levels are explained below. I have omitted more exotic RAID types (RAID 2, 3, 4, 7, etc.), which are only used rarely or in special cases, for the sake of clarity.

The different RAID levels

The following table gives you an initial overview of the different RAID levels. You can see the minimum number of HDDs required for each level, how much storage space is available to you, the reliability offered, the read and write speed and an indication of the costs.

Raid levelMin. number of HDDsUtilisation of capacityFail-safeRead speedWriting speedCosts
RAID 02100 %noneVery highVery highVery low
RAID 121 HDD (50 % with 2 drives)one driveVery highnormal*high (to very high)
RAID 53at least 66.6 %one drivehighmostly normal**low
RAID 64at least 50 %two driveshighusually a little slower**high with 4 drives, cheaper with more
RAID 10 (1+0)450 %One drive per sub-arrayVery highhighhigh
*Normal = like a single hard drive / **Depending on the given system performance.

RAID 0 - disc striping

RAID 0 is the RAID that is not a RAID. This is because there is no redundancy - all data is only present once. In a RAID 0 system, the data is divided into blocks and written to all drives in the array. RAID 0 requires at least two hard drives.

Source: www.seagate.com
Source: www.seagate.com

Advantages:

  • The highest possible storage capacity is utilised = 100 per cent.
  • Excellent performance for read and write access.
  • There is no overhead (additional data) due to parity checks.

Disadvantages:

  • RAID 0 is not redundant and offers no reliability. If a drive fails, all data is lost.

RAID 1 - Disk Mirroring

RAID 1 consists of a data drive and (at least) one mirror drive. If one of the drives fails, all data can be restored quickly and easily thanks to the mirroring.

Source: www.seagate.com
Source: www.seagate.com

Advantages:

  • With RAID 1 you get an excellent read speed and a write speed comparable to a single drive.
  • If a drive fails, the data does not have to be recreated from parity information, but is simply copied to the replacement drive.
  • The technology is simple and keeps the processor load low.

Disadvantages:

  • You only get the storage capacity of a hard drive

  • With a software RAID-1 system, hot-swapping, i.e. replacing the hard drive during operation, is not always supported.

RAID 5 - Striping with Parity

This is the most commonly used "secure" RAID level with parity. RAID 5 requires at least three drives and supports up to 16 drives. The data blocks are distributed across all drives. The parity information for the blocks is also not written to a single drive. This has the advantage that it does not matter which hard drive fails. RAID 5 can withstand a single hard drive failure.

Source: www.seagate.com
Source: www.seagate.com

Advantages:

  • Reads are fast

  • Any drive can fail without losing data
  • When using four drives, you get a capacity utilisation of 75 per cent, where a RAID 10 only offers 50 per cent in comparison.

Disadvantages:

  • Write accesses are slower than with RAID 10, as parity must always be calculated.
  • The technology is very complex. If a hard drive fails, it takes a long time to recover the data.
  • Data recovery requires a large number of accesses to all drives. If a second hard drive is already somewhat damaged and fails during this time, the data is lost.

RAID 6 - Striping with Double Parity

RAID 6 is a RAID 5 with double parity. It therefore requires at least four drives. Data blocks and parity information are distributed across all drives. Two of the hard drives used can fail without the risk of data loss.

Source: www.seagate.com
Source: www.seagate.com

Advantages:

  • Reads are fast

  • Any two drives can fail without losing data. This makes RAID 6 more secure than RAID 5.
  • When using more than four drives, capacity utilisation is better than with RAID 10.

Disadvantages:

  • Write accesses are slower than with RAID 5 and considerably slower than with RAID 10.
  • Capacity utilisation of at least 50 per cent is worse than with RAID 5.
  • The technology is very complex. If a hard drive fails, it takes a long time to recover the data.
  • Data recovery requires a large number of accesses to all drives, which is not conducive to the "mean time between failures" of the hard drives.

RAID 10 (1+0) - Striping and Mirroring

RAID 10 is the combination of RAID 1 and RAID 0 and requires at least four drives (only an even number possible). It combines the advantages and disadvantages of both RAID levels in one system. This is a nested or hybrid RAID configuration. This offers security by mirroring all data on secondary drives as well as simultaneous distribution across each drive block. This in turn speeds up data transfer. With RAID 10, one drive per sub-array can fail without the risk of data loss.

Source: www.seagate.com
Source: www.seagate.com

Advantages:

  • You get excellent read speed and high write speed with RAID 10

  • If a drive of a sub-array fails, the data does not have to be recreated from parity information, but is simply copied to the replacement drive.
  • The technology is simple and keeps the processor load low.

Disadvantages:

  • As all data is written twice, you only get 50 per cent of the total hard drive capacity.
  • Very expensive compared to RAID 5 or RAID 6.

Automated RAID Management

Some NAS manufacturers offer automated RAID Management. At Synology, for example, this is called SHR (Synology Hybrid RAID). This is primarily intended for users who are not very familiar with RAID. The system does most of the configuration work and offers fail-safety for at least one HDD. If you have two drives in operation, SHR will create a RAID 1. However, if you have more drives, automated RAID Management may differ from standard RAID levels. For more detailed information, it is best to contact the manufacturer of your NAS.

Who has the choice ...

So, now I know more. And yet I'm still a little undecided as to which RAID type I should configure my NAS with. In particular, discussions with computer scientists from my environment make me doubt whether it is a good idea to use RAID 5. I was told that RAID 5 is not always a secure and somewhat fiddly technology. But when I ask Product Management in-house, they advise me to use RAID 5. Anything else would be a waste of storage space.

Well, I'm off to the weekend to ponder whether I should use RAID 5, RAID 10 or even RAID 6. You'll find out what I decide in the next article about my NAS migration.

The right RAID type

What RAID level would you use on a NAS with four bays (hard drive bays)?

  • RAID 0
    5%
  • RAID 5
    51%
  • RAID 6
    12%
  • RAID 10
    31%

The competition has ended.

67 people like this article


User Avatar
User Avatar

I find my muse in everything. When I don’t, I draw inspiration from daydreaming. After all, if you dream, you don’t sleep through life.


These articles might also interest you

Comments

Avatar