Would you prefer tea with sugar or washing your hands with soap?

Wednesday, 27 May 2015 00:00

font size decrease font size increase font size

Would you prefer tea with sugar or washing your hands with soap?

Rate this item

(0 votes)

Major question, isn’t it? Sounds a bit stupid unless you’ve been to Russia of 90s of 20th century.

The first decade of new millennia comes to an end and you still hear an equally awkward phrase with ever increasing frequency: “why backup if I’ve already set up the RAID?”. And, as a matter of fact, one doesn’t hear it at the moment of creating the system, but only at the very moment when data has to be rescued and not at all by standard means. The more expensive the RAID controller was, the more is the outrage – “I’ve spent SO MUCH money on this controller, do I look like a Rockefeller to backup data on top of that?! It’s over! Manufacturers have conspired! No place for justice in this world! I’ve purchased the controller to protect my data!” As you could get it from the above, we are talking mostly about ‘home’ users, though small businesses experience same problems as well.

And it could have been justified if at least one RIAD controller manufacturer was promoting its product as a replacement for data backup. So what for do we need RAID? Can it ensure data protection? The answer to the question is quite ambiguous: it can, in specific cases. In what cases? Very simple - RAID (except RAID-0) would ensure data accessibility in case of crash of one or more disks (for instance, two in case of RAID-6). That is, so to say, the protection which could be ensured by hardware or software RAID. No more, no less! Take note of the word ‘accessibility’ – this is the main task, i.e. the objective is not at all data protection, it is a minimization of possible downtime. Can a data from RAID ‘get lost’? Sure it can! There is a number of ways and the following is just a few examples of data loss:

• Program error – the simplest case which does not depend on whether RAID is there or not.

• User error – quite common scenario.

• Crash of two disks in RAID-5 (or three in RAID-6). You might think that it is unlikely to happen. No it’s not. If high-capacity disks are used, the possibility of repeated failure considerably increases when array is rebuilding in the event of disk crash. Besides that, a banal problem with power unit could simply ‘kill’ the electronics on several disks.

• Accumulated logical errors. Where do they come from? Usually hardware RAID Controllers have cache, able considerably increase disk performance. However, if cache is not protected, then any unexpected system reboot will lead to data loss in controller cache. If reboot occurred when data simply ‘waited’ in cache, then the error occurs on the file system level. But, in case if at the moment of rebooting cache data has been written to disks, data might be written only partially. At this stage the errors occur not only at file system level but also at the level of RAID array, since it is not clear what part of stripe was successfully written and what was not. The majority of manufacturers advise consistency check to ‘catch’ such errors, but who uses them unless in trouble? You can protect yourself by having battery backup unit, flash memory capacitors or disabling write cache. The first one cost money, second – decreases performance.

• Not only controller has cache but disks do too. Disk write operations are cached. It is recommended to disable this cache, though for SATA disks and weak controllers it substantially reduces disk subsystem performance. And those who are not willing to get slow system let it remain on. What can happen? Correct, as it is mentioned above, reboot can lead to cached data loss. Even if controller thinks that array is fine, in reality, the data will be different. And if such failure occurred in the parity block, the data will be accessible until everything is fine with array. As soon as the block is used for recovery (after failure of another disk), the recovered data will be a complete ‘trash’.

• Controller interacts with disks which can ‘respond’ to its commands with different delays (for example disk tries to remap a bad sector), and controller may not wait for the answer and send the disk ‘to rest’. What happens if it is already a seconds disk in RAID-5? Correct, say good bye to data. Well, of course, disks from the compatibility lists experience such problems very seldom, but how often does the user check these notorious lists? Despite the presence of lists, unfortunately, the outcome of voting is usually based either on money or in favour of the favourite brand.
I have listed above only the most common cases. All such problems can pile up and the number of potential problems would expand like a snow ball. If at the moment of failure there is a ‘risky’ array operation taking place (for instance, adding a disk), the probability of successful recovery with own hands (I am not even talking of controller tools) rapidly nears zero chances. It is, unfortunately, what we observe quite often (from the side).

So you may say that RAID controllers are ‘evil’ and a tool for enrichment of avaricious manufacturers. So many terrifying stories have been told, so do I really need RAID at home? In which cases it makes sense to use it?

• If you need to increase the speed of disk subsystem performance (when one disk performance is not enough). IF your hobby is video processing or ‘games’ with virtual machines, why not?

• Computer is a part of home office and contains crucial commercial information. You need to protect your data before it is backed up.

• Can’t spare time to reinstall the system in case of disk failure. Quite reasonable, especially if computer is not a test range and weekly Windows re-installation is not due on every Sunday 

• You keep huge volumes of data in on-line mode and you do not wish to recover it in case of disk failure.

Things you need to remember when you decided to make life easier with RAID?

1. Using any RAID, installation of BBU, disabling caches, regular checking – nothing guarantees data security and integrity, unless you have a reliable backup copy.
2. Copies of important data shall be kept on separate storage devices and it is highly advised that one of such devices is not rewritable.
Below you will find several tips in case you still don’t care about said above and decided to go your own way:
1. When creating RAID array note down all settings (disk order, size of stripe etc.). Note down even if you accepted all defaults. Actual meaning of default may vary for different firmware versions of hardware controller. It goes without saying that you should not keep this data on the array – spare a sheet of paper.
2. Regularly update drivers and firmware. Although it doesn’t mean that you have to install a new firmware version on the day it is released. If you don’t have problems, wait a week or two. There is always a chance that it contains some problems or defects and will be soon replaced.
3. Use all available monitoring tools. If you found out about failure not from an email message but from the fact that system doesn’t boot, usually it means that it is too late to rescue anything.
4. Perform regular data integrity verification.
5. Copy at least the most important data to DVD disks, for example.
6. If you plan to modify the configuration of array (add disks, change RAID level and so forth), refer to point No 5 once again (or better read the entire article). In case if the modification was successful, recollect point No 1 and make relevant notes.
7. Do not choose disks for hardware controller on the basis of price and general brand impression, choose them from compatibility lists of the given controller.
8. If anything got broken, first of all copy the crucial data and only then try to mend the problem.

9. If something’s got broken in a way that data is not accessible any more, refrain from any actions and call professionals.

Finding professionals is not difficult nowadays – even if you are far from the city, you can mail or talk via phone. Trust me, mailing expenses would bleak on the background of recovery cost. If it is possible, make sector by sector copy of all disks and experiment on those.

All of the above will not save you from data loss, although it will help to substantially reduce the risk of loss and is likely to reduce the cost of recovery (should the ‘X’ time come). Once again: be ready to trust the recovery job to specialized organization. Do not wonder if the cost of recovery would be times higher than the cost of controller and disks put together. If you can’t bear such expenses, think once again about backing up the data you do not want to loose. So wash your hands with soap and put sugar in your tea.

Last modified on Wednesday, 27 May 2015 16:16

Data Recovery Expert

Viktor S., Ph.D. (Electrical/Computer Engineering), was hired by DataRecoup, the international data recovery corporation, in 2012. Promoted to Engineering Senior Manager in 2010 and then to his current position, as C.I.O. of DataRecoup, in 2014. Responsible for the management of critical, high-priority RAID data recovery cases and the application of his expert, comprehensive knowledge in database data retrieval. He is also responsible for planning and implementing SEO/SEM and other internet-based marketing strategies. Currently, Viktor S., Ph.D., is focusing on the further development and expansion of DataRecoup’s major internet marketing campaign for their already successful proprietary software application “Data Recovery for Windows” (an application which he developed).

Latest from Data Recovery Expert

More in this category: « Recovering the GRUB on software RAID Recovery of RAID 5, 16 disks WD2001FASS, Maxtronic SS-6601E, New York »

Make sure you enter the (*) required information where indicated. HTML code is not allowed.

Data Recoup 4.8 / 5 based on 208 user ratings

Sidebar

Main Menu