Tag Archives: Greg Schulz

Data De-duplication for Disaster Recovery

our beautiful site

For today’s time-strapped, resource-limited small businesses, the prospect of having to write backup data to magnetic tape each night and deliver it off-site for secure storage is daunting, to say the least. But while factors ranging from hackers to hurricanes make strict backup policies a business necessity, a relatively new technology is helping to ease the burden. Although best known for slashing storage costs, data de-duplication technology is making a name for itself as a key component of any disaster recovery strategy. Traditional backup systems store countless copies of the same information again and again — bytes and blocks of data that greatly accumulate throughout the course of a business day. Data de-duplication, on the other hand, works by only storing the changes that have been made to that data. Instead, redundant data is replaced with pointers indicating unique copies. By carefully eliminating redundant data, data de-duplication essentially reduces the volume of backup data — and bandwidth — required for transmitting large amounts of information to a recovery system. The result is a technology that not only allows for increased storage capacity with fewer hard drives and longer data retention periods but faster recovery times in the event of a disaster. Greater simplicity — and space That’s a huge relief to many small businesses. “For smaller companies that don’t have a lot of IT resources to handle data recovery, they want to make the backup process as simple as possible,” says Lauren Whitehouse, an analyst with Enterprise Strategy Group, of Milford, Mass. “Reducing their dependence on things like tape media and tape devices helps bring more simplicity to the IT environment.” But that’s not all. “Data de-duplication squishes the data down so much that you recapture disk space that you didn’t have before. So instead of backing up once a day, you can back up four times a day,” says Whitehouse. In practical terms, if a network outage occurs at 4 p.m. on a busy Wednesday afternoon, an IT manager need only retrieve data stored between the hours of 1 p.m. and 4 p.m., for example, rather than the past 24 hours, as is often the case when a company is working with limited storage capacity and/or tape media. No wonder then that data de-duplication is fast catching on. According to a survey by The 451 Group, while only 23 percent of IT organizations are using data de-duplication in their backup and data protection infrastructure, 28 percent of non-adopters said they plan to use it within six months, and another 23 percent said they would adopt it within a year. Caveat emptor However, there is some controversy surrounding data de-duplication’s powers of recovery. “The downside of storing more data on a given disk drive is it might actually take you longer to do backup and recovery,” warns Greg Schulz, founder of StorageIO Group, a Stillwater, Minn-based consulting firm. “That’s because de-duplication looks at the data as its being ingested into a storage system and has to do some heavy thinking. That’s a time constraint so if you need to restore, recover, and repopulate that data very quickly, performance can be a concern.” Of course, lightening fast data recovery for one company may seem painstakingly slow to another. It’s all the more reason, says Schulz, for companies to carefully consider their unique disaster recovery needs when turning to data de-duplication for protection.

Decrease Your Data by De-Duplication

As small and mid-sized businesses adopt new strategies that seek to leverage social networking and collaboration technologies to compete more effectively in global, national or even local economies, owners and managers are seeing the amount of data that must be stored, managed and accessed explode.  When this is combined with new mandates to protect consumer privacy — as the credit card industry is aggressively pursuing with companies that process payments through their systems — businesses face a significant challenge in controlling the cost of data stored, as well as ensuring that the life cycle of information is effectively managed.  To address this challenge, there is growing awareness — and even enthusiasm — about a new technology called data de-duplication.  Also known as single instance storage (SIS), this technology is designed to significantly reduce the amount of physical disk space needed for backup and other archival functions. How de-duplication works Data de-duplication – or SIS – configures data processing and storage functions in a way that eliminates or removes redundant files, bytes, or blocks of data to ensure that only unique data is stored on disks, according to analysts at the Enterprise Strategy Group (ESG).  It represents a new and interesting alternative to more traditional methods of dealing with data that companies accumulate in the course of their day-to-day business.  “Organizations are generating and depending upon ever larger volumes of data,” says Greg Schulz, founder and senior analyst with the StorageIO Group, of Stillwater, Minn.  The research and consulting firm specializes in advising companies on how to manage storage issues. “Data de-duplication is part of an ongoing effort by organization to reduce their data footprint.” This foot print grows every day as companies: Collect information about customers in customer relationship management (CRM) applications; Aggregate transactional data from point of sale or Web commerce systems; and Share information with partners in multi-organizational collaboration initiatives. Solving small business data woes The importance of storage is rising and becoming a bigger line item on the IT balance sheet.  According to AMI partners, the sales of small and mid-size storage solutions will significantly outpace the overall growth of their other IT investments.  The analysts are anticipating 18 to 20 percent a year growth in spending over the next few years in this segment.   If companies don’t manage their data footprint, they will risk losing control of their storage budget, because there is no sign that the amount of data that is accumulating will abate any time in the foreseeable future. “De-duplication is effective for data footprint reduction,” Schulz says. But it is not a panacea. It is a good solution for backup of same or similar data and files, however, it is not as effective for dissimilar data or online primary storage, he noted. Nevertheless, vendors are eyeing the small business market for this technology with great anticipation. Quantum Corp., of San Jose, Calif., brought to market a disk-based backup solution that incorporates data de-duplication and replication technologies. “This is such a step forward that it is impossible to ignore,” says Mike Sparkes of Quantum. Sparkes contends that this technology will plug a major gap in capabilities among small and mid-sized businesses, in particular, and provide a starting point for taming the data storage beast.