Technology Industry
Industry: Email Alert RSS FeedPlanning for backup and recovery: the key elements for a successful backup and recovery strategy
Computer Technology Review, March, 2004 by James Dow
Backup and recovery are point-source solutions on the continuum of availability and resiliency. Availability describes operational behavior of a system under adverse conditions (i.e., failure of one-half of a clustered system). Resiliency describes the operational behavior of system restoration after service degradation due to an unplanned event. In their most extreme examples, both availability and resiliency are provided by backup and recovery systems.
Most RecentTechnology Articles
Backup and recovery systems traditionally address two specific requirements: error correction (i.e., inadvertent deletion of a necessary file or files) and disaster recovery. Many firms developed and currently operate backup and recovery strategies primarily designed to address the operational issue of error recovery. The backup media generated by these error recovery strategies are subsequently taken off site and extended to meet disaster recovery requirements mandated by regulatory bodies or to support limited recovery for only a few weeks or long enough to close the business. Most of these strategies focused on the risks associated with a site- or institution-specific outage or disaster.
Re-Evaluating B&R Strategies
Enterprises of all sizes, from small businesses to large financial services firms, are re-evaluating backup and recovery strategies. This is driven by several, highly correlated factors. The terrorist attacks of September 11, 2001, and the North East power grid failure of August 14, 2003 demonstrated, firstly, that regional disasters must be considered as likely as localized institutional disasters--which calls into question the validity of over-subscription in the first-come, first-served shared services disaster recovery market, such as IBM, EDS, and SunGard. Secondly, these events demonstrated that in the age of e-business, same-day settlement, and real-time Internet transactions, computer systems are no longer business facilitation tools, but rather fundamental components of the ability to perform business at all. This calls into question the pervasive belief that long-term recovery strategies can rely on the ability to close the business through access to sufficient books and records. These factors are evident in the Interagency White Paper on Sound Practices to Strengthen the Resilience of the U.S. Financial System, which addresses business resiliency in the financial services sector, and was co-authored by the Federal Reserve Board, the Office of the Comptroller of the Currency, and the Securities and Exchange Commission.
Internal providers of data center services are increasingly challenged to address a diverse set of changing requirements and subjective interpretations. The key to success in this endeavor, however, is to ensure that the solutions are well mapped to the business requirements. Inasmuch as business requirements are potentially subject to significant change, as indicated above, the first step in the architecture of a functional availability and resiliency model is to perform a Business Impact Analysis or BIA.
Business Impact Analysis
The BIA provides identification of critical business processes required to operate the business. These business processes are then mapped to the underlying applications, which, in turn, allow identification of systems, databases, and files required to operate the business. Within the context of the BIA, all business processes and applications seldom share a common level of criticality, requiring the imposition of tiers of availability; for example, some systems must be operational within 30 minutes of an event, and other systems must be operational within two business days. An important factor for the enterprise to consider during the BIA is the dollar value of the business to be recovered. Backup and recovery systems mitigate risk of loss, but only at the cost of implementation. A well-understood cost model of the business to be protected is critical for valid interpretation of the costs to deploy a robust recovery system.
RTO, RPO and MPO
The definition of these tiers of availability and resiliency provide the foundation for a functional architecture. The key data points that differentiate the tiers of availability are the RTO, RPO, and MPO. The RTO is the maximum time between an event and the time at which a system must be returned to operation, or Recovery Time Objective. The RPO is the maximum allowable data loss, or Recovery Point Objective. The MPO, or Maintenance Point Objective, is an additional but less commonly articulated metric that describes the maximum allowable window for the performance of system maintenance.
The RTO, RPO and MPO are critical data points to consider during architecture of an availability and resiliency model. As previously discussed, tape-based backup and recovery systems can be considered as point-source solutions on this continuum. Tape-based backup systems, however, do not easily accommodate a RTO of less then four hours, due to the time required to physically retrieve the data from tape, nor do they easily accommodate a RPO requirement of less than 24 hours for similar reasons.
CXO UnpluggedSmart Business interviews on BNET
Brought to you by CBS MoneyWatch.com
- Best- and Worst-Paid College Degrees
- 6 Things You Should Never Do on Twitter or Facebook
- How Much Sleep Do You Really Need?
- 6 Big Myths about Gas Mileage
- 5 Rules for Immediate Annuities
- Death in the Family: 12 Things to Do Now
- Dumbest Things You Do With Your Money
- 6 Online Networking Mistakes to Avoid
- 401(k) Mistakes to Avoid
- 5 Economic Scenarios to Keep You Up at Night
- The Real ‘Best Places to Retire’
- Best Credit Cards for You
- 12 Tough Questions to Ask Your Parents
- The Real ‘Best Colleges’
- Home Buyer Tax Credit: How to Cash In
- Why You Shouldn't Bash Cash
- 8 Phony 'Bargains' and Better Alternatives
- Danger: 3 Debit Card Scams to Avoid
- 6 Myths About Gas Mileage
- 29 Fees We Hate Most
- Quick and Easy Ways to Boost Returns
- Best Stocks to Buy Now
- Lower Your Taxes: 10 Moves to Make Now
- New Jobs: 8 Lessons from Real-Life Career Switchers
- The New Job Market: Who Wins and Who Loses?
- Health Care Reform's Public Option: Everything You Need to Know
- Volunteer Work When Unemployed: Should You Work for Free?
- Whose Recovery Is This?
- Long-Term-Care Insurance: 4 Biggest Risks to Avoid
Content provided in partnership with
Most Recent Technology Articles
Most Recent Technology Publications
Most Popular Technology Articles
- Building cost comparison between conventional and formwork system: a case study of four-storey school buildings in Malaysia
- Speed control of separately excited DC motor
- Failed businesses in Japan: a study of how different companies have failed, and tips on how to succeed, in the Japanese market
- BizRate to monitor in-store customer satisfaction for Office Depot stores - Market Intelligence
- Political stability and economic growth in Asia




