File sharing over the WAN - Storage Networking

Computer Technology Review, March, 2003 by Trevor Hughes

File sharing over local area networks (LANs) has become an integral part of enterprise computing. Over 20 years ago, personal computer users realized the benefits of abandoning "sneakernet" by connecting individual computers into a network. This allowed more efficient file sharing and project collaboration for members of a workgroup who were only a cable length away, typically in the same room or building. Now there is a desire for global organizations to share file data in real time among all their locations.

These global organizations now wish to implement wide area network (WAN) based storage consolidation, to reduce storage resource management costs while also enabling remote employees to collaborate on projects just as if they were all on a local network. Given the current state of technology, however, enterprises are forced to limit the deployment of real time file sharing and networks to within each LAN, resulting in disconnected islands of file sharing throughout the enterprise.

Corporations have tried to fix this problem in various ways. For large files or large groups of files, companies may file transfer protocol (FTP) the files overnight. For ad hoc collaboration between employees, people might attach files to email messages. While these solutions are marginally acceptable, they each suffer from the same problem. The file sharing is not in real time with everyone working off the same version of the data. Rather than sharing a single copy among many users, these methods duplicate files and propagate private copies to each user. The moment a file is emailed or FTP'ed, two or more potentially out of sync copies are created. The dissimilar versions need to be manually reconciled, usually after every set of revisions by the people working on the project.

The pain users feel sharing files over the WAN causes frustration and reduces productivity. In response, users have developed elaborate workaround solutions. The waste in end user time is only one of the costs. As increasingly divergent versions of employee work product are saved in storage resources throughout the corporation, a storage resource management problem is created for the IT staff. Not only is the overall amount of storage managed greater than necessary, it also creates disaster recovery, backup and restore difficulties. These issues result in greater capital outlay and put pressure on the bottom line.

These approaches have been used because there has been no better way to accomplish the task. A better way would be to use the same protocols and methods over the wide area as the local. This would allow people and applications to work together as they always have, coherently and in real time.

Network File System (NFS) and common Internet file systems (CIFS) are network file sharing protocols used by Unix and Microsoft Windows operating systems, respectively. However experiments in extending NFS and CIFS over the wide area have met with great frustration.

These traditional file-sharing protocols were designed to share files over a local network, but perform poorly and prove unreliable in a WAN setting. Reading and writing files using standard file sharing protocols over WANs is orders of magnitude slower than over LANs. While this difference is dramatic, it is also a best-case scenario. In the real world, wide area networks are far less reliable than local area networks. The response times--both for opening or writing files--is, invariably, unacceptably slow.

That is because these protocols are "chatty:" the client and server frequently send messages back and forth to stay synchronized. In addition, these messages must receive an answer before the next message is sent, further slowing the response tune. There is a common misconception that adding bandwidth to the WAN will solve this problem. Bandwidth is only one variable, and not the most significant. The large delay in sharing files over long distances is actually caused by the speed of light coupled with the synchronous nature of the protocols.

The limitation imposed by the speed of light (plus the additional delays introduced by switching equipment) causes latency. In other words, it's not the size of the "pipe" that matters; it's how long the trip takes that creates the delay. For example, while one can fit more people in a 767 jumbo-jet than in a Gulfstream, both will still take the same amount of time to cross the country.

Remote file sharing protocols like CIFS and NFS are especially sensitive to latency. Each message incurs a latency penalty because each message causes the protocol to wait for a response. Although the latency penalty for each individual message and response can be small for WAN circuits (20-30ms for local Campus, 120ms for NYC to UK access), the number of messages and responses made during the reading or writing, of even a small file, add up to make the total delay quite large.

For example, saving an edited 2MB MS Word document normally takes only four seconds over a LAN. The same operation could suffer a latency penalty of more than two minutes over a very fast 1MB WAN circuit that has an 80ms delay--a delay that most users would consider to be unacceptable.


 

BNET TalkbackShare your ideas and expertise on this topic

Please add your comment:

  1. You are currently: a Guest |
  2.  

Basic HTML tags that work in comments are: bold (<b></b>), italic (<i></i>), underline (<u></u>), and hyperlink (<a href></a)

advertisement
CXO UnpluggedSmart Business interviews on BNET

See and hear how senior level executives across the Asia Pacific are developing smart business ideas across a variety of sectors. The focus is on the future, and on how businesses need to evolve.

advertisement
  • Click Here
  • Click Here
  • Click Here
advertisement
Click Here

Content provided in partnership with Thompson Gale