More information is attainable faster - Communication

USA Today (Society for the Advancement of Education), June, 2003

A message-passing library that makes it possible to extract optimum performance from both workstation and personal computer clusters, as well as from large massively parallel supercomputers, has been developed by researchers at the U.S. Department of Energy's Ames (Iowa) Laboratory. The library, called MP_Lite, supports and enhances the basic capabilities that most software programs require to communicate between computers.

Although MP_Lite could be scaled up easily, its objective is not to provide all the capabilities of the full message-passing interface standard. MPI is a widely used model that standardizes the syntax and functionality for message-passing programs, allowing a uniform interface from the application to the underlying communication network. Parallel libraries that offer the full MPI standard ease programming problems by reducing the need to repeat work, such as defining consistent data structures, data layouts, and methods that implement key algorithms.

"Our goal with MP_Lite is to illustrate how to get better performance in a portable and user-friendly manner and to understand exactly where any inefficiencies in the MPI standard may be coming from," explains David Turner, an Ames Laboratory assistant scientist and the principal investigator working on the MP_Lite project. He notes that the MP_Lite library is smaller and much easier to work with than full MPI libraries. "It's ideal for performing message-passing research that may eventually be used to improve full MPI implementations and possibly influence the MPI standard"

Turner says that it was "mainly frustration" that led him to develop the MP_Lite library. "Most message-passing packages are large and clunky to work with, and are often difficult to install. If you run into any errors at all, they give you very cryptic messages that mean nothing unless you actually wrote the library. So a lot of the reason I got into the project was not just to improve the efficiency, but also to make the message-passing more user-friendly."

Offering an example, Turner points out that, "If two processors are communicating, and one waits a minute for a response from the other one--well, a minute is a very long time in this context--the library should put out a warning into a log file. But that's something that's not done. Most message-passing systems don't tell you what's wrong if a communication buffer overflows or a node is waiting for a message that never gets sent.

"What if there's a five-minute wait for a message? Something is probably frozen up; so at that point the library should implement an abort and give the user as much information about the current state of the system as possible." MP_Lite operates with minimal buffering and warns users if there are any potential problems. When possible, MP_Lite will dump warnings to a log file and eventually time-out when a lock-up occurs. "There's a lot of these user-friendly aspects that I'd like to see put into other message-passing systems."

COPYRIGHT 2003 Society for the Advancement of Education
COPYRIGHT 2003 Gale Group

 

BNET TalkbackShare your ideas and expertise on this topic

Please add your comment:

  1. You are currently: a Guest |
  2.  

Basic HTML tags that work in comments are: bold (<b></b>), italic (<i></i>), underline (<u></u>), and hyperlink (<a href></a)

advertisement
advertisement
  • Click Here
  • Click Here
  • Click Here
advertisement

Content provided in partnership with Thompson Gale