Distributed System Failures Essay

Custom Student Mr. Teacher ENG 1001-04 1 June 2016

Distributed System Failures

There are four types of failures that may be encountered when operating within a distributed system. Hardware failures occur when a single component within the system fails. Network failures are failures of links within the distributed system's network. Application failures are failures of the applications that run within the system, and occur when an application stops working or operates incorrectly. Synchronization failures occur when different points in the system do not synchronize correctly. Hardware and application failures may occur within centralized systems as well as distributed systems. In the event of an application failure, it is important first to differentiate between operator error and software error in order to determine the point of failure. A hardware error can usually be traced to a few simple causes. The most common types of hardware failure are the failure of a link, the failure of a site, or the loss of a message. At one point hardware failures were a common occurrence, but with recent innovations in hardware design and manufacturing they now tend to be few and far between.

Instead, more of the failures that now occur tend to be network or drive related. Network failures are failures of links within the distributed system's network. Processors within a distributed system need to be able to communicate with each other over the network. When a link in this network fails, the functions that depend on it cease running. Application failures are failures of the applications that run within the system, and occur when an application stops working or operates incorrectly. These failures may be caused by a variety of issues, including software bugs. Because software has numerous potential points of failure, the problem can be hard to replicate and solve. Synchronization failures occur when different points in the system do not synchronize correctly. When individual processors in the distributed system fail to synchronize, processes that require two or more processors to complete instead become delayed or fail. Hardware and application failures may occur within centralized systems as well as distributed systems. A hardware failure in a centralized system can be catastrophic if it occurs at the hub, since this would affect operations on all outlying machines.
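One common way for processors to notice that a link or a peer has stopped responding is a heartbeat timeout. The following is a minimal sketch of that idea, not a method taken from the essay's source: the class name HeartbeatMonitor, the peer names, and the three-second timeout are all hypothetical, and time is passed in explicitly so the example stays deterministic.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class HeartbeatMonitor:
    """Tracks when each peer was last heard from (hypothetical helper)."""

    timeout: float  # seconds of silence before a peer is suspected
    last_seen: Dict[str, float] = field(default_factory=dict)

    def record(self, peer: str, now: float) -> None:
        # Called whenever a heartbeat message from `peer` arrives.
        self.last_seen[peer] = now

    def suspected(self, now: float) -> List[str]:
        # Peers whose silence exceeds the timeout. Silence alone cannot
        # tell a failed link apart from a failed site or a lost message.
        return sorted(
            peer for peer, seen in self.last_seen.items()
            if now - seen > self.timeout
        )


monitor = HeartbeatMonitor(timeout=3.0)  # assumed timeout value
monitor.record("node-a", now=0.0)
monitor.record("node-b", now=0.0)
monitor.record("node-a", now=4.0)  # node-b stays silent
print(monitor.suspected(now=5.0))  # ['node-b']
```

The monitor only reports suspicion. Deciding whether the cause is a link, a site, or a lost message still requires further investigation.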

An application failure at the hub could likewise prove more troublesome than one at an access point. Although application errors can usually be attributed to faulty code or a bug, they can also result from operator error. In the event of an application failure, it is important first to differentiate between operator error and software error in order to determine the point of failure. If the failure is due to operator error, the fix may be as simple as training the operator who made the error. In the case of an actual software error, you must first be able to replicate the problem in order to determine how to fix it. By replicating the issue, you can narrow down where the bug may be occurring. A hardware error can usually be traced to a few simple causes. The general rule is that the more hardware there is in the system, the more likely a failure is to occur.
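The general rule about hardware count can be illustrated with a short calculation. This sketch assumes, for illustration only, that every component fails independently with the same probability p. Under that assumption the chance that at least one of n components fails is 1 - (1 - p)^n. The function name and the 1% failure probability are hypothetical.

```python
def prob_any_failure(p: float, n: int) -> float:
    """Probability that at least one of n independent components fails,
    given each fails with probability p (an illustrative assumption)."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be between 0 and 1")
    if n < 0:
        raise ValueError("n must be non-negative")
    return 1.0 - (1.0 - p) ** n


# With an assumed 1% failure chance per component, the risk grows with size.
for n in (1, 10, 100):
    print(n, round(prob_any_failure(0.01, n), 3))
```

Real components rarely fail independently or at equal rates, so the figures indicate the trend rather than a prediction.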

By replicating the process that took place when the failure occurred, it is possible to determine which piece of hardware is malfunctioning. Although failures occur within both centralized and distributed systems, the cause can usually be determined by following a procedure for replicating the processes involved, though each type of failure presents a unique challenge. Hardware failures can usually be repaired just by replacing a faulty piece of equipment, while application failures are a little more complicated. Network and synchronization failures may span the network, which can delay processes and make faulty processes harder to replicate.

