Distributed System Failures

There are four types of failures that may be encountered when using and operating within a distributed system. Hardware failures occur when a single component within the system fails. Network failures refer to the failure of links within the distributed system network. Application failure occur to the failure of applications that run within the system, and can occur when the application stops working or operates incorrectly. Failure of synchronization occurs when different points in the system do not synchronize correctly. Both hardware and application failures may also occur within a centralized system as well as distributed systems.

In the event of an application failure, it is important to first be able to differentiate between operator error and software error in order to determine the point of failure. When a hardware error occurs, this can be due to a few simple causes. Hardware failures occur when a single component within the system fails. The most common types of hardware failures are of a link, a site, or the loss of a message.

Get quality help now
Doctor Jennifer
Verified writer

Proficient in: Failure

5 (893)

“ Thank you so much for accepting my assignment the night before it was due. I look forward to working with you moving forward ”

+84 relevant experts are online
Hire writer

At one point hardware failures were a common occurrence, but with recent innovations in hardware design and manufacturing these failures tend to be few and far between.

Instead, more failures that now occur tend to be network or drive related. Network failures refer to the failure of links within the distributed system network. Processors within a distributed system need to be able to communicate with each other via the network. When a link in this network connection fails, this causes functions to cease running.

Get to Know The Price Estimate For Your Paper
Topic
Number of pages
Email Invalid email

By clicking “Check Writers’ Offers”, you agree to our terms of service and privacy policy. We’ll occasionally send you promo and account related email

"You must agree to out terms of services and privacy policy"
Check writers' offers

You won’t be charged yet!

Application failure occur to the failure of applications that run within the system, and can occur when the application stops working or operates incorrectly. These failures may be caused by a variety of issues, including software bugs. Because there are numerous fail points in software issues, the problem can be hard to replicated and solve. Failure of synchronization occurs when different points in the system do not synchronize correctly. When individual processors in the distributed system fail to synchronize, processes that require two or more processors to complete successfully instead become delayed or fail. Both hardware and application failures may also occur within a centralized system as well as distributed systems. A hardware failure in a centralized system can be catastrophic if it were to occur at the hub, since this would affect operations to all outlying machines.

An application failure in the hub as well could prove to be more troublesome that if it were to occur in an access point. Although application errors can usually be attributed to faulty code or a bug, it can also occur due to operator error. In the event of an application failure, it is important to first be able to differentiate between operator error and software error in order to determine the point of failure. If the failure is due to operator error, the fix is something as simple as training of the operator who initially made the error. In the case of actual software error, in order to determine the course of action to fix the issue first you must be able to replicate the problem. By replicating the issue, you are then able to narrow down where the bug may be occurring. When a hardware error occurs, this can be due to a few simple causes. The general rule is that the more hardware there is present in the system, the more likely for a failure to occur.

By replicating the process that took place when the failure occurred, it is then possible to be able to determine what faulty piece of hardware is malfunctioning. Although failures occur within both centralized and distributed systems, by following procedure in replicating the processes the cause of the failure can usually be determined, though each type of failure presents a unique challenge. Hardware failures can usually be repair just by replacing a faulty piece of equipment, while application failures are a little more complicated. Network and failure of synchronization errors may span through the network, which can cause delay in processes as well as being able to replicate faulty processes.

References
Distributed System Failure Types. (2014, April). Studymode.com. Retrieved from http://www.studymode.com/essays/Distributed-System-Failure-Types-1602939.html

Cite this page

Distributed System Failures. (2016, Jun 01). Retrieved from https://studymoose.com/distributed-system-failures-essay

👋 Hi! I’m your smart assistant Amy!

Don’t know where to start? Type your requirements and I’ll connect you to an academic expert within 3 minutes.

get help with your assignment