Fault Detection and Tolerance in Cluster of Workstations using Message Passing Interface
Journal Title: Sir Syed University Reseacrh Journal of Engineering and Technology - Year 2011, Vol 1, Issue 1
Abstract
A Cluster of Workstations (COW) is network based multi-computer system aimed to replace supercomputers. A cluster of workstations works on Divisible Load Theory (DLT) according to which a job is divided into n subtasks and delegated to n workstations in the COW architecture. To get the job completed, all subtasks must be completed. Therefore, for satisfactory job completion, all workstations must be functional. However, a faulty node can suspend the overall job completion task until and unless some fault avoidance and correction measures are taken. This paper presents a fault detection and fault tolerant algorithm which will use Message Passing Interface (MPI) to identify faulty workstations and transfer the subtask being performed by them to a normally working workstation. The assigned workstations will continue their original subtasks in addition to assigned subtasks on time sharing basis.
Authors and Affiliations
Syed Misbahuddin
Maximum Likelihood Decoder for Variable Length Codes
Variable Length Codes (VLC) are used to transfer same amount of digital information in relatively short period of time. In variable length coding, the characters with higher probability of occurrence are assigned shorter...
Deployment of Sensors to Optimize the Network Coverage Using Genetic Algorithm
Wireless Sensor Networks (WSNs) are commonly used in various pervasive applications. Wireless communication is the fastest growing segment of the communication industry that has captured attention of the media and imagin...
Wireless Security Threats
Wireless Communication Technologies has completely revolutionized the world. Wireless Communication Technologies provide ease to the users such as portability of the devices and mobile access to the internet. These porta...
RF Based Wireless Fire Security System for Hospitals
Wireless Fire Security System increases the secu-rity against fire as compared to conventional fire security systems. Conventional fire security systems are wired and they usually contain smoke detectors and control pane...
Optimizing the Material, Inter-distance and Temperature Effect of Intramuscular Electrodes used to Stimulate the Thoracic Diaphragm
The technique of electrically stimulating the thoracic diaphragm is conducted by implanting a diaphragmatic pacemaker, in which the phrenic nerve is stimulated, resulting in stimulating the diaphragm. A diaphragmatic pac...