This work investigates feasibility issues that must be addressed when threshold-based mechanisms are to be used for diagnostic purposes in COTS-based distributed systems. Threshold based mechanisms have typically been used for such purposes in embedded systems. A variety of solutions exist, with different characteristics of completeness, accuracy, and induced overhead. We first discuss the challenges related to applying such mechanisms to COTS-based distributed applications. We then identify alternative strategies for diagnosis, which use run-time data on COTS component service failures to trigger alarms to reconfiguration and fault treatment mechanisms. We implement those strategies in a system prototype, which is based on a substantial application, i.e. a real world (as opposed to a toy) application. We discuss the relationships between the sensitivity of the quality of service (QoS) provided by the diagnostic mechanisms and the accuracy of the available failure data. Our considerations and preliminary experiments on the prototype suggest that a careful evaluation of tradeoffs must be conducted, in order to achieve the best compromise between accuracy and cost, which depends on application characteristics, and service deployment requirements.

IMPLEMENTATION OF THRESHOLD-BASED DIAGNOSTIC MECHANISMS FOR COTS-BASED APPLICATIONS / L. ROMANO; S. CHIARADONNA; A. BONDAVALLI; D. COTRONEO. - STAMPA. - (2002), pp. 296-303. (Intervento presentato al convegno IEEE SRDS 2002, 21ST IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED DYSTEMS tenutosi a OSAKA) [10.1109/RELDIS.2002.1180200].

IMPLEMENTATION OF THRESHOLD-BASED DIAGNOSTIC MECHANISMS FOR COTS-BASED APPLICATIONS

BONDAVALLI, ANDREA;
2002

Abstract

This work investigates feasibility issues that must be addressed when threshold-based mechanisms are to be used for diagnostic purposes in COTS-based distributed systems. Threshold based mechanisms have typically been used for such purposes in embedded systems. A variety of solutions exist, with different characteristics of completeness, accuracy, and induced overhead. We first discuss the challenges related to applying such mechanisms to COTS-based distributed applications. We then identify alternative strategies for diagnosis, which use run-time data on COTS component service failures to trigger alarms to reconfiguration and fault treatment mechanisms. We implement those strategies in a system prototype, which is based on a substantial application, i.e. a real world (as opposed to a toy) application. We discuss the relationships between the sensitivity of the quality of service (QoS) provided by the diagnostic mechanisms and the accuracy of the available failure data. Our considerations and preliminary experiments on the prototype suggest that a careful evaluation of tradeoffs must be conducted, in order to achieve the best compromise between accuracy and cost, which depends on application characteristics, and service deployment requirements.
2002
Reliable Distributed Systems, 2002. Proceedings. 21st IEEE Symposium on
IEEE SRDS 2002, 21ST IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED DYSTEMS
OSAKA
L. ROMANO; S. CHIARADONNA; A. BONDAVALLI; D. COTRONEO
File in questo prodotto:
File Dimensione Formato  
Romano02.pdf

Accesso chiuso

Tipologia: Versione finale referata (Postprint, Accepted manuscript)
Licenza: Tutti i diritti riservati
Dimensione 559.33 kB
Formato Adobe PDF
559.33 kB Adobe PDF   Richiedi una copia

I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificatore per citare o creare un link a questa risorsa: https://hdl.handle.net/2158/834
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 1
social impact