[t]he capability of a computer system, after any malfunction of any of its hardware or 'software' components, to continue to operate without human intervention, at a given level of service that provides: continuity of operation, data integrity, and recovery of service within a given time.[3]
Fault tolerance is generally focused on mitigating the impacts of non-malicious events such as accidents and random failures. New principles need to be added to the concept in order to develop systems that are resilient in the face of malicious activity and hostile attacks. In a highly distributed system environment such as the Internet, component and node failures are common. Resilient systems (also referred to as “fail-secure” systems in the context of IT security) that retain their security properties amid component failures could mitigate potential risks that may arise as a result of such failures. Systems designed to maintain predictable timeliness properties must also be resilient against denial of service attacks and disruption of system resources.
References[]
↑ISO/IEC Standard 9126 (1991), revised by ISO/IEC 25010:2011.