CERT
Back to [5]   [6]    Forwards to [7]



ELECTRONIC QUARANTINE: An Automated Intruder Response Tool

Paul Brutch, Tasneem Brutch, and Udo Pooch
Department of Computer Science
Texas A&M University
paulb, tasneem, pooch @cs.tamu.edu

In "Survivable Network Systems: An Emerging Discipline", Ellison, Fisher, Linger, et. al. organize the survivability services into four general categories: resistance, recognition, recovery, and adaptation and evolution [Ellison 97].  It is our contention that intruder response and damage assessment should be treated as separate survivability services.  We propose six key services necessary for survivability, and ordered these services based upon the time when each service is required.  We also present our research in the area of intruder response.

Introduction
Linger, Mead, and Lipson define survivability as the capability of a system to continue providing essential services after a successful intrusion and compromise, and the ability to recover full services in an acceptable period of time [Linger 97].  Survivability may be viewed as a composite of many disciplines such as security, fault tolerance, dependability, and software reliability.  Neumann and Porras state that when designing a survivable information infrastructure, we need to understand how to specify and integrate key properties from these various fields [Neumann 97].

In computer security, available tools provide automated capabilities in one or more of the following areas: intruder prevention, intruder detection, and damage assessment.  Many intrusion detection tools provide some form of automated intruder response, but few security tools perform any automated recovery.  Ellison, Fisher, Linger, et. al. define the four key properties that survivable systems must exhibit as: resistance to attacks; recognition of attacks and the extent of damages; recovery of full and essential services after attack; and adaptation and evolution to reduce effectiveness of future attacks [Ellison 97].  Anderson and Lee list the four phases of fault tolerance as: error detection, damage confinement and assessment, error recovery, and fault treatment and continued service [Anderson 81].  Although there are some similarities in the requirements for computer security, survivability, and fault tolerance, significant difference do exist between these three disciplines.

As proposed by Ellison, Fisher, Linger, et. al., the four survivability services can be categorized as: resistance, recognition, recovery, and adaptation and evolution [Ellison 97].  It is our contention that intruder response and damage assessment should also be separate survivability services.  We propose six key services for survivability, and order these key services based upon the time when each service is required.

Intruder Response and Damage Assessment Services
Intruder response and damage assessment are essential services necessary for the provision of information survivability.  If a system is to survive a potentially damaging event, it must first be able to recognize that such an event is occurring.  Next, the system must try to respond to the intrusion and confine the effects of the damaging event.  After the damaging event has ended, the process of damage assessment can be performed.  The goal of damage assessment is to determine which portions of the system must be restored in order to recover full and essential services.

In fault tolerant computing once the occurrence of a fault has been detected, the errors resulting from that fault must be confined from spreading through the system.  The term damage confinement is used to define this process.  Constraints are placed on the flow of information within the system to confine these errors.  Unfortunately, some time may elapse before a fault manifests itself as an error and the error is detected.  So in addition to confining the errors from spreading through the system, we need to assess what damage has been caused to the system as a result of the fault.  In performing damage assessment, the system is evaluated to identify the boundaries through which the errors have not flowed.  The extent of the damage then must reside inside these system boundaries.

In computer security, intruder response is performed during an attack in order to stop an intruder from damaging the system.  A number of automated intruder responses have been implemented as part of intrusion detection systems.  Some responses may be active such as terminating a process or closing a connection.  Other responses are passive such as sending an e-mail to the system administrator.  Damage assessment is normally performed after an attack.  A number of vulnerability scanning tools such as "Tiger" may be used to perform damage assessment.  Other tools such as the "Tripwire" were specifically developed to aid in damage assessment.  At Texas A&M, a prototype tool called the Automated Incident Response System (AIRS) [Fisch 97] was developed to perform damage control and and damage assessment on individual hosts in a network.  Fisch, White, and Pooch state that the AIRS primary function is to provide protection of the local system [Fisch97].

Proposed Six Key Properties of Survivable Systems
Table 1. presents our proposed six key properties of survivable systems ordered based on time.  Table 1 is a modification to the four properties of survivability proposed by Ellison, Fisher, Linger, et. al. [Ellison 97] and our modifications to their original table are shown in italics.

Table 1. The Six Key Properties of Survivable Systems.
(Modified from "Survivable Network Systems: An Emerging Discipline" [Ellison 97])
Key Property Description
Resistance to attacks strategies for preventing attacks from occurring
strategies for repelling attacks
Recognition of attacks strategies for detecting an attack and understanding the current state of the system
Intruder Response strategies for responding to an intruder in order to limit the extent of damage during an attack
Damage Assessment strategies for evaluating the extent of damage
Recovery of full and essential services after attack strategies for restoring compromised information or functionality, maintaining or if necessary restoring essential services within the mission time constraints
Adaptation and evolution to reduce effectiveness of future attacks strategies for improving system survivability based on knowledge gained from intrusions.

Electronic Quarantine
Our research focuses on the area of intruder response.  We are developing an electronic quarantine to provide a dynamic response capability to detected intrusions. The electronic quarantine attempts to automatically confine an intruder and the damaged caused by the intrusion to the compromised host within a bounded network.  Although the electronic quarantine does not provide all of the requirements necessary for survivability, we believe that it makes a contribution to this emerging discipline.  The electronic quarantine provides a bounded network with the capability to respond to an intrusion and adapt by modifying the existing network service access controls within the network to quarantine the compromised host(s).  For a survivable infrastructure, this capability needs to be extended to an unbounded network.

The goal of the electronic quarantine is to dynamically and without need for human intervention isolate a compromised system from hosts inside the bounded network and the Internet [Brutch 97].  Often this means logically isolating the compromised host from the rest of the bounded network until the compromised host can be fully restored.  If the compromised host is a server which supports a critical application, then a complete quarantine of this server would not be an acceptable solution.  For some critical applications, however, the availability of the service may be more important than the integrity or confidentiality of the information.  In this situation, the electronic quarantine could permit remote users to access the server only for this critical application.  All other remote access to the compromised server would be denied, and no user on the server's console would be able to connect to any other host in the bounded network or the Internet.

Electronic Quarantine Concept
The electronic quarantine builds on the capabilities of existing tools which have been developed at Texas A&M.  The first tool is the Basic Network Security Tool (BNST) [Brutch 96].  The BNST provides a graphical user interface to aid security administrators in implementing their organization's network service access policy [Brutch 96].  A prototype of the BNST was developed, but this tool is currently being redesigned.  The new tool called the Security Administration and Network Configuration Tool for UNIX Machines (SANCTUM) accepts inputs from network administrators, host administrators, and authorized end users in order to evaluate and implement an organization's network service access policy.  In addition, SANCTUM also accept input from Intrusion Detection Systems (IDS) concerning the status of monitored hosts within the bounded network.

We also intend to use the capabilities provided by intrusion detection systems (IDS).  A prototype IDS called the Cooperating Security Managers (CSM) [White 96] was developed at Texas A&M.  The CSM not only maintains a suspicion level for each user on the host but also evaluates the overall suspicion level of the entire system.  White, Fisch, and Pooch state that the system's suspicion level is a function of the suspicion level of all current users [White 96].  Whenever the suspicion level for a host has reached a predefined threshold value, the electronic quarantine will consider that host compromised.  White, Fisch, and Pooch state that the design of the CSM system dictates that a CSM be run on each host in the bounded network, and that autonomous CSMs communicate with each other to track users as they travel between the monitored hosts [White 96].

The electronic quarantine concept requires the use of host-based intrusion detection systems, which perform real-time activity monitoring, and maintain a suspicion level for each user as well as an overall suspicion level of the monitored host.  It is also assumed that the intrusion detection system can provide active responses such as terminating processes, closing connections, and disabling accounts.  Although not absolutely required, the ability of host-based intrusion detection systems to cooperate and share information in order to track users as they connect to other monitored hosts, is also important.   The electronic quarantine also requires that the organization has at least an Internet firewall and that each host has a sever filter such, as "TCP Wrapper" installed to perform network service access control.

Since the majority of intrusions are caused by inside users, we will explain how an electronic quarantine is implemented for a server providing a critical service when it is compromised by an intruder on the server's console.  When the intrusion detection system on the server determines that the server has been compromised, it first needs to determine which account is responsible.  Once the intruder is identified, the intrusion detection system will terminate all processes and close all connections of the intruder.  The intrusion detection system will then disable the intruder's account and contact SANCTUM, which resides on a centralized security server in the bounded network.  SANCTUM will evaluate a new organizational network service access policy which quarantines the compromised server while allowing remote access only for the critical service.  SANCTUM does this by developing a firewall access policy which blocks all incoming Internet traffic to the compromised server, except for the critical service.  SANCTUM also develops a network service access policy for the compromised server which denies all remote access, except for the critical service.  One exception to this rule is that the compromised server will allow continued access from the centralized security server on which SANCTUM resides.  SANCTUM also develops network service access polices for each of the monitored hosts in the bounded network in order to deny any connection from the compromised server.  In addition, SANCTUM develops a new firewall access policy in order to block any connections from the compromised server to the Internet.

The network service access policies developed by SANCTUM will be sent to each component in the bounded network which provides network service access control.   When a component, such as a host or a firewall, receives a new network service access policy, that policy must be implemented.  Currently this is performed manually by the BNST system.  SANCTUM must provide the capability to automatically implement these new host and firewall access control policies without the need for human intervention.

If the attack occurs from another host within the bounded network, similar actions are taken except that the intruder's presence must also be removed from the host on which the attack originated.  Intrusion detection systems that cooperate to provide user tracking, such as the CSM, can effectively terminate an intruder's sessions and disable their account over multiple hosts.  One of the goals of the electronic quarantine is to ensure that the intruder can not access the compromised host again in order to exploit any back doors which may have been left.

Damage Assessment and Recovery
Although the electronic quarantine provides for the continued access to critical services, until damage assessment and recovery have been performed essential properties such as integrity and confidentiality may not be preserved.  In some missions, the availability of the service may be more important the actual integrity or confidentiality of the information.  Although automated damage assessment and recovery are not part of the electronic quarantine concept, these capabilities are necessary to provide some measure of survivability.  Once damage assessment has been performed and the full and essential services on the compromised host are recovered, the electronic quarantine can be lifted.  This is performed by instructing SANCTUM to implement a new organizational network service access policy allowing normal network service access to/from the "cured" host.


References

Anderson 81
Anderson, T.; and Lee, P., "Fault Tolerance: Principles and Practices", Prentice Hall International, 1981.

Brutch 96
Brutch, P.; McDonald, M.; Crotwell, L.; Marti, W.; and Pooch, U., "Basic Network Security Tool", http://www.cs.tamu.edu, Technical Report TR 96-033, 1 October 1996.

Brutch 97
Brutch, P.; Marti, W., White, G.; Pooch, U.; and Pradhan, D.,"Intruder Containment: An Automated Method of Response to Potential Security Incidents", 9th Annual FIRST Conference and Workshop on Computer Security Incident Handling and Response, 23 -27 June 1997.

Ellison 97
Ellison, R.; Fisher, D.; Linger, R. et. al., "Survivable Network Systems: An Emerging Discipline", CMU/SEI-97-TR-013, Carnegie Mellon University, http://www.cert.org/research, November 1997.

Fisch 97
Fisch, E.; Pooch, U.; and White, G., "The Design and Creation of a UNIX Based Automated Incident Response System", 9th Annual FIRST Conference and Workshop on Computer Security Incident Handling and Response, 23 -27 June 1997.

Linger 97
Linger, R.; Mead N.; Lipson. H., "Requirements Definition for Survivable Network Systems", http://www.cert.org/research, 1997.

Neumann 97
Neumann, P.; and Porras, P., "A Global View of Information Survivability" 1997 Information Survivability Workshop - ISW 97, February 12-13 1997.

White 96
White, G.; Fisch, E.; and Pooch, U., "Cooperating Security Managers: A Peer-Based Intrusion Detection System", IEEE Network, Jan./Feb. 1996.


TCP Wrapper is available at (ftp://coast.cs.purdue.edu/pub/tools/unix/tcp_wrappers).

Tiger is available at (ftp://net.tamu.edu/pub/security/TAMU).

Tripwire is available at (ftp://coast.cs.purdue.edu/pub/COAST/Tripwire).


Back to the Table of Contents
Back to [5]   [6]    Forwards to [7]