CHAPTER 16
Incident Response and Threat Management
Updated: May 15, 2026
25 min read
# CHAPTER 16
Incident Response and Threat Management
1. Introduction
The core tenet of modern cybersecurity is "Assume Breach." No matter how robust your network architecture is, a sophisticated attacker will eventually bypass the perimeter. When the SIEM alarms trigger, indicating that an attacker is actively moving laterally across the corporate network, chaos is not an option. Incident Response (IR) is the structured, systematic approach to handling and managing the aftermath of a security breach. In this chapter, we will explore the National Institute of Standards and Technology (NIST) Incident Response Lifecycle, focusing on containment, eradication, and forensic preservation.2. Learning Objectives
By the end of this chapter, you will be able to:- Define Incident Response and the role of the CSIRT.
- Understand the 6 phases of the NIST Incident Response Lifecycle.
- Differentiate between a security event and a security incident.
- Understand the critical importance of proper Containment.
- Explain the concept of digital chain of custody.
3. Beginner-Friendly Explanation
Imagine a fire breaking out in a large office building.- Preparation: Buying fire extinguishers and holding fire drills *before* a fire happens.
- Detection: The smoke alarm goes off.
- Containment: Closing the fire doors to ensure the fire in the kitchen doesn't spread to the offices.
- Eradication: The firefighters spraying water to put the fire out entirely.
- Recovery: Cleaning up the water, painting the walls, and reopening the kitchen.
- Lessons Learned: Realizing the fire started because of a faulty microwave, and buying a safer microwave to prevent it from happening again.
Incident Response is simply digital firefighting.
4. The NIST Incident Response Lifecycle
The standard framework used by professional Security Operations Centers (SOCs).- 1. Preparation: Establishing the Computer Security Incident Response Team (CSIRT). Creating Playbooks (e.g., "What to do if Ransomware is detected"). Ensuring backups are working and segregated.
- 2. Identification (Detection): The SIEM generates an alert. An analyst investigates and confirms it is a True Positive (an actual breach), upgrading it from an "Event" to an "Incident."
- 3. Containment: Stopping the bleeding. This must happen immediately. If malware is on a laptop, you disconnect it from the network.
- 4. Eradication: Removing the threat. Deleting the malware, disabling the compromised user accounts, and patching the vulnerability the attacker used to get in.
- 5. Recovery: Restoring the affected systems from clean backups, bringing them back online, and monitoring them closely to ensure the attacker didn't leave a backdoor.
- 6. Lessons Learned: The most important phase. Writing an After-Action Report detailing how the breach happened, what the IR team did right/wrong, and updating the firewall rules to ensure it never happens again.
5. The Golden Rule of Containment
When an IT administrator discovers a virus on a server, their first instinct is often to pull the power plug or hold the power button to turn it off. Never do this. Turning off a machine destroys the Random Access Memory (RAM). Advanced malware often runs entirely in memory and leaves no trace on the hard drive. If you kill the power, you destroy the only forensic evidence of the attack, including the cryptographic keys the ransomware used. Proper Containment: Pull the ethernet cable or disable the network switch port. The machine stays powered on for forensic analysis, but it can no longer communicate with the network.6. Threat Hunting
Incident Response is reactive; you wait for an alarm. Threat Hunting is proactive. A security analyst assumes the SIEM missed the attacker. The analyst actively searches through network logs, looking for subtle anomalies (e.g., "Why is the Marketing server using PowerShell to execute base64-encoded commands at 3:00 AM?"). They hunt for the adversary *before* the malware detonates.7. Mini Project: Build an Incident Response Plan
Draft a high-level playbook for a "Compromised Executive Account" scenario.- 1. Trigger: SIEM alerts that the CFO logged in from a known malicious IP address.
- 2. Containment: Immediately disable the CFO's Active Directory account. Revoke all active session tokens in Office 365.
- 3. Investigation: Query the SIEM. Did the CFO's account download any files? Did it send any emails (Business Email Compromise)?
- 4. Eradication: Mandate a password reset. Re-image the CFO's laptop if malware is suspected.
- 5. Recovery: Re-enable the account once the CFO's identity is verified via phone.
- 6. Notification: If the attacker downloaded financial data, notify the Legal and PR departments immediately.
8. Real-World Scenarios
A hospital was hit by a destructive ransomware attack. They had no Incident Response plan. When the servers started encrypting, the IT team panicked and began shutting down random core routers, accidentally taking down the hospital's VoIP phone system and critical care monitoring systems. The lack of a structured Containment protocol caused more operational damage than the ransomware itself. A trained CSIRT would have systematically isolated the infected subnets while keeping life-critical networks operational.9. Best Practices
- Out-of-Band Communication: If an advanced attacker compromises your network, assume they are reading your corporate emails and listening to your Teams/Slack messages. The Incident Response team must use a completely separate, secure communication platform (like Signal) to coordinate the defense. If you plan your counter-attack on corporate email, the hacker will read the plan.
10. Legal and Ethical Notes
If an incident involves an insider threat or requires law enforcement intervention, the IR team must maintain a strict "Chain of Custody." Every forensic action taken on a compromised machine must be heavily documented. If an IT admin casually browses the compromised hard drive, they contaminate the evidence, making it legally inadmissible in court.11. Exercises
- 1. Detail the 6 phases of the NIST Incident Response Lifecycle. Which phase is considered the most critical for preventing future breaches?
- 2. Explain why pulling the power cord on a compromised server is a severe forensic mistake.
12. FAQs
Q: Should a company ever pay a ransom? A: Law enforcement agencies globally advise against it. Paying funds organized crime and encourages future attacks. Furthermore, there is no guarantee the criminals will actually provide the decryption key. Proper Preparation (having immutable, offline backups) is the only way to ensure you never have to consider paying.13. Interview Questions
- Q: Describe the difference between a Security Event and a Security Incident. At what point does an alert escalate to trigger the formal Incident Response plan?
- Q: You are leading an Incident Response engagement for a suspected active breach. Detail your strategy for communicating with your team and external stakeholders, specifically addressing the risk of compromised internal channels.