5.6: Detecting Attacks on Data and Applications
Essential Questions
- How do user activity logs reveal patterns that distinguish normal behavior from malicious data access attempts?
- What makes honeypots effective early warning systems, and why do they generate few false positives?
- How can cryptographic hash functions prove whether files have been tampered with or remain unchanged?
- Which specific characters and patterns in log files indicate SQL injection, XSS, or directory traversal attacks?
- How do you balance detection speed against accuracy when choosing between real-time and retrospective analysis methods?
Overview
Imagine a bank that installed the most sophisticated vault, advanced locks, and multiple security barriers, but never bothered to monitor who enters the building or what they do inside. Even the best protective measures become meaningless without the ability to detect when they're being tested, bypassed, or compromised. This scenario perfectly mirrors the challenge facing modern applications and data systems: protection is only half the security equation.
Detection fills the critical gap between prevention and response. While firewalls, encryption, and input validation work to prevent attacks, detection systems monitor ongoing activity to identify when attacks succeed, when new threats emerge, or when insider threats manifest. Without detection capabilities, organizations remain blind to breaches until the damage becomes obvious—often weeks or months after the initial compromise.
Effective detection requires understanding what normal looks like so you can recognize what abnormal looks like. This lesson explores three complementary approaches to attack detection: analyzing user activity logs for suspicious patterns, deploying honeypots as early warning systems, and using cryptographic hashes to verify data integrity. You'll learn to recognize the specific indicators that reveal different types of application attacks in log files, understand how to balance detection speed against accuracy, and master practical techniques for verifying whether files have been altered.
How to Detect Attacks on Data (5.6.A)
Data represents one of the most valuable assets in modern organizations, making it a primary target for attackers seeking to steal, modify, or destroy information for financial gain or competitive advantage. Detecting attacks on data requires understanding how legitimate users interact with information and recognizing deviations from normal patterns. The foundation lies in comprehensive accounting—recording and monitoring all user activities related to data access, modification, and transmission.
Every interaction with data generates digital traces that can reveal malicious activity. When users access files, databases, or applications, systems create log entries recording who accessed what information, when, from which device, and what actions were performed. These logs become primary evidence for detecting unauthorized access attempts.
Suspicious activity patterns often stand out against normal usage baselines. Users typically access files during business hours, from familiar devices, following predictable patterns based on job responsibilities. A marketing employee accessing customer lists during business hours from their usual laptop appears normal. The same employee accessing financial databases at 3 AM from an unfamiliar device raises immediate red flags.
Attackers often reveal themselves through the types of data they target. Legitimate users access information they need for work, but attackers frequently attempt to access sensitive files outside their normal scope. An HR employee suddenly accessing source code repositories, or a contractor attempting to open proprietary research files, represents an access pattern that warrants investigation.
File access attempts often reveal attack progression. Attackers typically don't know exactly where valuable data is stored, so they explore systematically. Log analysis can detect patterns of broad file access, attempts to access multiple directories rapidly, or queries searching for files containing sensitive keywords like "password" or "confidential." These exploration patterns differ markedly from legitimate users who navigate directly to specific needed files.
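As a rough sketch of how such rules can be automated, the Python example below applies three simple checks (off-hours access, unfamiliar device, and rapid multi-directory exploration) to a handful of access records. The record format, usernames, baseline data, and thresholds are illustrative assumptions rather than a standard log schema:

from collections import defaultdict
from datetime import datetime

# Illustrative access records; a real system would parse these from logs.
events = [
    {"user": "mwhite", "time": "2025-01-15 03:12:44", "device": "UNKNOWN-LAPTOP", "path": "/finance/payroll.xlsx"},
    {"user": "mwhite", "time": "2025-01-15 03:13:02", "device": "UNKNOWN-LAPTOP", "path": "/hr/salaries.csv"},
    {"user": "mwhite", "time": "2025-01-15 03:13:20", "device": "UNKNOWN-LAPTOP", "path": "/engineering/source_tree.zip"},
    {"user": "jdoe",   "time": "2025-01-15 10:05:10", "device": "JDOE-LAPTOP",    "path": "/marketing/customers.csv"},
]

known_devices = {"mwhite": {"MWHITE-LAPTOP"}, "jdoe": {"JDOE-LAPTOP"}}  # assumed per-user baseline
business_hours = range(8, 18)   # assumed 08:00-17:59 workday
exploration_threshold = 2       # assumed: more than 2 distinct top-level directories is unusual

dirs_touched = defaultdict(set)
for e in events:
    hour = datetime.strptime(e["time"], "%Y-%m-%d %H:%M:%S").hour
    if hour not in business_hours:
        print(f"ALERT off-hours access: {e['user']} opened {e['path']} at {e['time']}")
    if e["device"] not in known_devices.get(e["user"], set()):
        print(f"ALERT unknown device: {e['user']} used {e['device']}")
    dirs_touched[e["user"]].add(e["path"].split("/")[1])

for user, dirs in dirs_touched.items():
    if len(dirs) > exploration_threshold:
        print(f"ALERT broad exploration by {user}: {sorted(dirs)}")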
Honeypots provide another powerful detection approach—files that appear to contain valuable data but actually contain fake information. These files have attractive names like "customer_database_backup.sql" or "executive_salaries.xlsx" but contain synthetic data with no legitimate business purpose. Since there's no reason for legitimate users to access honeypot files, any attempt to open, copy, or modify them indicates malicious activity.
The strength of honeypots lies in their low false positive rate. Unlike behavioral analysis that might flag legitimate users working unusual hours, honeypot access is almost always malicious. This characteristic makes honeypots valuable for high-confidence alerting and automated response systems.
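A minimal honeypot monitor can be as simple as watching a decoy file's access time. The Python sketch below polls os.stat on a hypothetical decoy file and prints an alert whenever the access timestamp changes; it assumes the filesystem records access times at all (many Linux systems mount with relatime or noatime), so a production monitor would typically use audit logging or file-system notifications instead:

import os
import time

HONEYPOT = "customer_database_backup.sql"   # decoy file name from the text; the path is assumed

def watch(path, interval=5):
    last_atime = os.stat(path).st_atime
    while True:
        time.sleep(interval)
        atime = os.stat(path).st_atime
        if atime != last_atime:
            # Any read of a decoy file is suspicious by definition.
            print(f"ALERT: honeypot {path} was accessed at {time.ctime(atime)}")
            last_atime = atime

if __name__ == "__main__":
    watch(HONEYPOT)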
Cryptographic hash functions offer a complementary detection approach focusing on data integrity rather than access patterns. Hash functions generate unique digital fingerprints for files that change dramatically if even a single bit is modified. By calculating and storing hashes for important files, organizations can later verify whether those files have been altered unexpectedly.
The process is straightforward: calculate a cryptographic hash for a file and store that hash value securely. Later, recalculate the hash for the same file and compare the new value to the stored value. If hashes match, the file is unchanged. If they differ, the file has been modified since the original hash was calculated. This technique detects unauthorized modifications, corruption, or tampering attempts that might not be visible through other monitoring methods.
Interactive Behavioral Anomaly Detection
Analyze user access patterns to identify suspicious activities. See how behavioral baselines help distinguish normal work patterns from potential security threats.
Key Learning Points:
- Behavioral analysis identifies deviations from normal user patterns
- Off-hours access and unknown devices are strong anomaly indicators
- Accessing data outside normal job scope suggests potential insider threats
- Combining multiple detection rules provides comprehensive coverage
- Tuning anomaly thresholds balances detection accuracy with false positives
- Timeline analysis reveals attack progression and coordination
Determine Controls for Detecting Attacks Against Applications or Data (5.6.B)
Selecting appropriate detection controls requires balancing cost, data sensitivity, and regulatory requirements—factors that guide organizations toward detection strategies providing optimal security coverage within resource constraints and compliance obligations.
Cost represents the most immediate practical constraint. Detection controls range from inexpensive options like basic log analysis and simple honeypots to expensive enterprise solutions like comprehensive data loss prevention (DLP) services and advanced behavioral analytics platforms. Hash-based integrity checking falls into the low-cost category, requiring minimal infrastructure and using standard tools available on most systems.
Budget-conscious organizations can implement effective detection using existing system logs, create honeypot files using standard office applications, and set up automated hash verification using built-in operating system tools. These approaches require more manual effort but provide solid detection coverage for organizations with limited security budgets.
Mid-range investments might include commercial log analysis tools with automated pattern recognition, centralized log collection systems, and basic DLP solutions that monitor data movement. High-end solutions include comprehensive DLP services, advanced behavioral analytics platforms using machine learning, and integrated security information and event management (SIEM) systems that correlate data from multiple detection sources.
Data sensitivity drives the intensity and sophistication of detection controls. Data containing personally identifiable information (PII), protected health information (PHI), financial records, or intellectual property warrant more intensive monitoring than publicly available or low-sensitivity operational data. The potential impact of a breach involving highly sensitive data justifies higher detection control costs.
For highly sensitive data, organizations typically implement multiple overlapping detection methods: comprehensive activity logging, multiple honeypots with different apparent sensitivity levels, real-time hash verification, behavioral analytics that detect subtle access pattern changes, and DLP systems monitoring data movement both within the organization and to external destinations.
Regulatory frameworks often dictate detection requirements through legal mandates rather than organizational choice. Healthcare organizations handling PHI must comply with HIPAA requirements, including specific monitoring obligations. Financial institutions processing payment card information must meet PCI-DSS standards mandating comprehensive logging and monitoring systems. Educational institutions managing student records must satisfy FERPA requirements for access monitoring and unauthorized disclosure detection.
These regulatory frameworks typically specify minimum detection control requirements, including log retention periods, monitoring scope, incident detection timeframes, and reporting obligations. Understanding these requirements early in the detection control selection process helps organizations avoid costly retrofits to achieve compliance.
Interactive Hash Function Learning
Explore different cryptographic hash functions, generate hashes, verify data integrity, and understand how hash verification protects against data tampering.
Hash Function Applications:
Data Integrity:
- File integrity monitoring
- Software distribution verification
- Database integrity checks
Security Applications:
- Digital signatures
- Password storage (with salt)
- Blockchain and cryptocurrencies
Evaluate the Impact of a Method for Detecting Attacks Against an Application or Data (5.6.C)
The effectiveness of detection methods depends on their speed, accuracy, and ability to enable timely response to identified threats. Understanding these characteristics helps organizations select detection approaches that match their security objectives and operational constraints, as different methods excel in different areas.
Detection speed varies dramatically between monitoring approaches and directly affects an organization's ability to respond while attacks are still in progress. Real-time detection systems provide alerts as attacks happen, creating opportunities for immediate response that can stop attacks before they achieve objectives. Honeypots excel here—the moment someone accesses a honeypot file, automated systems can generate alerts, potentially lock accounts, and initiate incident response within seconds.
Advanced DLP tools and real-time log analysis systems also provide rapid detection. These systems continuously monitor data access patterns, network traffic, and user behaviors, generating alerts when predefined thresholds or suspicious patterns are detected. The speed advantage comes at the cost of increased complexity and higher resource requirements, but for high-value data, this investment often proves worthwhile.
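As a rough illustration of the difference between real-time and retrospective monitoring, the Python sketch below tails a log file and raises an alert the moment a new line matches a suspicious pattern. The log path and patterns are placeholders; a production deployment would rely on a SIEM or DLP agent rather than a hand-rolled loop:

import re
import time

LOG_PATH = "/var/log/app/access.log"   # assumed log location
SUSPICIOUS = re.compile(r"honeypot|UNION|<script|\.\./", re.IGNORECASE)  # illustrative patterns

def follow(path):
    with open(path) as f:
        f.seek(0, 2)                   # start at the end of the file: only new entries matter
        while True:
            line = f.readline()
            if not line:
                time.sleep(0.5)        # wait for the application to append more log data
                continue
            if SUSPICIOUS.search(line):
                print(f"REAL-TIME ALERT: {line.strip()}")

if __name__ == "__main__":
    follow(LOG_PATH)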
Retrospective detection methods identify attacks after they've occurred, providing valuable forensic evidence but limiting response options. Manual log analysis, periodic hash verification, and batch-processed behavioral analytics fall into this category. While these methods cannot stop ongoing attacks, they provide detailed information about attack methods, scope of compromise, and timeline of events essential for incident response and legal proceedings.
The choice between real-time and retrospective detection often depends on specific threats an organization faces and the nature of data being protected. Data that can be quickly extracted and monetized (like customer databases) requires real-time detection to prevent successful theft. Data valuable primarily for long-term strategic advantage (like research information) may be adequately protected with retrospective detection enabling thorough investigation.
Accuracy represents another critical dimension affecting practical utility. False positives—alerts generated by legitimate activities that appear suspicious—can overwhelm security teams and lead to "alert fatigue" where genuine threats are missed among numerous false alarms. False negatives—attacks that occur without triggering detection alerts—leave organizations blind to successful compromises.
Honeypots provide exceptional accuracy regarding false positives since there's rarely a legitimate reason for users to access fake data files. When honeypot alerts fire, security teams can respond with high confidence that malicious activity is occurring. This characteristic makes honeypots particularly valuable for automated response systems.
Behavioral analytics and anomaly detection systems often produce higher false positive rates because legitimate user behavior sometimes deviates from established patterns. Users working during unusual hours or accessing unfamiliar systems for legitimate business reasons can trigger false alarms. However, these systems excel at detecting subtle attacks that might not trigger other detection methods.
Hash-based integrity checking demonstrates the trade-offs inherent in different detection approaches. Hash verification provides perfect accuracy for detecting file modifications—if a hash changes, the file has definitely been altered. However, this method suffers from a significant false negative problem: attackers who steal data without modifying it will not trigger hash-based detection alerts.
The most effective detection strategies combine multiple methods to achieve both rapid alerting and comprehensive investigation capabilities. A honeypot might provide the initial alert triggering immediate response actions, while detailed log analysis provides forensic information needed for thorough incident investigation and remediation.
Identify Whether a File Has Been Altered by Verifying Its Hash (5.6.D)
Cryptographic hash functions provide a mathematically rigorous method for detecting file modifications by generating unique digital fingerprints that change dramatically when even tiny alterations occur to the original data. Understanding how to calculate, store, and verify hashes gives you a powerful tool for ensuring data integrity and detecting unauthorized file modifications.
Hash functions possess a critical property called repeatability: the same input data will always produce exactly the same hash output when processed by the same hash algorithm. This consistency enables the verification process—you calculate a hash for a file at one point in time, store that hash value, and later recalculate the hash for the same file to determine whether changes have occurred. If the two hash values match perfectly, the file is unchanged. If they differ by even a single character, the file has been modified.
The practical implementation involves three straightforward steps: initial hash calculation, secure hash storage, and periodic verification. When you first want to protect a file's integrity, you calculate its cryptographic hash using a reliable algorithm like SHA-256. You then store this hash value in a secure location separate from the original file. Later, when you want to verify the file's integrity, you recalculate its hash using the same algorithm and compare the new result to your stored reference hash.
Modern operating systems provide built-in tools for hash calculation. In Windows PowerShell, you can calculate a SHA-256 hash for a file with this command:
Get-FileHash testfile -Algorithm SHA256
Linux systems use the sha256sum command:
sha256sum testfile
On macOS systems, the equivalent command is:
shasum -a 256 testfile
All three commands produce the same SHA-256 hash for the same file: the hash value depends only on the file's contents and the algorithm, not on the operating system or tool used to calculate it.
Here's a practical example: imagine you have an important contract document that you want to monitor for unauthorized changes. You calculate its initial hash and get a result like:
b8c7a2d1e9f4a3c2b1d8e7f6a5b4c3d2e1f9a8b7c6d5e4f3a2b1c9d8e7f6a5b4 contract_final.pdf
You record this hash value securely. A month later, you recalculate the hash. If the file is unchanged, you'll get exactly the same hash. If someone has modified the contract—even changing a single period to a comma—the hash will be completely different, alerting you to the unauthorized modification.
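The same store-then-verify workflow can also be scripted. The following Python sketch uses the standard hashlib module to recompute a file's SHA-256 hash and compare it against a previously recorded value; the file name and the stored hash are placeholders taken from the example above:

import hashlib

def sha256_of(path):
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):   # hash in chunks so large files fit in memory
            h.update(chunk)
    return h.hexdigest()

stored_hash = "b8c7a2d1e9f4a3c2b1d8e7f6a5b4c3d2e1f9a8b7c6d5e4f3a2b1c9d8e7f6a5b4"  # recorded earlier
current_hash = sha256_of("contract_final.pdf")

if current_hash == stored_hash:
    print("File unchanged: hash matches the stored value.")
else:
    print("File MODIFIED: hash does not match the stored value.")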
The strength of hash-based integrity checking lies in its mathematical foundation. Cryptographic hash functions are designed so that finding two different files that produce the same hash is computationally infeasible. However, hash verification has important limitations: it only detects whether files have been changed, not whether they've been accessed, copied, or stolen without modification. An attacker could read, copy, and transmit your entire file without changing a single bit, and hash verification would show no evidence of this data theft.
Apply Detection Techniques to Identify and Report Indicators of Application Attacks by Analyzing Log Files (5.6.E)
Application attack detection requires understanding the specific patterns and indicators that different attack types leave in system and application logs. Each attack method creates characteristic signatures in log data that, when properly analyzed, reveal both the presence of an attack and specific details about the attack methods being used. Developing skill in log analysis enables you to identify ongoing attacks, assess their scope and impact, and gather evidence needed for incident response.
SQL injection attacks leave distinctive traces in application and server logs because they involve inserting SQL control characters and commands into user input fields. The key indicators include single quote characters (') and double quote characters (") that attackers use to break out of intended input parameters. Boolean conditions like OR 1=1 appear frequently because they create conditions that are always true, potentially bypassing authentication or authorization checks.
Comment sequences like double dashes (--) indicate attempts to comment out parts of SQL queries, allowing attackers to ignore security checks that might interfere with their injection. SQL control words such as WHERE, SELECT, UNION, and DROP in user input fields suggest attempts to inject database commands that should never appear in legitimate user input.
Here's an example of what SQL injection attempts might look like in application logs:
2025-01-15 14:23:07 POST /login.php username=admin'-- password=anything
2025-01-15 14:23:12 POST /search.php query=laptop' OR 1=1--
2025-01-15 14:23:18 POST /products.php id=5 UNION SELECT username,password FROM users
These log entries show clear SQL injection indicators: quote characters to break out of intended parameters, OR conditions to bypass logic, comment sequences to ignore security checks, and UNION statements to extract data from other database tables.
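A basic scanner for these indicators can be built with regular expressions. The Python sketch below, using sample entries that mirror the excerpt above, flags lines containing quote characters, comment sequences, always-true conditions, or SQL keywords. The patterns are deliberately coarse, so matches should be treated as leads for manual review rather than proof of an attack:

import re

SQLI_PATTERNS = [
    r"'", r'"',                       # quote characters used to break out of parameters
    r"--",                            # SQL comment sequence
    r"\bOR\s+1\s*=\s*1\b",            # always-true condition
    r"\b(UNION|SELECT|DROP|WHERE)\b", # SQL keywords that should not appear in user input
]
sqli_regex = re.compile("|".join(SQLI_PATTERNS), re.IGNORECASE)

log_lines = [
    "2025-01-15 14:23:07 POST /login.php username=admin'-- password=anything",
    "2025-01-15 14:23:12 POST /search.php query=laptop' OR 1=1--",
    "2025-01-15 14:25:40 POST /search.php query=winter jackets",
]

for line in log_lines:
    if sqli_regex.search(line):
        print(f"Possible SQL injection: {line}")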
Cross-site scripting (XSS) attacks appear in logs as attempts to inject HTML or JavaScript code through user input fields. The primary indicator is the presence of HTML tags, particularly the <script> tag that enables JavaScript execution. Attackers might try variations like <script>alert('XSS')</script> for testing, or more sophisticated code designed to steal user credentials or session tokens.
Other HTML tags that indicate XSS attempts include <iframe>, <object>, and <form> tags that could be used to load malicious content or capture user input. JavaScript event handlers like onclick, onload, and onerror within HTML attributes also suggest XSS injection attempts.
Buffer overflow attacks manifest in logs as unusually long input strings that exceed normal application parameters. Web applications typically have reasonable limits for URL length, cookie size, and query string parameters. Attackers attempting buffer overflow exploits often send data that significantly exceeds these normal limits, particularly strings containing repeated patterns or non-printable characters.
Directory traversal attacks appear in logs as HTTP GET requests containing path sequences designed to navigate outside intended directories. The primary indicator is the presence of ../ sequences (dot-dot-slash) that represent navigation to parent directories. Attackers might use variations like ..\ for Windows systems, ....// to bypass simple filters, or URL-encoded versions like %2e%2e%2f.
Common directory traversal patterns in logs include attempts to access sensitive system files like password databases (/etc/passwd) or configuration files that should not be accessible through web applications.
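Directory traversal and XSS indicators can be caught with a similar pass, with one extra step: URL-decoding each request before matching so that encoded forms like %2e%2e%2f do not slip past the filter. The Python sketch below also flags unusually long requests as possible buffer overflow attempts; the sample requests and the length threshold are illustrative assumptions:

import re
from urllib.parse import unquote

TRAVERSAL = re.compile(r"\.\./|\.\.\\")            # ../ and ..\ sequences
XSS = re.compile(r"<\s*(script|iframe|object|form)|on(click|load|error)\s*=", re.IGNORECASE)
MAX_REQUEST_LENGTH = 2000                          # assumed ceiling for a normal request line

requests = [
    "GET /download.php?file=%2e%2e%2f%2e%2e%2fetc/passwd",
    "GET /comment.php?text=<script>alert('XSS')</script>",
    "GET /profile.php?name=" + "A" * 5000,
]

for raw in requests:
    decoded = unquote(raw)                         # reveal %2e%2e%2f as ../
    if TRAVERSAL.search(decoded):
        print(f"Possible directory traversal: {raw[:80]}")
    if XSS.search(decoded):
        print(f"Possible XSS attempt: {raw[:80]}")
    if len(raw) > MAX_REQUEST_LENGTH:
        print(f"Possible buffer overflow (length {len(raw)}): {raw[:80]}...")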
Effective log analysis requires understanding both individual indicators and attack patterns that emerge over time. Single suspicious log entries might represent legitimate edge cases, but multiple related entries from the same source IP address within a short time frame strongly suggest coordinated attack attempts. The practical application extends beyond attack detection to include incident response, forensic investigation, and security improvement initiatives.
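Correlating indicators by source address is what turns isolated matches into evidence of a campaign. The short Python sketch below counts flagged entries per client IP and reports any address that crosses a threshold; the flagged list, addresses, and threshold are assumptions for illustration:

from collections import Counter

# (source_ip, matched_indicator) pairs produced by earlier pattern checks; contents are illustrative.
flagged = [
    ("203.0.113.7", "SQL injection"),
    ("203.0.113.7", "directory traversal"),
    ("203.0.113.7", "XSS"),
    ("198.51.100.4", "XSS"),
]

CAMPAIGN_THRESHOLD = 3   # assumed: three or more flagged requests from one IP suggests a campaign

counts = Counter(ip for ip, _ in flagged)
for ip, hits in counts.items():
    if hits >= CAMPAIGN_THRESHOLD:
        indicators = sorted({kind for src, kind in flagged if src == ip})
        print(f"Likely attack campaign from {ip}: {hits} flagged requests ({', '.join(indicators)})")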
Interactive Attack Pattern Recognition
Practice identifying attack indicators in web server logs. Learn to recognize SQL injection, XSS, directory traversal, and buffer overflow patterns in real log data.
Log Analysis Best Practices:
- Look for unusual characters in URLs and parameters (quotes, script tags, traversal sequences)
- Monitor for failed requests (4xx/5xx status codes) which may indicate attack attempts
- Pay attention to suspicious user agents and IP addresses with multiple attack patterns
- Correlate multiple log entries from the same source to identify attack campaigns
- Use automated tools to filter and highlight potential attacks, but verify manually
- Maintain baseline knowledge of normal traffic patterns to spot anomalies
Real-World Applications
Consider the 2013 Target breach, where attackers initially gained access through a third-party vendor's compromised credentials. The attack went undetected for weeks while attackers explored the network, accessed point-of-sale systems, and exfiltrated credit card data. Comprehensive detection controls including behavioral analytics, honeypots, and real-time log analysis could have identified the unusual access patterns, unauthorized system exploration, and large-scale data movement that characterized this attack. The incident illustrates how detection capabilities provide crucial early warning that can prevent massive breaches.
Further Reading & Resources
- NIST Cybersecurity Framework: Detect Function
- SANS Log Analysis Best Practices
- OWASP Logging Cheat Sheet
- Honeypot Deployment Guide by The Honeynet Project
- NIST Guide to Computer Security Log Management