CloudWatch Logs Data Protection Policies for SLG/EDU

Logging data is generally beneficial, but masking sensitive data in logs is valuable for organizations subject to strict regulations such as the Health Insurance Portability and Accountability Act (HIPAA), the General Data Protection Regulation (GDPR), the Payment Card Industry Data Security Standard (PCI DSS), and the Federal Risk and Authorization Management Program (FedRAMP).

Data protection policies in CloudWatch Logs enable customers to scan log data in transit for sensitive data and mask any sensitive data that is detected.

These policies use pattern matching and machine learning models to detect sensitive data, and they help you audit and mask that data when it appears in log events ingested by CloudWatch log groups in your account.

The techniques and criteria used to select sensitive data are referred to as managed data identifiers. Using these managed data identifiers, CloudWatch Logs can detect:

Credentials such as private keys or AWS secret access keys
Device identifiers such as IP addresses or MAC addresses
Financial information such as bank account numbers, credit card numbers, or credit card verification codes
Protected Health Information (PHI) such as Health Insurance Card Numbers (EHIC) or Personal Health Numbers
Personally Identifiable Information (PII) such as driver’s licenses, social security numbers or taxpayer identification numbers

note

Sensitive data is detected and masked when it is ingested into the log group. When you set a data protection policy, log events ingested to the log group before that time are not masked.
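A data protection policy is a JSON document that lists the managed data identifiers to look for, together with an audit statement and a de-identify (mask) statement. The sketch below shows one way to assemble such a document in Python and attach it to a log group with boto3. The policy name, the log group name, and the particular identifiers chosen are illustrative assumptions; check the identifier names you need against the CloudWatch Logs documentation before relying on them.

```python
import json

# ARN prefix shared by the managed data identifiers.
IDENTIFIER_PREFIX = "arn:aws:dataprotection::aws:data-identifier/"


def build_policy(name, identifiers):
    """Return a data protection policy document as a JSON string."""
    arns = [IDENTIFIER_PREFIX + identifier for identifier in identifiers]
    policy = {
        "Name": name,
        "Description": "Audit and mask selected sensitive data",
        "Version": "2021-06-01",
        "Statement": [
            {
                # Audit statement: reports findings (no destination configured here).
                "Sid": "audit",
                "DataIdentifier": arns,
                "Operation": {"Audit": {"FindingsDestination": {}}},
            },
            {
                # De-identify statement: masks the same identifiers.
                "Sid": "redact",
                "DataIdentifier": arns,
                "Operation": {"Deidentify": {"MaskConfig": {}}},
            },
        ],
    }
    return json.dumps(policy, indent=2)


def apply_policy(log_group, policy_document):
    """Attach the policy to a log group (requires boto3 and AWS credentials)."""
    import boto3  # imported lazily so build_policy works without boto3 installed

    logs = boto3.client("logs")
    logs.put_data_protection_policy(
        logGroupIdentifier=log_group,
        policyDocument=policy_document,
    )


# "example-policy" and the identifier selection are hypothetical.
policy_json = build_policy(
    "example-policy", ["AwsSecretKey", "IpAddress", "CreditCardNumber"]
)
# apply_policy("example-log-group", policy_json)  # uncomment to apply
```

Both statements reference the same list of identifiers, so building the list once keeps the audit and mask operations in sync.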

Let us expand on some of the data types mentioned above and see some examples:

Data Types

Credentials

Credentials are sensitive data used to verify who you are and whether you have permission to access the resources that you are requesting. AWS uses credentials such as private keys and secret access keys to authenticate and authorize your requests.

With CloudWatch Logs data protection policies, sensitive data that matches the data identifiers you have selected is masked (a masked example appears in the Masked Logs section).

The CloudWatch Logs Data Protection for Credentials1

The CloudWatch Logs Data Protection for Credentials2

tip

Data classification best practices start with clearly defined data classification tiers and requirements, which meet your organizational, legal, and compliance standards.

As a best practice, tag AWS resources according to your data classification framework to implement compliance in accordance with your organization's data governance policies.

tip

To avoid sensitive data in your log events, the best practice is to exclude it in your code in the first place and log only necessary information.
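One way to keep sensitive values out of log events at the source is a logging filter that redacts them before they are emitted. The following is a minimal sketch using Python's standard logging module. The two regular expressions are illustrative assumptions about the shape of an AWS access key ID and a US social security number, not a complete or authoritative detector, and the names redact and RedactingFilter are hypothetical.

```python
import logging
import re

# Illustrative patterns only; adapt them to the data your application handles.
PATTERNS = [
    re.compile(r"AKIA[0-9A-Z]{16}"),       # assumed shape of an AWS access key ID
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),  # assumed shape of a US social security number
]


def redact(text):
    """Replace every match of the patterns above with a placeholder."""
    for pattern in PATTERNS:
        text = pattern.sub("[REDACTED]", text)
    return text


class RedactingFilter(logging.Filter):
    """Logging filter that redacts the formatted message of each record."""

    def filter(self, record):
        # Format the message first so values passed as arguments are covered too.
        record.msg = redact(record.getMessage())
        record.args = ()
        return True  # keep the record, now redacted


logger = logging.getLogger("example-app")
logger.addFilter(RedactingFilter())
logger.warning("login failed for ssn %s", "123-45-6789")
```

A filter like this complements a data protection policy rather than replacing it: the policy still catches anything the application-side patterns miss.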

Financial Information

As defined by the Payment Card Industry Data Security Standard (PCI DSS), bank account and routing numbers, debit and credit card numbers, and credit card magnetic stripe data are considered sensitive financial information.

Once you set a data protection policy, CloudWatch Logs scans for the data identifiers that you specify, regardless of the geolocation in which the log group is located.

The CloudWatch Logs Data Protection for Financial

info

Check the full list of financial data types and data identifiers

Protected Health Information (PHI)

PHI includes a very wide set of personally identifiable health and health-related data, including insurance and billing information, diagnosis data, clinical care data such as medical records and data sets, and lab results such as images and test results.

CloudWatch Logs scans the chosen log group, detects health information, and masks that data.

The CloudWatch Logs Data Protection for PHI

info

Check the full list of PHI data types and data identifiers

Personally Identifiable Information (PII)

PII is a textual reference to personal data that could be used to identify an individual. PII examples include addresses, bank account numbers, and phone numbers.

The CloudWatch Logs Data Protection for PHI

info

Check the full list of PII data types and data identifiers

Masked Logs

If you now open the log group where you set the data protection policy, you will see that data protection is On, and the console also displays a count of sensitive data.

The CloudWatch Logs Data Protection for PHI

Clicking View in Logs Insights takes you to the Logs Insights console. Running the query below returns the 20 most recent log events.

fields @timestamp, @message
| sort @timestamp desc
| limit 20

Once you expand a query result, you will see the masked data as shown below:

The CloudWatch Logs Data Protection for PHI
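The same query can also be run programmatically. The sketch below assumes a boto3 CloudWatch Logs client is passed in, starts the query with start_query, and polls get_query_results until the query leaves the Scheduled or Running state. The function name run_recent_events_query, the one-hour lookback window, and the log group name are hypothetical choices for illustration.

```python
import time

# The same Logs Insights query shown above.
QUERY = """fields @timestamp, @message
| sort @timestamp desc
| limit 20"""


def run_recent_events_query(logs_client, log_group, lookback_seconds=3600,
                            poll_seconds=1.0):
    """Run QUERY against one log group and return the final response."""
    end_time = int(time.time())
    started = logs_client.start_query(
        logGroupName=log_group,
        startTime=end_time - lookback_seconds,
        endTime=end_time,
        queryString=QUERY,
    )
    while True:
        response = logs_client.get_query_results(queryId=started["queryId"])
        # Any status other than Scheduled or Running is terminal.
        if response["status"] not in ("Scheduled", "Running"):
            return response
        time.sleep(poll_seconds)


# Usage, assuming boto3 is installed and AWS credentials are configured:
#   import boto3
#   response = run_recent_events_query(boto3.client("logs"), "example-log-group")
#   for row in response["results"]:
#       print({cell["field"]: cell["value"] for cell in row})
```

The caller should check the returned status, since a query can also end as Failed, Cancelled, or Timeout.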

important

When you create a data protection policy, sensitive data that matches the data identifiers you've selected is masked by default. Only users who have the logs:Unmask IAM permission can view unmasked data.
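As an illustration of granting that permission narrowly, the sketch below builds an IAM policy document that allows logs:Unmask on a single log group. The account ID, Region, and log group name in the ARN are hypothetical, and scoping the statement to one log group ARN is an assumption about how you may want to limit access; users also need the usual read permissions for the log group.

```python
import json


def build_unmask_policy(log_group_arn):
    """Return an IAM policy document allowing logs:Unmask on one log group."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "AllowUnmask",
                "Effect": "Allow",
                "Action": ["logs:Unmask"],
                "Resource": log_group_arn,
            }
        ],
    }


# Hypothetical ARN used only for illustration.
EXAMPLE_ARN = "arn:aws:logs:us-east-1:111122223333:log-group:example-log-group:*"
unmask_policy = build_unmask_policy(EXAMPLE_ARN)
print(json.dumps(unmask_policy, indent=2))
```

A user with this permission can view the original values, for example by using the unmask(@message) function in a Logs Insights query.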

tip

Use AWS Identity and Access Management (IAM) to administer and restrict access to sensitive data in CloudWatch.

tip

Regular monitoring and auditing of your cloud environment are equally important in safeguarding sensitive data. This becomes critical when applications generate a large volume of data, so it is recommended not to log an excessive amount of data. Read this AWS Prescriptive Guidance for Logging Best Practices

tip

Log group data is always encrypted in CloudWatch Logs. Optionally, you can use AWS Key Management Service (AWS KMS) to encrypt your log data with a key that you manage.
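A minimal sketch of associating a KMS key with an existing log group through boto3 follows. The function name encrypt_log_group, the log group name, and the key ARN are hypothetical, and the sketch assumes the key policy already allows the CloudWatch Logs service to use the key.

```python
def encrypt_log_group(logs_client, log_group, kms_key_arn):
    """Associate a KMS key with a log group so newly ingested data uses it."""
    logs_client.associate_kms_key(
        logGroupName=log_group,
        kmsKeyId=kms_key_arn,
    )


# Usage, assuming boto3 is installed and AWS credentials are configured.
# The key ID is elided and the ARN is a placeholder:
#   import boto3
#   encrypt_log_group(
#       boto3.client("logs"),
#       "example-log-group",
#       "arn:aws:kms:us-east-1:111122223333:key/...",
#   )
```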

tip

For resiliency and scalability, set up CloudWatch alarms and automate remediation using Amazon EventBridge and AWS Systems Manager.
