Skip to content

Latest commit

 

History

History
173 lines (131 loc) · 9.23 KB

File metadata and controls

173 lines (131 loc) · 9.23 KB

Amazon Redshift WLM Query Monitoring Rule (QMR) Action Notification Utility

Goals

This utility uses a scheduled Lambda function to pull records from the QMR action system log table (stl_wlm_rule_action) and publish them to an SNS topic. This utility can be used to send periodic notifications based on the WLM query monitoring rule actions taken for your unique workload and rules configuration.

In Amazon Redshift workload management (WLM), query monitoring rules define metrics-based performance boundaries for WLM queues and specify what action to take when a query goes beyond those boundaries. For example, for a queue dedicated to short running queries, you might create a rule that aborts queries that run for more than 60 seconds. To track poorly designed queries, you might have another rule that logs queries that contain nested loops. The rule actions are captured in stl_wlm_rule_action system table. For more information about Redshift workload management (WLM) query monitoring rules and how to configure it, please refer to Redshift Documentation

The utility periodically scans stl_wlm_rule_action.actions (log/hop/abort) recorded by WLM query monitoring rules and sends the records as SNS notifications. In summary, a Lambda function is invoked on a scheduled interval, connects to your Redshift cluster, reads events from stl_wlm_rule_action and publishes them to an SNS topic as a JSON string.

Installation Notes:

Prerequisites

This utility requires the following items:

  • VPC: A VPC which currently contains your Amazon Redshift resource and will contain this utility's Lambda function. NOTE: VPC ID

  • Private Subnets with NAT route: At least two private subnets within that VPC with private routes to the target Amazon Redshift cluster. You should have a NAT Gateway to give access to the Internet for those subnets' routing tables. You cannot use public subnets. You can read more information on this Lambda requirement here: AWS blog. NOTE: Subnet IDs

  • Security Group: A VPC security group which allows the Lambda function access to your Amazon Redshift cluster on the port specified for SQL connections. NOTE: VPC Security Group ID

  • An Amazon Redshift cluster in the above VPC. NOTE: Amazon Redshift cluster's Endpoint, Port, Database

  • Database user credentials for an Amazon Redshift user with access to STL_WLM_RULE_ACTION. A superuser will be able to see all rows in this table, and a non-privileged user will be able to see only their own rows. More on visibility here: Visibility of Data in System Tables and Views. NOTE: Amazon Redshift cluster's user name and password

  • An active WLM configuration with QMR enabled (Documentation).

  • Access to an IAM user with privileges to create and modify the necessary CloudFormation, KMS, IAM, SNS, and CloudWatch Events resources.

  • A locally cloned amazon-redshift-utils project containing this utility and AWS CLI and/or AWS Console access.

Installation from CloudFormation Template:

The quickest way to get up and running with the QMRNotificationUtility is by leveraging the packaged CloudFormation template and the AWS CLI.

1. Navigate to the QMRNotificationUtility's directory within the amazon-redshift-utils project:

git clone/pull amazon-redshift-utils
cd amazon-redshift-utils/src/QMRNotificationUtility

2. Copy the zipped python Deployment Package for the Lambda function to a location of your choosing in S3:

aws s3 cp ./lambda/dist/qmr-action-notification-utility-1.4.zip s3://yourbucket/qmr-action-notification-utility-1.4.zip

3. Gather the necessary identifiers noted in the prerequistes section above:

  • VPC ID
  • Subnet ID(s)
  • Security Group ID(s)
  • Cluster Endpoint
  • Cluster Port
  • Cluster Database
  • Cluster Credentials (Username and Password)
  • Bucket to host the Lambda Deployment Package
  • Email address to be notified of WLM actions

4. Create the Lambda Function

Use the AWS CLI to create a stack containing the necessary dependencies and Lambda function:

aws cloudformation create-stack \
--stack-name qmr-action-notification-utility \
--template-body file://./cloudformation/qmr-action-notification-utility.yaml \
--parameters \
  ParameterKey=S3Bucket,ParameterValue=yourbucket \
  ParameterKey=S3Key,ParameterValue=qmr-action-notification-utility-1.4.zip \
  ParameterKey=SNSEmailParameter,[email protected] \
  ParameterKey=VPC,ParameterValue=vpc-abcd1234 \
  ParameterKey=SubnetIds,ParameterValue=subnet-abcd1234 \
  ParameterKey=SecurityGroupIds,ParameterValue=sg-abcd1234 \
  ParameterKey=RedshiftMonitoringUser,ParameterValue=monitoring_user \
  ParameterKey=RedshiftClusterPort,ParameterValue=cluster_port \
  ParameterKey=RedshiftClusterEndpoint,ParameterValue=examplecluster.abcd12340987.us-east-1.redshift.amazonaws.com \
  ParameterKey=RedshiftClusterDatabase,ParameterValue=db_name \
  ParameterKey=MonitoringDBPasswordCiphertext,ParameterValue= \
--capabilities CAPABILITY_IAM

5. Verify creation is complete

It may take a few mintues for the stack's resources to be provisioned, and is completed when the following command returns "CREATE_COMPLETE":

aws cloudformation describe-stacks --stack-name qmr-action-notification-utility --query 'Stacks[0].StackStatus' --output text

6. Add an encrypted password

From the completed stack creation, extract the KMS Key ID, and use that Key to process your plaintext database password to ciphertext:

# Extract KMS Key ID
KMSKEYID=`aws cloudformation describe-stack-resource --stack-name qmr-action-notification-utility --logical-resource-id RedshiftKMSKey --query 'StackResourceDetail.PhysicalResourceId' --output text`
# Generate a read restricted local file to store your plaintext password
(umask 077; touch passwd.txt)
# Insert your plaintext password into file. If using vi ensure binary mode and no automatic EOL
vi -b -c 'set noeol' passwd.txt
# Read plaintext password file contents into kms encrypt to generate ciphertext
CIPHERTEXT=`aws kms encrypt --key-id $KMSKEYID --plaintext file://./passwd.txt --query 'CiphertextBlob' --output text`
# Cleanup password file
rm passwd.txt

7. Update your CloudFormation stack

Add the MonitoringDBPasswordCiphertext parameter with the ciphertext generated from the previous step, leaving all other parameters unchanged:

aws cloudformation update-stack \
--stack-name qmr-action-notification-utility \
--use-previous-template \
--parameters \
  ParameterKey=MonitoringDBPasswordCiphertext,ParameterValue=$CIPHERTEXT \
  ParameterKey=S3Bucket,UsePreviousValue=true \
  ParameterKey=S3Key,UsePreviousValue=true \
  ParameterKey=SNSEmailParameter,UsePreviousValue=true \ 
  ParameterKey=VPC,UsePreviousValue=true \
  ParameterKey=SubnetIds,UsePreviousValue=true \
  ParameterKey=SecurityGroupIds,UsePreviousValue=true \
  ParameterKey=RedshiftMonitoringUser,UsePreviousValue=true \
  ParameterKey=RedshiftClusterPort,UsePreviousValue=true \
  ParameterKey=RedshiftClusterEndpoint,UsePreviousValue=true \
  ParameterKey=RedshiftClusterDatabase,UsePreviousValue=true \
--capabilities CAPABILITY_IAM

8. Verify the modification is complete

It may take a moment for the stack's resources to be updated, and is done when the following command returns "UPDATE_COMPLETE":

aws cloudformation describe-stacks --stack-name qmr-action-notification-utility --query 'Stacks[0].StackStatus' --output text

9. Check the inbox of the email address you included for SNSEmailParameter.

There should be an "AWS Notification - Subscription Confirmation" from [email protected] asking that you confirm your subscription. Click the link if you wish to receive updates on this email address.

10. Verify the email address receives an email notification within 5 minutes

By purposely triggering a QMR action by manually running SQL that is known to violate a rule defined in your active WLM configuration. Below is one example SNS notification email message:

[
  {
     "clusterid":"examplecluster",
     "database":"dev",
     "userid":100,
     "query":1186777,
     "service_class":6,
     "rule":"Rule1_Nested_Loop",
     "action":"abort",
     "recordtime":"2017-06-12T06:51:57.167052"
  },
  {
      "clusterid":"examplecluster",
      "database":"dev",
      "userid":100,
      "query":1186999,
      "service_class":6,
      "rule":"Rule2_high_Return_Rows",
      "action":"log",
      "recordtime":"2017-06-12T06:53:48.935375"
   }
]

Rebuilding Lambda Function

If you wish to rebuild the Lambda function yourself, you can use lambda/build.sh to create a zipped Deployment Package to upload to your S3 bucket. This utility requires pip and virtualenv python dependencies. This script will initialize a transient virtual environment, download python dependencies from requirements.txt, and zip the lambda function source code with dependencies into a versioned archive for uploading to S3.