LearningTree Β· AWS Β· Management & Governance

Management & Governance β€”
Visibility, Compliance & Operational Control

CloudWatch, CloudTrail, AWS Config, Systems Manager, Organizations, Control Tower β€” the services that monitor, audit, enforce compliance, and automate operations across your entire AWS environment.

CloudWatch CloudTrail AWS Config Systems Manager Organizations Control Tower
01
Chapter One

Why Management & Governance?

Management & Governance services give you the eyes, ears, and guardrails for your AWS environment β€” so you can monitor performance, audit every API call, enforce compliance, and automate operations at scale.

As your AWS footprint grows from a single account to hundreds of accounts running thousands of resources, you need a layered management stack. Without it, you're flying blind β€” unable to diagnose failures, prove compliance, or control costs.

The Four Pillars of AWS Management
πŸ“Š

Monitor & Observe

  • Collect metrics, logs, and traces from every resource
  • Set alarms and dashboards for real-time visibility
  • Detect anomalies before they become outages
  • Services: CloudWatch, X-Ray
πŸ”

Audit & Comply

  • Record every API call across accounts
  • Track resource configuration changes over time
  • Evaluate resources against compliance rules
  • Services: CloudTrail, AWS Config
πŸ”§

Operate & Automate

  • Patch, inventory, and configure fleets of instances
  • Automate runbooks and incident response
  • Manage parameters and secrets centrally
  • Services: Systems Manager
πŸ›οΈ

Govern & Control

  • Organise accounts into OUs with policies
  • Set up landing zones with guardrails
  • Track costs, set budgets, get alerts
  • Services: Organizations, Control Tower, Cost Explorer
Three Core Management Models
πŸ“ˆ

Reactive Monitoring

  • Metrics + alarms β†’ detect and respond to issues
  • CloudWatch Alarms trigger SNS or Auto Scaling
  • Log analysis after incidents occur
  • Good start, but you're always behind
πŸ›‘οΈ

Proactive Compliance

  • Rules define what "good" looks like
  • Continuous evaluation catches drift instantly
  • Auto-remediation fixes issues before humans notice
  • Config Rules + SSM Automation
πŸ—οΈ

Preventive Governance

  • SCPs and guardrails block bad actions before they happen
  • Landing zones enforce account structure
  • Budget alerts prevent cost surprises
  • Organizations + Control Tower
02
Chapter Two

Services & Spectrum

Core Management Services
Management Models at a Glance
PillarServicesPrimary DataScopeBest For
MonitoringCloudWatchMetrics, Logs, TracesPer-resource / applicationPerformance monitoring, alarming, dashboards
AuditingCloudTrailAPI call eventsPer-account / org-wideSecurity audit, forensics, compliance evidence
ComplianceAWS ConfigConfiguration snapshotsPer-resource / multi-accountDrift detection, rule evaluation, remediation
OperationsSystems ManagerInventory, commands, patchesFleet-wide (hybrid)Patch management, automation, parameter store
GovernanceOrganizations, Control TowerPolicies, guardrailsOrg-wideMulti-account strategy, preventive controls
Cost ControlCost Explorer, BudgetsBilling dataPer-account / org-wideCost visibility, budget alerts, right-sizing
The Management Spectrum β€” Observe to Govern

Each management service builds on the one before it. CloudWatch tells you what is happening now. CloudTrail tells you what happened. Config tells you if it's correct. Systems Manager lets you fix it. Organizations + Control Tower prevent it from going wrong in the first place.

← Real-time observation Preventive governance β†’
CloudWatch
Monitor & alert
CloudTrail
Audit API calls
Config
Evaluate compliance
Systems Manager
Operate & fix
Organizations
Govern & prevent
How the Services Connect
AWS Management β€” Data flow between services
MONITOR AWS Resources CloudWatch Alarms / SNS Metrics β†’ Alarms β†’ Action AUDIT API Calls CloudTrail S3 Bucket CW Logs EventBridge Who did what, when COMPLY Config Changes Config Rules Eval Remediation Is it compliant? OPERATE EC2 / On-prem SSM Patch Β· Run Β· Automate Β· Params GOVERN AWS Accounts Orgs SCPs Β· OUs Β· Consolidated Billing Your resources / events AWS management service
03
Chapter Three

Decision Guide

When to Use What
Use CloudWatch when…
  • Monitor CPU, memory, disk, custom metrics
  • Set alarms for threshold breaches
  • Centralise application and infrastructure logs
  • Build operational dashboards
  • Detect anomalies in metric patterns
Use CloudTrail when…
  • Audit who made an API call and when
  • Investigate security incidents forensically
  • Meet regulatory audit requirements
  • React to specific API events via EventBridge
  • Query historical API activity with Lake
Use AWS Config when…
  • Check if resources comply with internal rules
  • Detect configuration drift over time
  • Auto-remediate non-compliant resources
  • View resource configuration timeline
  • Run advanced queries across resources
Use Systems Manager when…
  • Patch EC2 or on-prem instances at scale
  • Run commands across fleets without SSH
  • Store config values and secrets (Parameter Store)
  • Automate operational runbooks
  • Manage hybrid (cloud + on-prem) environments
Use Organizations / Control Tower when…
  • Manage multiple AWS accounts centrally
  • Apply guardrails (preventive + detective)
  • Consolidate billing across accounts
  • Provision new accounts with best practices
  • Enforce SCPs to limit account-level actions
Full Service Comparison
ServicePillarData CollectedRetentionScopeBilling
CloudWatchMonitoringMetrics, Logs, Traces15 months (metrics) / configurable (logs)Per-resourcePer metric, log GB, alarm
CloudTrailAuditingManagement & data events90 days (console) / S3 unlimitedPer-account / orgFree (mgmt events) / per data event
AWS ConfigComplianceConfiguration itemsConfigurable (7 yrs default)Per-resource / multi-accountPer config item + rule eval
Systems ManagerOperationsInventory, commands, patches30 days (run history)Fleet / hybridFree (most features) / advanced tiers
OrganizationsGovernanceSCPs, OU structureN/AOrg-wideFree
Control TowerGovernanceGuardrails, landing zoneN/AOrg-wideFree (underlying services billed)
Cost ExplorerCost ControlBilling & usage data12 monthsPer-account / orgFree (API calls billed)
04
Chapter Four

Architecture Patterns

Common Production Patterns

Most production environments combine management services in complementary layers. Here are the three canonical patterns:

πŸ“Š

Pattern 1: Observe β†’ Alert β†’ Heal

CloudWatch β†’ Alarm β†’ SNS β†’ Lambda β†’ SSM Automation
  • Metric crosses threshold β†’ alarm fires
  • SNS triggers Lambda for triage
  • SSM Automation runs remediation runbook
  • Best for: self-healing infrastructure
πŸ”

Pattern 2: Audit β†’ Detect β†’ Respond

CloudTrail β†’ EventBridge β†’ Lambda β†’ SNS
  • Sensitive API call recorded by CloudTrail
  • EventBridge rule catches event in real-time
  • Lambda evaluates and blocks / notifies
  • Best for: security incident response
πŸ›‘οΈ

Pattern 3: Evaluate β†’ Remediate β†’ Report

Config Rule β†’ Non-compliant β†’ SSM Automation β†’ Dashboard
  • Config Rule detects resource drift
  • Auto-remediation via SSM fixes the issue
  • Compliance status reported to dashboard
  • Best for: continuous compliance enforcement
How Management Fits in the Architecture
CloudWatch
Observe
CloudWatch Β· X-Ray
β†’
CloudTrail
Audit
CloudTrail Β· Lake
β†’
Config
Comply
Config Rules
β†’
SSM
Operate
SSM Β· Automation
β†’
Organizations
Govern
Orgs Β· Control Tower
05
Chapter Five

Exam Insights

Exam Decision Hints
If the question says…Think…
"Monitor CPU utilisation" or "set alarm"CloudWatch
"Who deleted the S3 bucket?" or "API call history"CloudTrail
"Is this resource compliant?" or "configuration drift"AWS Config
"Patch instances" or "run command at scale"Systems Manager
"Multi-account strategy" or "SCP"Organizations
"Landing zone" or "guardrails"Control Tower
"Cost anomaly" or "budget alert"Cost Explorer + Budgets
"Configuration timeline" or "resource history"AWS Config
"Centralise logs from multiple accounts"CloudWatch + CloudTrail (org trail)
"Auto-remediate non-compliant resource"AWS Config + SSM Automation
"Store database password securely"SSM Parameter Store (SecureString)
"Best-practice recommendations"Trusted Advisor
Common Exam Traps
TrapReality
"CloudTrail records performance metrics"NO. CloudTrail records API calls. CloudWatch records metrics and logs.
"AWS Config can block non-compliant changes"Config detects and remediates AFTER the change. Use SCPs to prevent changes.
"Systems Manager requires SSH access"SSM Agent uses HTTPS β€” no SSH, no bastion, no open inbound ports.
"CloudWatch Logs retention is automatic"Logs are kept forever by default. You must set retention policy explicitly.
"Organizations SCPs affect the management account"SCPs never apply to the management account. It always has full access.
"Control Tower replaces Organizations"Control Tower builds on top of Organizations, adding guardrails and landing zones.
"Config Rules prevent resource creation"Config Rules are detective, not preventive. They evaluate after creation.
Summary
πŸ“‹ Management & Governance β€” Recap
  • Management = visibility + control. You can't operate what you can't see, audit, or enforce rules on.
  • Core services: CloudWatch (monitor), CloudTrail (audit), Config (comply), Systems Manager (operate), Organizations (govern).
  • CloudWatch + CloudTrail work together β€” metrics show WHAT happened, trail shows WHO did it.
  • Config + SSM Automation is the compliance loop β€” detect drift, auto-remediate, report compliance.
  • Organizations + Control Tower for enterprise multi-account β€” SCPs prevent, guardrails protect, landing zones standardise.
πŸ‘‰ Key Takeaway

Layer your management stack: CloudWatch observes, CloudTrail audits, Config evaluates, Systems Manager operates, and Organizations governs. Each service fills a gap the others cannot β€” use them together for complete operational control.