Skip to main content

Drift Detection Troubleshooting

Resolve configuration drift detection and monitoring issues

troubleshootingdrift detectionmonitoringbaselinesalerts

Drift Detection Troubleshooting

Resolve issues with configuration drift detection including monitoring failures, baseline problems, and alert delivery.

Monitoring Not Running

Schedule Not Active

Symptoms:

  • No new drift events detected
  • Last scan was days/weeks ago
  • Schedule shows inactive

Solutions:

  1. Enable monitoring schedule

    1. Go to Drift Detection → Schedules
    2. Find your schedule
    3. Toggle to Active
    4. Save changes
    
  2. Check schedule configuration

    • Verify frequency is set
    • Check if schedule is paused
    • Review start/end times if configured

No Baselines Configured

Symptoms:

  • Monitoring runs but no drift detected
  • "No baselines to compare" message

Solutions:

  1. Create baselines

    • Go to Drift Detection → Baselines
    • Create baseline for resources to monitor
    • Activate baselines
  2. Verify baseline scope

    • Check which resources are included
    • Ensure relevant resource types selected

Connection Issues

Symptoms:

  • "Unable to fetch configuration" errors
  • Monitoring fails repeatedly

Solutions:

  1. Test M365 connection

    • Settings → Integrations → Test
    • Fix connection issues first
  2. Check permissions

    • Drift detection needs same permissions as assessments
    • Verify all permissions granted

Drift Events Not Appearing

False Silence

Symptoms:

  • Know configuration changed
  • No drift event created
  • History shows no changes

Causes:

  • Change not in monitored scope
  • Baseline doesn't include that setting
  • Comparison rule too lenient

Solutions:

  1. Check baseline scope

    • Review what's included in baseline
    • Add missing resource types
    • Expand monitoring scope
  2. Review comparison rules

    • Rules may be too lenient
    • Exact match vs contains
    • Adjust thresholds
  3. Check resource type coverage

    • Not all M365 settings monitored
    • Review supported resource types

Delayed Detection

Symptoms:

  • Changes detected hours/days later
  • Expecting immediate detection

Causes:

  • Scheduled monitoring (not real-time)
  • Schedule frequency too low

Solutions:

  1. Understand schedule frequency

    • Drift detection runs on schedule
    • Not real-time monitoring
    • Increase frequency if needed
  2. Run manual scan

    • Drift Detection → Run Now
    • Immediate scan for changes

Too Many Drift Events

Alert Fatigue

Symptoms:

  • Hundreds of drift events
  • Minor changes creating noise
  • Can't find important changes

Causes:

  • Baseline too strict
  • Monitoring scope too broad
  • Normal changes flagged

Solutions:

  1. Adjust baseline thresholds

    • Allow minor variations
    • Use regex patterns
    • Exclude volatile settings
  2. Narrow monitoring scope

    • Focus on critical settings
    • Exclude low-risk resources
    • Prioritize by severity
  3. Use severity filtering

    • Focus on high/critical
    • Acknowledge low-priority events
    • Configure alert thresholds

Expected Changes Flagged

Symptoms:

  • Planned changes create drift events
  • Maintenance creates alerts
  • Known-good changes flagged

Solutions:

  1. Update baseline after changes

    • Reflect new expected state
    • Drift Detection → Update Baseline
    • Re-snapshot current state
  2. Acknowledge events

    • Mark as acknowledged
    • Add note explaining change
    • Won't alert again
  3. Use maintenance windows

    • Pause monitoring during changes
    • Resume after updates
    • Update baseline before resuming

Baseline Issues

Baseline Creation Fails

Symptoms:

  • Can't create new baseline
  • "Failed to snapshot" error

Solutions:

  1. Check M365 connection

    • Test connection first
    • Fix any connection issues
  2. Verify permissions

    • Need read access to resource types
    • Check required permissions
  3. Try smaller scope

    • Start with single resource type
    • Expand after successful creation

Baseline Out of Date

Symptoms:

  • Baseline represents old state
  • Many drift events from outdated comparison

Solutions:

  1. Update baseline

    1. Go to Drift Detection → Baselines
    2. Select baseline
    3. Click Update from Current
    4. Confirm update
    
  2. Review before updating

    • Check current drift events
    • Decide which changes to accept
    • Update to new known-good state

Comparison Rule Problems

Symptoms:

  • Wrong settings flagged
  • Expected matches not matching
  • Regex not working

Solutions:

  1. Test comparison rules

    • Preview before saving
    • Test with sample data
    • Adjust patterns
  2. Use simpler rules

    • Start with exact match
    • Add flexibility as needed
    • Document rule logic

Alert Delivery Issues

Not Receiving Alerts

Symptoms:

  • Drift events exist but no alerts
  • Email alerts not arriving
  • Webhook not triggering

Solutions:

  1. Check alert configuration

    1. Drift Detection → Alerts
    2. Verify alerts enabled
    3. Check severity thresholds
    4. Verify delivery method
    
  2. Email delivery issues

    • Check spam/junk folder
    • Verify email address correct
    • Add securtea.io to allowlist
  3. Webhook issues

    • Test webhook URL
    • Check authentication
    • Review webhook logs

Too Many Alerts

Symptoms:

  • Alert overload
  • Every small change triggers alert
  • Can't keep up with notifications

Solutions:

  1. Adjust severity threshold

    • Only alert on high/critical
    • Lower severity → no alert
  2. Configure digest alerts

    • Daily/weekly summary
    • Instead of per-event
  3. Reduce monitoring scope

    • Focus on critical resources
    • Less scope = fewer alerts

Alert Delays

Symptoms:

  • Alerts arrive late
  • Real-time expected but delayed

Solutions:

  1. Understand alert timing

    • Alerts sent after scan completes
    • Not real-time
    • Based on schedule frequency
  2. Check delivery method

    • Webhooks are immediate
    • Email may have delays
    • Check email queue status

History and Snapshots

History Missing

Symptoms:

  • Can't view historical configurations
  • Snapshots not being saved

Causes:

  • Retention period exceeded
  • Snapshots disabled
  • Storage issues

Solutions:

  1. Check retention settings

    • History has retention limits
    • Older snapshots may be purged
  2. Enable snapshot storage

    • Settings → Drift Detection
    • Enable historical snapshots

Can't Compare Versions

Symptoms:

  • Compare feature unavailable
  • Only one version exists

Solutions:

  1. Need multiple snapshots

    • Run monitoring multiple times
    • Wait for more history
  2. Check date range

    • Select appropriate dates
    • Both versions must exist

Performance Issues

Scans Taking Too Long

Symptoms:

  • Monitoring timeout
  • Scans run for hours
  • Progress stuck

Causes:

  • Large tenant
  • Too many resource types
  • API throttling

Solutions:

  1. Reduce scope

    • Monitor fewer resource types
    • Focus on critical resources
    • Split into multiple baselines
  2. Adjust frequency

    • Less frequent = more thorough
    • Trade-off between coverage and speed
  3. Schedule off-peak

    • Run during low-activity periods
    • Avoid busy times

Error Codes

Common Drift Detection Errors

CodeMeaningSolution
DRIFT001No connectionSetup/verify M365 connection
DRIFT002Permission deniedGrant required permissions
DRIFT003Baseline not foundCreate or select baseline
DRIFT004Snapshot failedCheck connection, retry
DRIFT005Comparison errorReview comparison rules
DRIFT006Alert delivery failedCheck delivery settings
DRIFT007Schedule errorReview schedule config

Best Practices

Baseline Management

  1. Start narrow - Small scope initially
  2. Expand gradually - Add resources as needed
  3. Update regularly - After planned changes
  4. Document changes - Note why baseline updated

Alert Configuration

  1. Severity-based - Different actions per severity
  2. Team routing - Right alerts to right people
  3. Digest option - Consider daily summaries
  4. Test alerts - Verify delivery working

Regular Maintenance

  1. Review open events - Don't let them pile up
  2. Acknowledge or resolve - Keep queue manageable
  3. Update baselines - Keep current
  4. Tune thresholds - Reduce noise over time

What's Next?

Global Search

Search for pages, settings, and documentation