Incident Learning Library

Incidents and Postmortems

Separate operational incidents from finalized postmortems, then coach by incident lead trends.

100 records 100 incidents 0 postmortems 100 without PM

Filtered View

100

records match current filters

Incident Feed

100 visible
01KJ4C8DG0EMFJXH75R1AJQBR9 shared medium resolved incident

ALARM: "tb-ecomm-prod-hybris-incrementals-alarm" in US East (N. Virginia)

2/23/2026

n/a

01KJ45E02K07DCN4KAZGGC2SCA connect high resolved incident

Switchboard unable to dispatch events to KFC US endpoints in production

• Org: kfc_us • Env: production • Service: switchboardserviceretry1m ## Key Information: > • Switchboard is unable to dispatch events to kfc_us > > • This error message typically occurs when switchboard fails to dispatch switchboard events to the brands. > > • Switchboard events are dispatched to KFC and TB end points via Akamai. If there is a networking error at Yum end and the event did not reach Akamai there will be no reference id generated at Akamai for that dispatched event. ## Troubleshooting Steps: 1. Please look at the [switchboard US dashboard](https://app.datadoghq.com/dashboard/6x9

2/23/2026

n/a

Sarita Maheshwari

01KJ4091P9HVNRRJTTAG1VJNH4 shared medium resolved incident

Duplicate customer email IDs in customer_signal_profile table

C360_O67 : 53 duplicate customer email IDs present in customer360.customer_signal_profile table Confluence P&P Link- C360_O67_P&P

2/23/2026

n/a

01KJ3ZSYZS3EN08H2P6BHN80S0 shared low resolved incident

Singel Store Error in Delivery-DGS for kfc_scy_us - The requested StoreDeliveryProvider has not be enabled at this store

**Problem**: Delivery-DGS service reported failed or unknown action outcomes due to a provider not enabled at one store. **Impact**: Only a single store (V000021) in the kfc_scy_us organization is affected. There is no evidence of a wider or sustained issue. **Causes**: The error was triggered because the requested StoreDeliveryProvider has not been enabled at store V000021. **Steps to resolve**: We confirmed the issue is limited to one store and will follow up on this specific configuration tomorrow.

2/23/2026

n/a

Adam Schweitzer

01KJ3YSK0T71Z2ZQS8HZYG2C57 shared low resolved incident

This is just a test please ignore

This is just a test please ignore

2/23/2026

n/a

Christian Manning

01KJ3XDNED17G6V11P72EJNN5R commerce low resolved incident

Anomalous increase in payment declines for Mobile channel on HB platform

• Org: hb_us • Env:production • Service: payment-dgs • Client App ID: 7viorc6ovt9v7s38sv7i3iqe6k Key Information:This monitors Payment declines from HB on Yum Platform. • client.id:7viorc6ovt9v7s38sv7i3iqe6k - Mobile • client.id:5o5qms1h402q83ntvdj4ei26tf - Web • client.id:3k6kpnbek6dvf99t77l0lnl24h - Call Center Troubleshooting Steps: 1. Please review dashboard and logging to determine if any outliers and common decline reasons. 2. Identify if there is a service disruption with Fiserv or credit card provider. [Link To Decline Reasons](https://app.datadoghq.com/logs?query=env%3Aprod%2A%20servi

2/23/2026

n/a

01KJ3WYZGMAYQF3F3K4C8H04GH commerce medium resolved incident

Ecomm order_lineitems data completeness at 79.7% in ecomm_dv.sat_sh_order_entry for DO_C360_yumecomm_O43

C360_yumecomm_O43 : 79.7252802747197252900% of records are there in ecomm_dv.sat_sh_order_entry Confluence P&P Link- DO_C360_yumecomm_O43_P&P

2/23/2026

n/a

01KJ3SFE4S05X73HAJY1JREF96 commerce medium resolved incident

0.08% of Order Details missing in Lakehouse Headers for C360_13_Hybris daily load

C360_13_Hybris : 0.08% of Order Details which do not have records in Lakehouse Headers for 2026-02-21 Confluence P&P Link- C360_O13_Hybris_P&P

2/22/2026

n/a

01KJ3P1AJQSAEET23QNWDBE0TX commerce medium resolved incident

GH_O2 sat_gh_order_line files not received in S3 for 20260222

2/22/2026

n/a

01KJ3P14S5PTT999K81R0884SJ commerce medium resolved incident

GH_O2 sat_gh_order_adjustment files missing from S3 for 20260222

2/22/2026

n/a

01KJ3P059QMJW4X4XAVSQPNV8B shared medium resolved incident

GH_O2 sat_gh_menu_item files not received in S3 for 20260222

2/22/2026

n/a

01KJ3KXQ3YNN1SJFZXREM323ZH commerce low resolved incident

Increase in payment declines for HB on Mobile channel

• Org: hb_us • Env:production • Service: payment-dgs • Client App ID: 7viorc6ovt9v7s38sv7i3iqe6k Key Information:This monitors Payment declines from HB on Yum Platform. • client.id:7viorc6ovt9v7s38sv7i3iqe6k - Mobile • client.id:5o5qms1h402q83ntvdj4ei26tf - Web • client.id:3k6kpnbek6dvf99t77l0lnl24h - Call Center Troubleshooting Steps: 1. Please review dashboard and logging to determine if any outliers and common decline reasons. 2. Identify if there is a service disruption with Fiserv or credit card provider. [Link To Decline Reasons](https://app.datadoghq.com/logs?query=env%3Aprod%2A%20servi

2/22/2026

n/a

01KJ3HHDCYKQGHDGN655BKFWCD commerce low resolved incident

Order rejection rate exceeds 4% in PH_UK due to invalid miObjectNum and possible POS or menu issues

• Org: ph_uk • Env: prod-hopper • Service: workflowservice **Key Information** Rejection Reason: Invalid miObjectNum: 20112|1This monitor has generated an alert due to 4 percent or more orders have been rejected by restaurant(s) within the last 1 hour. There are many reasons that can reject an order. Typically stores being closed and/or POS is not ready to accept online orders, menu mismatch between YUM and POS. More information can be found in confluence: https://yumbrands.atlassian.net/wiki/spaces/YCC/pages/2621440058/ORDER+PROD+-+PH+UK+-+Rejected+Orders **What do Rejected Orders mean?** • *

2/22/2026

n/a

Adam Schweitzer

01KJ3GMVXVY3645442P138AV75 shared medium resolved incident

Webex Contact Center Unavailable

**Problem:** Service Desk analysts are unable to access and/or change status in Webex Contact Center. Analysts are also getting kicked out of Webex application. **Impact:** Taco Bell Service Desk analysts are unable to assist stores with troubleshooting. Restaurants are stuck on hold waiting for an analyst to answer. **Root Cause:** TBD **Steps to Resolve:** TBD. Issue is now resolved. Affected users are able to access Webex and take calls again.

2/22/2026

n/a

Anthony Ku

01KJ3GA9TA7FTN0A26N2V00HHY commerce medium resolved incident

User aborted requests on payment-dgs during payment operations

• Org: kfc_us • Env: production • Service: payment-dgs Detects user aborted requests. Usually triggers during Fiserv service disruptions. Please review logs to determine what type of operation is in danger of failing and which brand is impacted. Troubleshooting guide: @slack-payments-alerts-production @webhook-reops-incident-io@webhook-byte-incident-io The monitor was last triggered at Sun Feb 22 2026 14:22:06 CST. ---- [[Monitor Status](https://app.datadoghq.com/monitors/96391746?group=%40organization%3Akfc_us%2Cenv%3Aproduction&from_ts=1771790826000&to_ts=1771792026000&event_id=8514922331350

2/22/2026

n/a

01KJ3G96Y7VB8CT1ZARBR03MFX commerce medium resolved incident

Fiserv uCom call failures in payment-dgs for kfc_us

• Org: kfc_us • Env: production • Service: payment-dgs > [Troubleshooting Guide](https://yumbrands.atlassian.net/wiki/spaces/YCSD/pages/1518469121/Steps+for+Payment+500+Errors) Key Information: > Escalate Alert state incidents to Fiserv.If events are significantly greater than the historical trend, there may be an outage. Troubleshooting Steps: > 1. Identify what event(s) failed, how many and which Brand is impacted. > > 2. Reach out to the brand integration team to confirm if they are encountering/observing any issues. > > 3. Escalate the incident(s) to Fiserv. @slack-datadog-alerts @webhook-

2/22/2026

n/a

01KJ3F7ZAEBDFZK9V76CT37VQ5 commerce medium resolved incident

Order header data completeness for DO_C360_yumecomm_O42 at 55% in ecomm_dv.sat_sh_order

C360_yumecomm_O42 : 55.2434897749006914900% of records are there in ecomm_dv.sat_sh_order Confluence P&P Link- DO_C360_yumecomm_O42_P&P

2/22/2026

n/a

01KJ3F72YFP813N8GT5BDNT5DT commerce medium resolved incident

Ecomm order_lineitems data completeness for DO_C360_yumecomm_O43 at 56% in lakehouse YumEcomm

C360_yumecomm_O43 : 56.0333015608244966100% of records are there in ecomm_dv.sat_sh_order_entry Confluence P&P Link- DO_C360_yumecomm_O43_P&P

2/22/2026

n/a

01KJ3DE1TZ85NP2DZ6YHR97BZG commerce medium resolved incident

All Order Details missing Headers in Redshift C360_O10_Grubhub daily load for 2026-02-22

C360_10_grubhub : 100.0 % of Order Details which do not have records in Headers for 2026-02-22 Confluence P&P Link- C360_O10_Grubhub_P&P

2/22/2026

n/a

01KJ3DE0YYMQBEPM4G8NE4CM2A commerce medium resolved incident

0.08% of Order Details missing Headers in C360_O10_Hybris Redshift daily load

C360_10_hybris : 0.08 % of Order Details which do not have records in Headers for 2026-02-22 Confluence P&P Link- C360_O10_Hybris_P&P

2/22/2026

n/a

01KJ3DDCG9C46BDMXP0DVNWXSQ commerce medium resolved incident

All order headers missing details in C360_O5_Grubhub on 2026-02-22

C360_O5_Grubhub : distinct orders - 100.0% Order Headers do not have records in Details for 2026-02-22 Confluence P&P Link- C360_O5_O6_P&P

2/22/2026

n/a

01KJ37PV9KHK8BNQWQF6TQB36S pos medium resolved incident

Increased rate of stores marked OFFLINE in Olympus (possible connection instability)

• Org: tb_uk • Env: prod-hopper • Service: olympus-daemons ## Key Information: > This monitor tracks when a store status has successfully change to OFFLINE in Olympus.This is a percent based monitor that tracks the total percent of stores that have changed their store status in the last 30 minutes > > > Warning is set to 3.0. Alert is set to 5.0This can be due to a network outage for brands causing stores to drop offlineThis could also indicate an issue with Olympus erroneously marking stores as OFFLINE ## Troubleshooting Steps: Confirm Total Store Count of Offline Stores > US: [Olympus Store

2/22/2026

n/a

01KJ37K5MPZ9FT4HCY81N92VPE shared medium resolved incident

Container forcibly terminated due to out of memory in backend-users-deployment-green pod

• Org: • Env: prod • Service: A container in prod was forcibly terminated • indicates either a memory leak or insufficient memory limit • engage on-call SRE to determine impact if monitor enters Alert state or remains active for a prolonged period @webhook-incident-io-superapp @khatibur.rahman@yum.com@webhook-byte-incident-io The **count** of events matching **[Task ran out of memory env:prod*](https://app.datadoghq.eu/event/explorer?query=Task+ran+out+of+memory+env%3Aprod%2A&from_ts=1771782250000&to_ts=1771782550000&live=false)**, grouped by **env,kube_cluster_name,pod_name**, was **> 0.0** d

2/22/2026

n/a

01KJ35AEYRZNNEH6KJN0AECPCS shared medium resolved incident

voc - daily - voc_talend_daily_prod - Failure - Taco Bell - Severity 1

2/22/2026

n/a

01KJ35668J6YEYB0M0Z93E18HC shared medium resolved incident

voc - daily - voc_lake_daily_prod - Failure - Taco Bell - Severity 1

2/22/2026

n/a

01KJ353BNK1NYS1YSPE74EVY46 shared medium resolved incident

Mismatch between Athena and Redshift HME data for multiple stores

2/22/2026

n/a

01KJ3510FHAWXE49DYAJTDQ8GX shared medium resolved incident

Uber Eats daily data missing for 2026-02-22

2/22/2026

n/a

01KJ31XAB19P0X0EHXXV0014PP shared medium resolved incident

Stores with blank Owner or DMA details detected

2/22/2026

n/a

01KJ31KBF7935WW4ZVYPN564QJ shared medium resolved incident

Stores with blank owner or DMA details detected

2/22/2026

n/a

01KJ2W9QNB4R20TKAH4X4K2N0P shared medium resolved incident

Stores have blank Owner or DMA details

2/22/2026

n/a

01KJ2W6WMZ0GSRMJCGDNGKKSXB shared medium resolved incident

Stores with missing owner or DMA details in tbcdb alignment tables

2/22/2026

n/a

01KJ2TKVX9C8W6WQM1NXMVKBP7 pos medium resolved incident

Mismatch between store status in Alignment data and POS transaction activity

2/22/2026

n/a

01KJ2SYX2GMS43W31CCNZKSXXP shared medium resolved incident

Team members unable to access QR code to launch Best Voice Survey in Best Voices Micro App

Team members are currently unable to access the QR code needed to launch the Best Voice Survey in the Best Voices micro app.

2/22/2026

n/a

01KJ2SC2NSR7E9P3TPT24TXK9M shared medium resolved incident

Mismatch in LMS store alignment and user certification records

2/22/2026

n/a

01KJ2MKVXYG149816NMDH48KXJ shared medium resolved incident

Mismatch in VOC survey data between Athena and Redshift for multiple dates

2/22/2026

n/a

01KJ22R01ZKBXWJMS90469HYY4 shared medium resolved incident

20260222|P2:FIN Please check FIN Files availability for kfc and ph

2/22/2026

n/a

01KJ1SVPFS6YCRBABAYZ80BFXR shared medium resolved incident

ALARM: "tb-ecomm-prod-hybris-incrementals-alarm" in US East (N. Virginia)

2/22/2026

n/a

01KJ1MV4E33FYRDWEQ0X9K8N48 shared medium resolved incident

LMS_O1 tb_course_object_completion files missing from S3 for 20260222

2/22/2026

n/a

01KJ1EGNR2JP3C9SJNYFA32X3Q commerce medium resolved incident

PH US Pick Up Order Flow 2 synthetic test failed on Pizza Hut website

This browser synthetic test is basic. It tests for basic user functionality on Pizza Hut's website and refreshes orders on Customer Support with default filter. • If this fails, please help determine which step and try to manually recreate. • Identify if there are any deployments that may have caused this to fail. • Review with Support leads and then PH team if requiring additional help. • Plans to improve this in the future.@caleb.harris@yum.com@webhook-byte-incident-io The monitor was last triggered at Sat Feb 21 2026 19:13:09 CST. ---- [[Test Status](https://app.datadoghq.com/synthetics/det

2/22/2026

n/a

01KJ1E1HTAKXJAYYPRHFKZYTNV shared medium resolved incident

Duplicate customer email IDs found in customer_signal_profile table

C360_O67 : 52 duplicate customer email IDs present in customer360.customer_signal_profile table Confluence P&P Link- C360_O67_P&P

2/22/2026

n/a

01KJ1AFEMQXFB2T93XHD93NTTQ commerce medium resolved incident

Latency anomaly in Temporal Cloud affecting workflow processing for yc-hopper-prod-promotions.bd6e0 namespace

• Env: production • Service: Temporal • Namespace: yc-hopper-prod-promotions.bd6e0 ## Key Information: Temporal Cloud integration reports latency anomaly _on their end_, which could delay workflow processing.Multiple services utilize Temporal in different ways > P2 - Order namspace as this affects order workflowsP3 - Cart, Menu, Payments, Loyalty, Promotions - Latency in these namespaces will not affect business continuity, but is latency occurs for longer than 1 hour it cause service degredation. ## Troubleshooting Steps: 1. For non order namespaces, check if latency recovers within 1 hour 2.

2/22/2026

n/a

01KJ1736DYTBT028CW6QSH1MXF commerce medium resolved incident

0.09% of Order Details missing in Lakehouse Headers for 2026-02-20 in C360_13_Hybris daily load

C360_13_Hybris : 0.09% of Order Details which do not have records in Lakehouse Headers for 2026-02-20 Confluence P&P Link- C360_O13_Hybris_P&P

2/21/2026

n/a

01KJ1734ZH1J1C8MWFW5W6MEGJ shared medium resolved incident

Missing DoorDash daily data for 2026-02-21

2/21/2026

n/a

01KJ16GB3W1G8HNZWM57BY87QX commerce medium resolved incident

Customer login flow failure on PH US Pick Up Order for Ultimate Hut Bundle

This browser synthetic test is basic. It tests for basic user functionality on Pizza Hut's website and refreshes orders on Customer Support with default filter. • If this fails, please help determine which step and try to manually recreate. • Identify if there are any deployments that may have caused this to fail. • Review with Support leads and then PH team if requiring additional help. • Plans to improve this in the future.@caleb.harris@yum.com@webhook-byte-incident-io The monitor was last triggered at Sat Feb 21 2026 16:53:09 CST. ---- [[Test Status](https://app.datadoghq.com/synthetics/det

2/21/2026

n/a

01KJ164E883MJFSKRFTN40ETBH shared medium resolved incident

Container in backend-users-deployment-green-57cf4cbf4b-fbzch pod terminated due to out of memory

• Org: • Env: prod • Service: A container in prod was forcibly terminated • indicates either a memory leak or insufficient memory limit • engage on-call SRE to determine impact if monitor enters Alert state or remains active for a prolonged period @webhook-incident-io-superapp @khatibur.rahman@yum.com@webhook-byte-incident-io The **count** of events matching **[Task ran out of memory env:prod*](https://app.datadoghq.eu/event/explorer?query=Task+ran+out+of+memory+env%3Aprod%2A&from_ts=1771713610000&to_ts=1771713910000&live=false)**, grouped by **env,kube_cluster_name,pod_name**, was **> 0.0** d

2/21/2026

n/a

01KJ15DGXJFQDDHD2GWTFVW901 commerce low resolved incident

Increase in payment declines for Habit Burger Web channel

• Org: hb_us • Env:production • Service: payment-dgs • Client App ID: 5o5qms1h402q83ntvdj4ei26tf Key Information:This monitors Payment declines from HB on Yum Platform. • client.id:7viorc6ovt9v7s38sv7i3iqe6k - Mobile • client.id:5o5qms1h402q83ntvdj4ei26tf - Web • client.id:3k6kpnbek6dvf99t77l0lnl24h - Call Center Troubleshooting Steps: 1. Please review dashboard and logging to determine if any outliers and common decline reasons. 2. Identify if there is a service disruption with Fiserv or credit card provider. [Link To Decline Reasons](https://app.datadoghq.com/logs?query=env%3Aprod%2A%20servi

2/21/2026

n/a

01KJ14BPC7XZ04DNJG2H6QQXG8 shared medium resolved incident

Container in backend-stores-deployment-blue pod was terminated due to out of memory

• Org: • Env: prod • Service: A container in prod was forcibly terminated • indicates either a memory leak or insufficient memory limit • engage on-call SRE to determine impact if monitor enters Alert state or remains active for a prolonged period @webhook-incident-io-superapp @khatibur.rahman@yum.com@webhook-byte-incident-io The **count** of events matching **[Task ran out of memory env:prod*](https://app.datadoghq.eu/event/explorer?query=Task+ran+out+of+memory+env%3Aprod%2A&from_ts=1771711750000&to_ts=1771712050000&live=false)**, grouped by **env,kube_cluster_name,pod_name**, was **> 0.0** d

2/21/2026

n/a

01KJ13M7RQDQ2030K91N9GQMJX commerce medium resolved incident

GH_O2 sat_gh_order_adjustment files not received in S3 for 20260221

2/21/2026

n/a

01KJ0WVGB3HY5XADMKBCN7351P commerce medium resolved incident

Ecomm order_lineitems data completeness for DO_C360_yumecomm_O43 at 25% in lakehouse YumEcomm

C360_yumecomm_O43 : 25.0125535371437010800% of records are there in ecomm_dv.sat_sh_order_entry Confluence P&P Link- DO_C360_yumecomm_O43_P&P

2/21/2026

n/a

01KJ0V583G1FNRAF32928K1XSD commerce medium resolved incident

0.09% of Order Details missing matching Headers in C360_O10_Hybris Redshift load for 2026-02-21

C360_10_hybris : 0.09 % of Order Details which do not have records in Headers for 2026-02-21 Confluence P&P Link- C360_O10_Hybris_P&P

2/21/2026

n/a

01KJ0V37JS8K4Q2XJYQ8WMZ01K commerce medium resolved incident

0% accuracy in C360_O10_Grubhub Redshift daily data load: all Order Details missing Headers for 2026-02-21

C360_10_grubhub : 100.0 % of Order Details which do not have records in Headers for 2026-02-21 Confluence P&P Link- C360_O10_Grubhub_P&P

2/21/2026

n/a

01KJ0V30Y34C8P9S98GP3JX1C0 commerce medium resolved incident

Failure in Pizza Hut customer login browser synthetic test for US Pick Up Order Flow 2 - Ultimate Hut Bundle

This browser synthetic test is basic. It tests for basic user functionality on Pizza Hut's website and refreshes orders on Customer Support with default filter. • If this fails, please help determine which step and try to manually recreate. • Identify if there are any deployments that may have caused this to fail. • Review with Support leads and then PH team if requiring additional help. • Plans to improve this in the future.@caleb.harris@yum.com@webhook-byte-incident-io The monitor was last triggered at Sat Feb 21 2026 13:33:39 CST. ---- [[Test Status](https://app.datadoghq.com/synthetics/det

2/21/2026

n/a

01KJ0V22MXVPNWKNDKRAH3SP4R commerce medium resolved incident

All Order Headers missing Customer Order Details for Grubhub on 2026-02-21

C360_O5_Grubhub : distinct orders - 100.0% Order Headers do not have records in Details for 2026-02-21 Confluence P&P Link- C360_O5_O6_P&P

2/21/2026

n/a

01KJ0SA4NGQPQK2VASVG9N0DCD commerce medium resolved incident

Latency anomaly in Temporal Cloud causing delayed workflow processing in yc-alpha-prod-promotions.bd6e0 namespace

• Env: production • Service: Temporal • Namespace: yc-alpha-prod-promotions.bd6e0 ## Key Information: Temporal Cloud integration reports latency anomaly _on their end_, which could delay workflow processing.Multiple services utilize Temporal in different ways > P2 - Order namspace as this affects order workflowsP3 - Cart, Menu, Payments, Loyalty, Promotions - Latency in these namespaces will not affect business continuity, but is latency occurs for longer than 1 hour it cause service degredation. ## Troubleshooting Steps: 1. For non order namespaces, check if latency recovers within 1 hour 2.

2/21/2026

n/a

01KJ0QW6QHTRKDVKZ4B7V9T4HN commerce medium resolved incident

Order deserialization failures detected in KFC digital ordering

**Problem**: Order deserialization failures detected in KFC digital ordering. The system recorded 2 failures for specific stores during the latest monitoring window. **Impact**: Two digital orders failed to process for KFC stores E685066 and E685073 in the last hour. No additional failures have been observed, and the issue appears isolated to these stores. It is not yet clear if customers were impacted or able to retry. **Causes**: Unknown **Steps to resolve**: We have notified the INT and DM teams to review the issue on Monday and will continue monitoring for increased errors.

2/21/2026

n/a

Adam Winders

01KJ0PZQK7CCKHEBC0T1V3664H shared medium resolved incident

ALARM: "TB-BI-Talend-Ubuntu-Prod" in US East (N. Virginia)

2/21/2026

n/a

01KJ0NXGYQ43N2DDSW1FWE479X shared medium resolved incident

DD_O2 DoorDash/Sales_report files not received in S3 for 20260221

2/21/2026

n/a

01KJ0NXGY689HEBCJYK2J0SYJ1 shared medium resolved incident

DD_O2 DoorDash Operations_summary files not received in S3 for 20260221

2/21/2026

n/a

01KJ0NXFF79EM7YZB7X3H6Q7KE shared medium resolved incident

DD_O2 DoorDash/Refunds_data files not received in S3 for 20260221

2/21/2026

n/a

01KJ0NV3NECSQJ83Q8BHPRS9J0 shared medium resolved incident

ALARM: "TB-BI-Talend-Ubuntu-Prod" in US East (N. Virginia)

2/21/2026

n/a

01KJ0N8QJK4H0RH7NX060HK4XG commerce medium resolved incident

Lower than expected payment authorization attempts by tender type for ph_mx

• Org: ph_mx • Env: • Service: payment-dgs • TenderType: **Key Information** • Payment authorization attempts are lower than usual. • This anomaly tracks payment auth's and triggers when rates are below normal levels by Tender Type. I.e. Apple Pay, Visa, Mastercard, etc. • These can be caused by an issue stemming from Fiserv, bank or payment processor • This monitor does not give insight into root cause **Troubleshooting Steps** • Identify which tender type is impacted [Payment Authorization by tender type](https://app.datadoghq.com/logs?query=kube_cluster_name%3Aprod%2A%20service%3A%28commerc

2/21/2026

n/a

01KJ0N54JDQ7CFGSYCQSGYQDYH shared medium resolved incident

ALARM: "TB-BI-Talend-Ubuntu-Prod" in US East (N. Virginia)

2/21/2026

n/a

01KJ0M5JDDZ0MEES932WA0C6G9 connect medium resolved incident

Increased rate of stores marked OFFLINE in Olympus due to connection instability

• Org: tb_uk • Env: prod-hopper • Service: olympus-daemons ## Key Information: > This monitor tracks when a store status has successfully change to OFFLINE in Olympus.This is a percent based monitor that tracks the total percent of stores that have changed their store status in the last 30 minutes > > > Warning is set to 3.0. Alert is set to 5.0This can be due to a network outage for brands causing stores to drop offlineThis could also indicate an issue with Olympus erroneously marking stores as OFFLINE ## Troubleshooting Steps: Confirm Total Store Count of Offline Stores > US: [Olympus Store

2/21/2026

n/a

01KJ0JVZPA32MWJ9HKFVX8RM0Q shared medium resolved incident

voc - daily - voc_talend_daily_prod - Failure - Taco Bell - Severity 1

2/21/2026

n/a

01KJ0JV213CSHZG7S9KC6VH0XF shared medium resolved incident

Missing Uber Eats data for 2026-02-21 (UE_O1)

2/21/2026

n/a

01KJ0JS76EFWXHMFEJ3XQ1SBHX shared medium resolved incident

Mismatch between Athena and Redshift HME data for multiple stores

2/21/2026

n/a

01KJ0JRZ0QHY7HMX6WKS5V9HVP shared medium resolved incident

voc - daily - voc_lake_daily_prod - Failure - Taco Bell - Severity 1

2/21/2026

n/a

01KJ0HBRKKTATQCZS3Z0KQ4GBW commerce medium resolved incident

Customer login flow failing on Pizza Hut website synthetic test

This browser synthetic test is basic. It tests for basic user functionality on Pizza Hut's website and refreshes orders on Customer Support with default filter. • If this fails, please help determine which step and try to manually recreate. • Identify if there are any deployments that may have caused this to fail. • Review with Support leads and then PH team if requiring additional help. • Plans to improve this in the future.@caleb.harris@yum.com@webhook-byte-incident-io The monitor was last triggered at Sat Feb 21 2026 10:43:39 CST. ---- [[Test Status](https://app.datadoghq.com/synthetics/det

2/21/2026

n/a

01KJ0FPP9FG6QSF0VWQZ5ZG9G5 shared medium resolved incident

Container in backend-dashboards-deployment-green pod terminated due to out of memory

• Org: • Env: prod • Service: A container in prod was forcibly terminated • indicates either a memory leak or insufficient memory limit • engage on-call SRE to determine impact if monitor enters Alert state or remains active for a prolonged period @webhook-incident-io-superapp @khatibur.rahman@yum.com@webhook-byte-incident-io The **count** of events matching **[Task ran out of memory env:prod*](https://app.datadoghq.eu/event/explorer?query=Task+ran+out+of+memory+env%3Aprod%2A&from_ts=1771690090000&to_ts=1771690390000&live=false)**, grouped by **env,kube_cluster_name,pod_name**, was **> 0.0** d

2/21/2026

n/a

01KJ0E2MDGTCFXTQ6CENXKDRKB commerce high resolved incident

Kiosk orders at V000005 spectro site not reaching KDS pack monitors due to unable to poll errors

2/21/2026

n/a

01KJ0BD06ZB2XBZ1AV88CPAAPJ shared medium resolved incident

Unable to perform evening checks in HutBot at Pizza Hut Delivery - Redditch due to no active shifts

Users at Pizza Hut Delivery - Redditch are unable to perform evening checks in the HutBot app due to a message indicating there are no active shifts, despite using up-to-date iPhones.

2/21/2026

n/a

01KJ09X27HQ9BTEQ8VSR47FCPF shared medium resolved incident

Stores with blank Owner or DMA details detected

2/21/2026

n/a

01KJ09TET9Z78VKNV0CVJ8984V shared medium resolved incident

Stores with blank owner or DMA details in tbcdb.ti_incident_store_alignment or tbcdb.align_dim

2/21/2026

n/a

01KJ088YT6CTT1FJHS3Z76S9XH pos medium resolved incident

Discrepancy between Alignment store status and POS transaction activity

2/21/2026

n/a

01KJ07RGM3D7Q7W8QQYZRXDW2E coach medium resolved incident

Stores not appearing for ROCC Audits in Byte Coach Tool

Some stores that are open in SSAM are not appearing in the Byte Coach Tool for ROCC Audits, as reported by a user accessing the Restaurant Hub application.

2/21/2026

n/a

01KJ06ZHP9QQHV1JJQERAPXSSC shared medium resolved incident

Mismatch of LMS store alignment records leading to incorrect user hierarchy and certification

2/21/2026

n/a

01KJ0236QD2X5B4RRCMDDG35FW shared medium resolved incident

VOC survey data mismatch between Athena and Redshift tables for multiple dates

2/21/2026

n/a

01KHZRM4RMYY7Y85MB3567PVB0 commerce medium resolved incident

UE_O2 daily_order files missing in S3 for 20260221

2/21/2026

n/a

01KHZG59BPTVBC7ABCG9WCPD8Z shared medium resolved incident

20260221|P2:FIN Please check FIN Files availability for kfc and ph

2/21/2026

n/a

01KHZ7EZEG8Z0BKP6BJNEXHPMD shared medium resolved incident

ALARM: "tb-ecomm-prod-hybris-incrementals-alarm" in US East (N. Virginia)

2/21/2026

n/a

01KHZ5YVV3EK1PR0VS61CRC3WT shared medium resolved incident

[Astro Alert] WARNING: DAG run failed for tango_sites_s3_sf_extr

**Alert Notification Time:** not set **DAG Name:** 2026-02-21T04:05:05Z **Deployment Name:** tango_sites_s3_sf_extr **Workspace Name:** ph_us_astro3_dataeng_hp_prod **DAG Fail Time:** PizzaHut Data Prod

2/21/2026

n/a

01KHZ2GJPKZ7XJBHEEE6FA8AFQ shared medium resolved incident

Unable to submit report in HutBot app for store 1PR328-42 due to frozen submit option

A report could not be submitted to store 1PR328-42 using the HutBot app, as the submit option is frozen and the data is not backed up on the mobile device.

2/21/2026

n/a

01KHYVFX98MG0PY1SZJMDNK57S shared medium resolved incident

Duplicate customer email IDs in customer360.customer_signal_profile table

C360_O67 : 53 duplicate customer email IDs present in customer360.customer_signal_profile table Confluence P&P Link- C360_O67_P&P

2/21/2026

n/a

01KHYRAESG6HAGRJ2829S48AEZ commerce medium resolved incident

Ecomm order_lineitems data completeness for YumEcomm at 79.53% in ecomm_dv.sat_sh_order_entry

C360_yumecomm_O43 : 79.5331130418707828900% of records are there in ecomm_dv.sat_sh_order_entry Confluence P&P Link- DO_C360_yumecomm_O43_P&P

2/21/2026

n/a

01KHYMQTW44B170NJY2DGGEDWV commerce medium resolved incident

0.09% of Order Details missing in Lakehouse Headers for 2026-02-19 daily load

C360_13_Hybris : 0.09% of Order Details which do not have records in Lakehouse Headers for 2026-02-19 Confluence P&P Link- C360_O13_Hybris_P&P

2/20/2026

n/a

01KHYH6XC54CEFZ8G0XSD6MVBQ commerce medium resolved incident

GH_O2 sat_gh_order_adjustment files not received in S3 for 20260220

2/20/2026

n/a

01KHYB5920QKDW92XN0G18YBFB shared medium resolved incident

Stores with blank Owner or DMA details

2/20/2026

n/a

01KHYAH21HNR0NPA2HB8KBQB3A commerce medium resolved incident

Low data completeness for order_lineitems in ecomm_dv.sat_sh_order_entry (21.7%)

C360_yumecomm_O43 : 21.6863721219663970200% of records are there in ecomm_dv.sat_sh_order_entry Confluence P&P Link- DO_C360_yumecomm_O43_P&P

2/20/2026

n/a

01KHY96TJYYRW11V0Z9RQJHAYY coach medium resolved incident

Unable to enter QR code to launch Best Voices Survey in Micro App

Users are unable to enter the QR code to launch the Best Voices Survey within the best voices micro app of Byte Coach.

2/20/2026

n/a

01KHY8NX3RKX2N7674SG27S27R commerce medium resolved incident

All Order Details missing matching Headers in C360_O10_Grubhub Redshift daily data load

C360_10_grubhub : 100.0 % of Order Details which do not have records in Headers for 2026-02-20 Confluence P&P Link- C360_O10_Grubhub_P&P

2/20/2026

n/a

01KHY8NH1N3RB7ZSSFTZD5PWFG commerce medium resolved incident

All Grubhub order headers missing details in Customer Order Details for 2026-02-20

C360_O5_Grubhub : distinct orders - 100.0% Order Headers do not have records in Details for 2026-02-20 Confluence P&P Link- C360_O5_O6_P&P

2/20/2026

n/a

01KHY8NGN1DCY8D111VCW9527X commerce medium resolved incident

0.09% of Order Details missing Headers in C360_O10_Hybris Redshift daily load

C360_10_hybris : 0.09 % of Order Details which do not have records in Headers for 2026-02-20 Confluence P&P Link- C360_O10_Hybris_P&P

2/20/2026

n/a

01KHY4SW3BRJ4FYA0V71XMWDJ6 commerce medium resolved incident

Browser synthetic test failure on PH US Pick Up Order Flow 4 - Wings Menu

This browser synthetic test is basic. It tests for basic user functionality on Pizza Hut's website and refreshes orders on Customer Support with default filter. • If this fails, please help determine which step and try to manually recreate. • Identify if there are any deployments that may have caused this to fail. • Review with Support leads and then PH team if requiring additional help. • Plans to improve this in the future. • @webhook-byte-incident-io • @caleb.harris@yum.com The monitor was last triggered at Fri Feb 20 2026 12:25:41 CST. ---- [[Test Status](https://app.datadoghq.com/syntheti

2/20/2026

n/a

01KHY3ZT22G0S6EV1D4S8AVBHY shared medium resolved incident

Incorrect Champs ID for KFC 147 and KFC137 in reporting

An error with the CHAMPS ID has been reported for KFC stores 147 and 137 in reporting, specifically affecting the Domo Reports - Aggregator Data application and the Routines Summary (12).xlsx data flow.

2/20/2026

n/a

01KHY27W02P4MQJSGXFC323JV5 commerce medium resolved incident

Lower than usual payment authorization attempts detected by tender type for ph_mx

• Org: ph_mx • Env: • Service: payment-dgs • TenderType: **Key Information** • Payment authorization attempts are lower than usual. • This anomaly tracks payment auth's and triggers when rates are below normal levels by Tender Type. I.e. Apple Pay, Visa, Mastercard, etc. • These can be caused by an issue stemming from Fiserv, bank or payment processor • This monitor does not give insight into root cause **Troubleshooting Steps** • Identify which tender type is impacted [Payment Authorization by tender type](https://app.datadoghq.com/logs?query=kube_cluster_name%3Aprod%2A%20service%3A%28commerc

2/20/2026

n/a

Manuela Padilla

01KHY272JVD5N871W2BG02FGHW shared medium resolved incident

ARL mic6152 missing stores under their alignment

Incident INC11809924 reports that ARL mic6152 is missing stores under their alignment and has been assigned to the YUM GLB EasyOps group.

2/20/2026

n/a

01KHY0HJ0K66CNEB3FZXRR8MJH shared medium resolved incident

Uber Eats daily data missing for 2026-02-20 (UE_O1)

2/20/2026

n/a

01KHY099VGTT20EAXE62DSZNMG shared medium resolved incident

HME data mismatch between Athena and Redshift tables for multiple stores

2/20/2026

n/a

01KHY05ZG6MKF9GFJ23NWJTE9Q shared medium resolved incident

[Astro Alert] WARNING: DAG run failed for sf2fourth_recon_trigger

**Alert Notification Time:** not set **DAG Name:** 2026-02-20T17:04:47Z **Deployment Name:** sf2fourth_recon_trigger **Workspace Name:** ph_us_astro3_dataeng_hp_prod **DAG Fail Time:** PizzaHut Data Prod

2/20/2026

n/a

01KHXXR54HVFE2RKCV1QZCQEQR commerce low resolved incident

ordering-gateway - Elevated 401 responses for /cart endpoint

**Problem**: We observed a sharp increase in 401 errors on the POST /ordering-gateway/carts endpoint since approximately 10:05 CST, primarily affecting web channel requests. **Impact**: No business continuity impact for Pizza Hut US. Order submissions and confirmations are at or above historical averages. **Causes**: A misconfigured test environment, likely set up by a contractor or QA resource, used staging authentication tokens against production endpoints. This led to a high volume of unauthorized requests from the WEB channel. **Steps to resolve**: Live incident communications have conclud

2/20/2026

n/a

Matthew Walters