On May 1, 2026, we experienced an incident affecting a subset of schedules in the US region following a recent infrastructure update.
The issue has been resolved, and all affected schedules have been restored.
As part of an ongoing scalability improvement, we recently updated scheduling infrastructure in the US region to a new architecture. During this transition, a legacy execution path remained in the codebase as a fallback mechanism.
On May 1, a bug caused the system to revert to the legacy path. This resulted in inconsistent state between the old and new scheduling systems for some users.
The incident affected a limited number of users in the US region.
Most users were not affected, and the vast majority of schedules continued operating normally throughout the incident.
Users who did not update schedules during the transition window continued operating normally throughout the incident.
A subset of users who created, edited, paused, or deleted schedules between April 24 and May 1 may have experienced one or more of the following:
Schedules created after the transition may have stopped executing briefly before recovery.
During the transition, the new scheduling infrastructure became the source of truth for schedule state.
Due to a bug, the system unexpectedly reverted traffic to the legacy scheduling path, which began accepting updates independently from the new system.
This caused the two systems to diverge and resulted in inconsistent schedule state for affected users.
After identifying the issue, we:
In some cases, schedules that had previously been paused or deleted were restored to avoid permanent data loss.
We are implementing several changes to prevent similar incidents:
We apologize for the disruption and appreciate everyone’s patience while we resolved the issue.