Service disruption affecting multiple functions

Incident Report for Fluid Attacks

Postmortem

Impact

At least one user observed failures in several features of the platform. The issue started on UTC-5 25-11-27 09:25 and was proactively discovered 1.9 hours (TTD) later by a staff member who noticed that some processes were not running as expected. Shortly after this internal detection, customer reports also began to arrive, confirming the problem. The problem was resolved in 7.6 hours (TTF), resulting in a total window of exposure of 9.6 hours (WOE) [1].

Cause

The infrastructure used by some services was decommissioned, even though those services were still dependent on it. This led to interruptions in functionalities related to repository cloning, reattacks, report generation, and other operations handled by the affected components [2].

Solution

New infrastructure definitions were created, and the systems were updated to use them. This included refreshing the internal configurations so that all processes pointed to the correct, active infrastructure [3].

Conclusion

To prevent similar issues in the future, we are improving how infrastructure ownership is structured. Each component will become clearly responsible for the infrastructure it depends on, making those relationships visible and reducing the chances of accidental removal of shared resources. INFRASTRUCTURE_ERROR < INCOMPLETE_PERSPECTIVE

Posted Nov 28, 2025 - 08:35 GMT-05:00

Resolved

The incident has been resolved, and all affected operations are operating as expected.
Posted Nov 27, 2025 - 19:39 GMT-05:00

Update

The solution is currently being implemented. We will provide another update once the services are fully stabilized
Posted Nov 27, 2025 - 15:16 GMT-05:00

Update

We have traced the source of the issue and are actively working on the necessary corrections.
Posted Nov 27, 2025 - 13:03 GMT-05:00

Identified

It has been identified that several processes responsible for tasks such as updating Git roots, executing scans, generating reports, and other core services, are currently failing. Our engineering team is actively working to restore normal functionality.
Posted Nov 27, 2025 - 12:15 GMT-05:00
This incident affected: Platform, Cloning, and Scanning.