By Vince Condarcuri
Publication Date: 2025-11-14 22:58:00
Recent cloud outages are becoming more expensive, with some reports saying that they now cost about $14,000 per minute. This is nearly 10% more than in 2022. This has put pressure on site reliability engineers to fix issues quickly. Therefore, companies are turning to artificial intelligence for help. These systems help spot the cause of problems, but aren’t yet trusted to fix things on their own, mainly because they lack an undo button and a clear way to reverse mistakes. However, tech giant IBM (IBM) and the University of Illinois at Urbana-Champaign have created a system called STRATUS to help solve this issue.
Meet Your ETF AI Analyst
Like the undo shortcut Ctrl+Z, STRATUS can automatically roll back bad decisions made by AI agents during incident responses. For example, if a fix makes things worse, STRATUS resets the system to a safe checkpoint in order to try another solution. Interestingly, in tests using open-source cloud benchmarks from Microsoft (MSFT) and IBM,…

