SRE
Why Up to 70% of SRE Initiatives Stall Before They Scale — and How to Break the Plateau
Many SRE initiatives stall because organizations adopt the title without the principles. True SRE success requires leadership vision, cultural change, shared KPIs and continuous maturity measurement—not tools alone ...
From Cloud to Cognitive Infrastructure: How AI is Redefining the Next Frontier of SRE
As organizations embrace artificial intelligence (AI) workloads alongside traditional cloud systems, site reliability engineering (SRE) must evolve to manage an entirely new class of infrastructure — intelligent, hybrid and graphics processing unit ...
Ciroos.AI Preps AI SRE Agents Trained to Automate Incident Management
Ciroos.AI this week emerged from stealth to provide early access to a set of artificial intelligence (AI) agents that have been trained to augment site reliability engineers (SREs). Fresh off raising $21 ...
Unlocking Accountability: How Real-Time App Monitoring Empowers Engineering Teams
Real-time app monitoring is about fundamentally shifting your mindset toward a culture of accountability and continuous improvement ...
Best of 2023: Microservices Sucks — Amazon Goes Back to Basics
In this week’s #TheLongView: Amazon Prime Video has ditched its use of microservices-cum-serverless, reverting to a traditional, monolithic architecture. It vastly improved the workload’s cost and scalability ...
Three Strategies for Reducing MTTD and MTTR as Outage Costs Spiral
In a business environment where most interactions with customers, suppliers and business partners are conducted digitally, downtime is no longer just a problem but an existential threat ...
Microsoft kills Python 3.7 ¦ … and VBScript ¦ Exascaling ARM on Jupiter
In this week’s #TheLongView: VS Code drops support for Python 3.7, Windows drops VBScript, and Europe plans the fastest ARM supercomputer ...
Oracle Bill is 5x Client’s Budget ¦ Toyota Out of Space
In this week’s The Long View: Birmingham looks like the Detroit of the UK—is it Oracle’s fault? Plus: Was Toyota’s factory failure caused by running out of disk space? ...
IBM LLM AI: COBOL to Java ASAP ¦ ARM IPO is GO!
In this week’s #TheLongView: Translating legacy COBOL code to a slightly more modern language, and Arm will go public (again) next month ...
80% of Bosses ‘Regret’ Stopping WFH ¦ PSA: Disable STS!
In this week’s #TheLongView: Rethinking return-to-office mandates and a ridiculous, ancient Windows bug ...
2024—Year of the Linux Desktop? ChromeOS Reflects its Inner Penguin ¦ GNOME Rethink
In this week’s #TheLongView: Can the Linux desktop installed base break the mythical 10% barrier? Google has been refactoring ChromeOS, and GNOME is working on new window manager ideas ...
AI ‘is Getting Worse’ ¦ AI ‘Will Lose India Jobs’ (Probably Isn’t ¦ Probably Won’t)
In this week’s #TheLongView, a conundrum: On the one hand, researchers say ChatGPT is losing the plot; and on the other, outsourced coding jobs in India will be replaced by AI ...

