Blameless post-mortems flip the script, transforming incidents into structured opportunities for learning, accountability and resilience.
Logz.io Leverages AI to Identify Anomalies in Real-Time
Logz.io added a real-time anomaly detection capability to its observability platform that simplifies correlation of the impact IT events have on business processes.
Elastic Previews Unified Query Language for Search Platform
Elastic this week previewed a standard query language that can be used across its portfolio to streamline investigations into IT and cybersecurity events.
IT Service Incidents Are Becoming More Frequent, Survey Says
A Transposit survey found the majority of respondents saw more frequent service incidents that affected their customers over the past 12 months.
New Relic Extends AI to Identify Alert Coverage Gaps
New Relic added additional AI capabilities to its observability platform to detect and resolve alert coverage gaps.
Unleashing AI in SRE: A New Dawn for Incident Management
In my recent blog, Revolutionizing the Nine Pillars of SRE with AI-Engineered Tools, I indicated that AI could assist with incident management by automating the detection and triage of incidents and helping to quickly identify the root cause. In this blog, I explain in more detail how AI-engineered tools can be used to improve the […]
The DevOps Pendulum: Agility Vs. Control
Engineering, like everything else in life, is all about balance. In DevOps, the balance is between control and agility.
Logz.io Taps AI to Surface Incident Response Recommendations
Logz.io this week added a supervised machine learning capability to its observability platform that reduces mean-time-to-remediation by surfacing recommendations for resolving incidents. Asaf Yigal, vice president of product for Logz.io, said the Alert Recommendation capability added to the Logz.io Open360 platform uses artificial intelligence (AI) to model the steps a DevOps team needs to complete […]
PagerDuty Bets on AIOps and Automation to Simplify IT Management
PagerDuty this week made generally available an artificial intelligence for IT operations (AIOps) platform that leverages the data model embedded in its incident management software to reduce the amount of time required for an AI platform to learn how an IT environment operates. Jonathan Rende, senior vice president and general manager for PagerDuty, said that […]
Blameless Integrates Incident Management Platform With Opsgenie
Blameless this week announced it has integrated its namesake incident management solution with the Opsgenie alerting platform from Atlassian. Aaron Lober, director of product marketing for Blameless, said the Blameless platform is now integrated with both Opsgenie and PagerDuty, the two most widely used platforms for managing alerts. Blameless provides an incident management platform that […]
- « Previous Page
- 1
- 2
- 3
- 4
- 5
- …
- 7
- Next Page »






