The Model Context Protocol (MCP) standardizes how AI agents access tools, APIs and data. Learn how SREs can leverage MCP to build smarter, automated workflows.
SRE in the Age of AI: What Reliability Looks Like When Systems Learn
As AI and ML become core production components, SRE is evolving from managing deterministic systems to ensuring the reliability of dynamic, learning systems. New metrics, workflows, guardrails and cross-disciplinary practices are redefining reliability in the age of adaptive software.
New Relic Enhances Azure Integration with AI-Powered Observability Tools
New Relic Inc. unveiled a suite of intelligent observability integrations with Microsoft Azure on Tuesday to streamline incident response and boost developer productivity as enterprises rush to adopt artificial intelligence (AI) workflows. The company’s new AI Model Context Protocol (MCP) Server now feeds real-time observability data directly into Azure’s SRE Agent and Microsoft Foundry, eliminating […]
AI-Driven Performance Testing: A New Era for Software Quality
Discover how AI and large language models (LLMs) are revolutionizing performance testing—shifting from reactive load testing to predictive, continuous assurance powered by intelligent agents and automation.
The Future of Observability: Predictive Root Cause Analysis Using AI
In the past few years, systems have become more complex than ever. Microservices, Kubernetes, cloud environments and distributed application programming interfaces (APIs) have changed how we build and manage software. However, this complexity has also made it harder to find the root cause when things go wrong. That’s where observability and artificial intelligence (AI) come together to change the game — helping us move from reactive monitoring to predictive root cause analysis (RCA). […]
Observe Adds Two AI Agents to Improve Observability
Observe Inc. introduces the AI SRE Agent and o11y.ai Agent to its observability platform—empowering DevOps teams to automate incident triage, generate OpenTelemetry code, and query application performance using natural language for faster, smarter debugging.
OpenTelemetry and AI Are Unlocking Logs as the Essential Signal for “Why”
Logs reveal the “why” behind failures. Learn how OpenTelemetry and AI transform raw log data into structured, actionable insights for modern observability.
The Agentic AI-Driven Future of Telemetry
Telemetry is evolving from passive data to AI-grade fuel. Learn how agentic telemetry fuses human and machine context to power self-healing, intelligent systems.
The Breakneck Future of Codegen: Why AI SWE Must Be Matched with AI SRE
AI codegen is transforming software development — but as speed and complexity increase, so does fragility. AI for site reliability will need to keep pace to avoid system breakdown and engineer burnout.
Grafana Labs Extends AI Capabilities of Observability Platform
Grafana Labs this week made generally available an artificial intelligence (AI) agent, dubbed Grafana Assistant, for its namesake dashboard. It also previewed Grafana Assistant Investigations, an AI incident management tool that analyzes the observability stack, generates findings and hypotheses, and surfaces actionable recommendations for mitigation and remediation. Announced at its ObservabilityCON 2025 conference, Grafana […]









