
AI
How Causal Reasoning Addresses the Limitations of LLMs in Observability
Causal reasoning with AI agents enable proactive incident prevention, automated remediation, and a path toward autonomous service reliability.
AI
Causal reasoning with AI agents enable proactive incident prevention, automated remediation, and a path toward autonomous service reliability.
Blog
Most developers use automatic instrumentation without knowing how it actually works. This post breaks down the key techniques behind it—not to build your own, but to understand what’s really happening when things "just work."
Causality
More telemetry doesn’t guarantee more understanding. In many cases, it gives you the illusion of control while silently eroding your ability to reason about the system.
observability
In 'Rethinking Reliability for Distributed Systems,' Endre Sara shared a common story: a large-scale customer, running mature microservices in Kubernetes with full observability coverage, still struggles to understand what’s broken during a high-stakes business event.
observability
A few weeks back, I joined Charity Majors, Paige Cruz, Avi Freedman, Shahar Azulay, and Adam LaGreca for a roundtable on the state of modern observability. It was an honest conversation about where we are, what’s broken, and where things are heading. You can read the full summary on
Causality
When it comes to observability and IT operations, our goal should be to get humans out of the loop as much as possible.
Webinar
“You actually cannot do meaningful reasoning especially when it comes to root cause analysis with LLMs or machine learning alone. You need more than that.” -Shmuel Kliger, Founder of Causely