
Blog
Using OpenTelemetry and the OTel Collector for Logs, Metrics, and Traces
This production-focused guide offers an understanding of what OpenTelemetry is, its core components, and a detailed look at the OTel Collector.
Blog
This production-focused guide offers an understanding of what OpenTelemetry is, its core components, and a detailed look at the OTel Collector.
Blog
Bridging the gap between observability data and actionable insight
Causely product
We’ll introduce the 6 common components and 7 AI Workers of our Causal Reasoning Platform, explaining how the platform works to enable autonomous service reliability.
Causality
Collecting “more data” has been the defining characteristic of observability practices and tools for the last few decades. But over-collection creates inefficiencies, noise, and cost without adding meaningful value. This trajectory must and can be changed.
Causality
By identifying potential risks in real time, predicting future demand, and adapting resources dynamically, teams can maintain reliability even under extreme conditions. This isn’t about eliminating unpredictability; it’s about building systems that respond intelligently to it.
DevOps & SRE
Making changes to production environments is one of the riskiest parts of managing complex systems. In 2025, let's transform how changes are made, empowering teams to anticipate risks, validate decisions, and protect system stability—all before the first line of code is deployed.
Causality
Explore the challenges of multi-team escalations, and the capabilities needed to address them. We’ll show how observability can be transformed to make escalations less contentious and more productive.
Causality
SREs and developers can make troubleshooting more manageable in 2025 by adopting systems that solve the root cause analysis problem.
Podcast
Adriana Villela (Dynatrace) and Reese Lee (New Relic) interviewed Causely Co-founder Endre Sara, along with several other OpenTelemetry users and contributors, during KubeCon NA 2024.
Blog
CPU throttling is a frequent challenge in containerized environments, particularly for resource-intensive applications. It happens when a container surpasses its allocated CPU limits, prompting the scheduler to restrict CPU usage. While this mechanism ensures fair resource sharing, it can significan
Assuring application reliability is a persistent challenge faced by every IT organization, complicated by rapid technology evolution and the increased emphasis on lean engineering. One trend among progressive companies is to designate a “Service Owner” who is responsible for making…
Based on my LinkedIn news feed, it must be that time of year when thousands of open source enthusiasts congregate to talk tech at various parties, dinners, and other networking events surrounding KubeCon. In fact, we’re hosting a couple of…