The median company spends 7-8% of cloud infrastructure budget on observability. Companies above 15% are overpaying. Companies below 3% are under-monitoring and accepting higher incident risk. These benchmarks are synthesised from published industry research by OneUptime, Honeycomb, Elastic, and multiple vendor-neutral analyst reports from 2024-2026.
One of the most common questions engineering leaders ask is "how much should we be spending on monitoring?" The answer has been frustratingly vague: most articles cite "5-15% of cloud spend" without methodology, context, or granularity. This page provides the most detailed monitoring cost benchmarks available anywhere, broken down by company size, application architecture, and industry vertical.
These benchmarks are not primary research from a single survey. They are a synthesis of the best available data from multiple sources: the OneUptime observability tax analysis, Honeycomb's cost of observability research, Elastic's 2024-2026 observability trends reports, published case studies from organisations that have shared their monitoring spend data, and our own analysis of public pricing across six major vendors. Where sources disagree, we provide the range and explain the discrepancy. Our goal is to give you the most honest, useful benchmark possible, even when the data is imperfect.
Understanding where your monitoring spend falls relative to these benchmarks is the first step toward optimisation. If you are in the healthy range (3-8%), your focus should be on maintaining cost efficiency as you grow. If you are above 10%, there are almost certainly opportunities to reduce costs without impacting monitoring quality. If you are below 3%, you may be accepting undetected incidents and longer mean time to resolution (MTTR) that cost more in downtime than the monitoring would.
Company size is the strongest predictor of monitoring spend because it correlates with infrastructure complexity, team size, and vendor pricing tier. Startups on free tiers spend near zero, while enterprises with complex multi-cloud, Kubernetes-based architectures can spend $200,000+ per month. The percentage of cloud budget spent on monitoring tends to increase with company size because larger organisations have more complex architectures that generate more telemetry data per host. This data is drawn from vendor pricing at typical deployment sizes and validated against published case studies.
| Company Size | Typical Hosts | Monthly Spend | % of Cloud | Typical Vendor |
|---|---|---|---|---|
| Startup (1-50 employees) | 5-20 | $0-$500 | 3-5% | New Relic Free / Grafana Cloud Free |
| SMB (50-200 employees) | 20-100 | $500-$5,000 | 5-8% | Datadog Pro / Grafana Cloud |
| Mid-Market (200-1000) | 100-500 | $5,000-$30,000 | 7-10% | Datadog / Dynatrace |
| Enterprise (1000+) | 500-5,000+ | $30,000-$200,000+ | 8-15% | Datadog Enterprise / Dynatrace |
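The size tiers above can be encoded as a simple lookup. A minimal sketch in Python — the tier boundaries come from the "Typical Hosts" column of the table, and the function and variable names are illustrative:

```python
# Hypothetical helper encoding the company-size benchmarks from the table.
# Tier boundaries follow the "Typical Hosts" column; names are illustrative.

BENCHMARKS = [
    # (max_hosts, tier, monthly_spend_usd, pct_of_cloud)
    (20,   "startup",    (0, 500),          (3, 5)),
    (100,  "smb",        (500, 5_000),      (5, 8)),
    (500,  "mid-market", (5_000, 30_000),   (7, 10)),
    (None, "enterprise", (30_000, 200_000), (8, 15)),
]

def benchmark_for(hosts: int):
    """Return the (tier, spend range, % of cloud) row for a host count."""
    for max_hosts, tier, spend, pct in BENCHMARKS:
        if max_hosts is None or hosts <= max_hosts:
            return tier, spend, pct

tier, spend, pct = benchmark_for(250)  # 250 hosts falls in the mid-market tier
```

A fleet of 250 hosts maps to the mid-market row, so expected monitoring spend would be $5,000-30,000/month at 7-10% of cloud budget.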
Application architecture has a dramatic impact on monitoring cost per host. The shift from monolithic to microservices to Kubernetes-native architectures progressively increases the volume of telemetry data generated per unit of infrastructure. A single monolithic application on 50 servers generates a predictable, manageable volume of metrics and logs. The same application decomposed into 200 microservices on 50 Kubernetes nodes generates 3-5x more metrics (due to inter-service communication, pod lifecycle events, and label cardinality), 2-3x more log volume (each service has its own log stream), and distributed traces that did not exist in the monolithic architecture. This multiplier effect is the primary reason why monitoring costs have grown faster than infrastructure costs over the past five years.
| Architecture | Monitoring Cost | Primary Cost Drivers |
|---|---|---|
| Monolith | $15-25/host/mo | Lower metrics volume, simpler tracing |
| Microservices | $30-75/host/mo | More traces, inter-service metrics, higher cardinality |
| Kubernetes | $45-125/host/mo | Pod churn, label cardinality, container billing |
| Serverless | $0.02-0.10/invocation | Per-invocation pricing, cold start monitoring |
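To illustrate the multiplier effect, here is a back-of-the-envelope model using approximate midpoints of the per-host ranges above. The midpoint figures are illustrative, real bills depend on telemetry volume, and serverless is omitted because it is billed per invocation rather than per host:

```python
# Rough cost model from the per-host ranges above; midpoints are
# approximations, not vendor quotes. Serverless (per-invocation billing)
# does not fit a per-host model and is excluded.

COST_PER_HOST = {         # approximate midpoint, USD per host per month
    "monolith": 20,       # $15-25 range
    "microservices": 52,  # $30-75 range
    "kubernetes": 85,     # $45-125 range
}

def estimated_monthly_cost(hosts: int, architecture: str) -> int:
    return hosts * COST_PER_HOST[architecture]

# The same 50 hosts cost roughly 4x more to monitor on Kubernetes
# than as a monolith:
estimated_monthly_cost(50, "monolith")    # 1000
estimated_monthly_cost(50, "kubernetes")  # 4250
```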
Industry vertical affects monitoring spend through regulatory requirements, uptime expectations, and typical architecture patterns. FinTech companies consistently spend the most on monitoring because financial regulators require comprehensive audit trails, high-frequency trading systems demand sub-millisecond monitoring granularity, and the cost of downtime (both direct revenue loss and regulatory penalties) justifies higher monitoring investment. Healthcare companies face similar compliance requirements. SaaS companies represent the median, while media and gaming companies typically spend less because their compliance requirements are lower, even though their data volumes can be very high.
| Industry | Monitoring as % of Cloud | Primary Driver |
|---|---|---|
| FinTech | 10-15% | Compliance, audit trails, high-frequency monitoring |
| SaaS | 7-10% | Median industry, standard observability needs |
| E-Commerce | 6-12% | Seasonal spikes, peak-based billing |
| Healthcare | 8-14% | Compliance, data retention requirements |
| Media/Gaming | 5-8% | High data volume but less compliance need |
The concept of an "observability tax" has gained traction in engineering circles as monitoring costs have grown from a rounding error to a significant line item in cloud budgets. Based on our synthesis of available industry data, we define three spending zones that characterise the health of your monitoring investment.
**Healthy zone.** Your monitoring spend is proportional to your infrastructure complexity. You are collecting the telemetry needed for effective incident detection and debugging without significant waste. Focus on maintaining this efficiency as you grow by implementing cost controls proactively. At this level, you likely have good log filtering policies, appropriate trace sampling, and well-managed custom metrics cardinality.
**Warning zone.** You are spending more than average on monitoring relative to your infrastructure. This may be justified by compliance requirements, complex architecture, or high uptime SLAs, but it warrants investigation. Common causes at this level include uncontrolled log volume growth, custom metrics sprawl, tool overlap, and overly generous retention policies. Implement the top three or four cost reduction strategies to bring spend back into the healthy range; a focused 30-day optimisation effort typically yields 20-30% savings.
**Critical zone.** Your monitoring spend is significantly above industry benchmarks. At this level, monitoring costs are likely growing faster than infrastructure costs, and continuing on the current trajectory will make monitoring one of your largest cloud expenses within 12 months. Immediate action is needed: conduct a full monitoring cost audit, implement aggressive log and metrics filtering, evaluate vendor alternatives, and consider partial migration to open source for cost-sensitive components. A comprehensive optimisation programme can typically reduce costs by 40-60%.
Calculating your monitoring cost as a percentage of total cloud spend is straightforward but requires gathering data from multiple sources. Follow these five steps to determine where your organisation stands relative to the benchmarks above. This exercise typically takes 1-2 hours and provides the data foundation for any cost optimisation initiative. We recommend repeating this calculation quarterly to track trends and catch cost creep early.
1. **Gather your total cloud infrastructure spend.** Sum your monthly bills from AWS, GCP, Azure, and any other infrastructure providers. Include compute, storage, networking, and managed services. Exclude SaaS subscriptions that are not infrastructure (e.g., Slack, Jira). A typical mid-market company spends $50,000-500,000/month on cloud infrastructure.
2. **Total your monitoring costs.** Include every monitoring vendor bill: Datadog, New Relic, Grafana Cloud, Splunk, PagerDuty, Opsgenie, and any other observability tools. Include the infrastructure cost of self-hosted monitoring (Prometheus servers, Grafana instances, log storage). If you operate self-hosted systems, also include the portion of engineering salaries dedicated to monitoring tool maintenance (typically 10-25% of a platform engineer's time).
3. **Calculate the ratio.** Divide total monitoring costs by total cloud infrastructure costs and multiply by 100. For example: $15,000 monitoring / $200,000 cloud = 7.5%. This is your observability spend ratio.
4. **Compare against the benchmarks.** Use the company size, architecture, and industry tables above to find the relevant benchmark range. If your percentage is above the range, you have a cost optimisation opportunity. If it is below, verify that your monitoring coverage is adequate for your SLA requirements.
5. **Track the ratio over time.** Add this calculation to your monthly FinOps review. Set a target range based on your industry and architecture benchmarks. Alert when the ratio exceeds your target by more than 2 percentage points, as this indicates unexpected cost growth that warrants investigation.
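The ratio calculation and the alert check can be sketched in a few lines of Python. The function names and the example target threshold are illustrative, not part of any FinOps tool:

```python
def observability_ratio(monitoring_cost: float, cloud_cost: float) -> float:
    """Monitoring spend as a percentage of cloud infrastructure spend."""
    return monitoring_cost / cloud_cost * 100

def ratio_alert(ratio: float, target_max: float, tolerance: float = 2.0) -> bool:
    """Flag when the ratio exceeds the target by more than 2 percentage
    points, the review threshold suggested above."""
    return ratio > target_max + tolerance

ratio = observability_ratio(15_000, 200_000)  # ~7.5%, the worked example above
ratio_alert(ratio, target_max=8.0)            # False: within target
ratio_alert(12.5, target_max=8.0)             # True: investigate cost growth
```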
Related resources:

- See which vendor is cheapest at your scale
- Cost Reduction Strategies: 12 ways to cut your bill by 30-50%
- Cost Calculator: model different vendor scenarios
- Hidden Costs: why your bill is 37-97% above list price
- Open Source vs Paid: complete TCO analysis for self-hosted
- Kubernetes Monitoring: the K8s cost multiplier explained
For most companies, budget 5-10% of your total cloud infrastructure spend for observability. Startups with simple architectures can budget 3-5% using free tiers and basic monitoring. Mid-market companies with microservices architectures should budget 7-10%. Enterprises with Kubernetes, multi-cloud, and compliance requirements should budget 8-12%. For concrete numbers: if you spend $100,000/month on AWS/GCP/Azure infrastructure, expect to spend $5,000-$10,000/month on monitoring tools. This covers infrastructure metrics, APM for critical services, log management with 15-30 day retention, and basic alerting. Budget an additional 20-30% above vendor costs for hidden costs like custom metrics overages and retention extensions.
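As a sketch, the budgeting arithmetic above (5-10% of cloud spend, plus roughly 25% on top for hidden costs) looks like this. The parameter defaults are assumptions drawn from the ranges in the text, not vendor figures:

```python
def monitoring_budget(cloud_spend: float, pct_range=(5, 10),
                      hidden_cost_factor=0.25):
    """Budget range: 5-10% of cloud spend, inflated ~25% for hidden
    costs such as custom metrics overages and retention extensions."""
    low, high = (cloud_spend * p / 100 for p in pct_range)
    return low * (1 + hidden_cost_factor), high * (1 + hidden_cost_factor)

# $100,000/month of infrastructure implies a $6,250-12,500/month
# all-in monitoring budget:
monitoring_budget(100_000)  # (6250.0, 12500.0)
```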
The median company spends 7-8% of cloud infrastructure budget on monitoring and observability tools. The range varies significantly: startups on free tiers spend under 3%, while large enterprises with complex architectures and compliance requirements can spend 12-15%. FinTech companies average 10-15% due to regulatory audit trail requirements. The observability spend ratio has been increasing year-over-year as architectures become more complex (microservices, Kubernetes, serverless) and generate more telemetry data per unit of infrastructure. Companies that do not actively manage monitoring costs typically see the ratio increase by 2-3 percentage points annually as data volumes grow.
Total cost of ownership for monitoring includes four components that many teams overlook. Vendor licensing or subscription costs are the most visible, accounting for 40-70% of total cost. Infrastructure costs for running monitoring systems (Prometheus servers, log storage, trace databases) account for 10-30% of total cost, though this is zero for fully hosted vendors like Datadog. Engineering time for setup, maintenance, dashboarding, and alert tuning accounts for 15-30% of total cost and is the most commonly underestimated component. Opportunity costs from vendor lock-in, migration barriers, and the engineering time spent on monitoring tool management instead of product development represent the remaining 5-15%. A comprehensive TCO assessment should include all four components, not just the vendor invoice.
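A hypothetical breakdown using mid-range shares from the four components above. The exact split varies widely between organisations, so treat the percentages as assumptions for illustration:

```python
# Illustrative TCO split using mid-range shares from the text;
# the percentages are assumptions, not measured data.

TCO_SHARES = {
    "vendor_licensing": 55,   # 40-70% of total, the most visible component
    "infrastructure": 20,     # 10-30%; zero for fully hosted vendors
    "engineering_time": 20,   # 15-30%, the most commonly underestimated
    "opportunity_cost": 5,    # 5-15%: lock-in, migration barriers
}

def tco_breakdown(total: float) -> dict:
    """Split a total monthly TCO across the four components."""
    return {name: total * share / 100 for name, share in TCO_SHARES.items()}

# At $20,000/month total TCO, only ~$11,000 shows up on the vendor invoice:
tco_breakdown(20_000)["vendor_licensing"]  # 11000.0
```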