Load-Based Splitting

To optimize your cluster’s performance, CockroachDB can split frequently accessed keys into smaller ranges. In conjunction with load-based rebalancing, load-based splitting distributes load evenly across your cluster.

Enable and disable load-based splitting

Load-based splitting is enabled by default. To enable or disable load-based splitting, set the kv.range_split.by_load_enabled cluster setting. For example, to disable load-based splitting, execute:

> SET CLUSTER SETTING kv.range_split.by_load_enabled = false;

When to enable load-based splitting

Load-based splitting is on by default and beneficial in almost all situations.

When to disable load-based splitting

You might want to disable load-based splitting when troubleshooting range-related issues under the guidance of Cockroach Labs support.

Control load-based splitting threshold

To control the load-based splitting threshold, set the kv.range_split.load_qps_threshold cluster setting to the queries-per-second (QPS) rate at which you want CockroachDB to consider splitting a range (defaults to 2500). For example:

> SET CLUSTER SETTING kv.range_split.load_qps_threshold = 2000;

When to modify the load-based splitting threshold

For some workloads, splitting ranges more aggressively (i.e., with a lower QPS threshold) can improve performance. On the other hand, clusters running on very large machines might benefit from a higher QPS threshold, which splits ranges more conservatively.

How load-based splitting works

Whenever a range exceeds the cluster’s setting for kv.range_split.load_qps_threshold, the range becomes eligible for load-based splitting. At that point, CockroachDB begins gathering per-key metrics to determine whether or not a split would benefit the cluster’s performance based on the following heuristics:

Balance factor: If you perform a split, would there be a balance of load on both sides of the split? For example, if 99% of the traffic is for a single key, splitting it apart from the others in the range will not have a substantial impact on performance. However, if 60% of the queries are for a single key, it is likely to benefit the cluster to move the key to its own range.
Split crossing: If you perform a split, how many queries would have to cross this new range boundary? Because involving multiple ranges in a query incurs greater overhead than a single range, splitting a range could actually degrade performance. For example, if the range is involved in many SELECT COUNT(*)... operations, splitting the range in two could negatively impact performance.
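
The two heuristics can be illustrated with a toy model. This sketch is not CockroachDB's implementation: the helper name score_split, the representation of each request as an inclusive (start_key, end_key) span over integer keys, and the two output fields are all assumptions made for illustration. For a candidate split key, it counts the requests that would land entirely on the left, entirely on the right, or across the new boundary.

```python
def score_split(requests, split_key):
    """Toy scoring of a candidate split key (hypothetical helper).

    requests: list of (start_key, end_key) inclusive spans over integer keys.
    A split at split_key puts keys < split_key on the left side and
    keys >= split_key on the right side.
    """
    left = right = crossing = 0
    for start, end in requests:
        if end < split_key:
            left += 1
        elif start >= split_key:
            right += 1
        else:
            crossing += 1  # this request would touch both new ranges
    return {
        # 0.0 means single-range traffic is evenly divided; 1.0 means it is all on one side
        "imbalance": abs(left - right) / max(left + right, 1),
        # share of all requests that would span the new range boundary
        "crossing_fraction": crossing / max(len(requests), 1),
    }


# 60 point reads on key 5 and 40 on key 8: a split at key 8 separates the two keys.
point_reads = [(5, 5)] * 60 + [(8, 8)] * 40
balanced = score_split(point_reads, split_key=8)

# 100 scans over keys 0..9 (for example, SELECT COUNT(*)): every request crosses.
full_scans = [(0, 9)] * 100
scans = score_split(full_scans, split_key=5)
```

In this model, the point-read workload yields a low imbalance and no crossing requests, so a split looks attractive, while the scan workload yields a crossing fraction of 1.0, which is the case where a split could degrade performance.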

Determine a split point

Using the per-key metrics gathered once a range exceeds the kv.range_split.load_qps_threshold value, CockroachDB estimates a good place to perform a split by finding the first set of keys whose total operations exceed the threshold. For example, if the operations on a single key exceed the threshold, that key is a good candidate at which to split the range. Similarly, if access is spread equally among all keys in the range and their total operations exceed the threshold, splitting the range at the first set of keys whose total operations exceed the threshold reduces the load on the node. If the “other” range still exceeds the threshold, it will eventually be split again. In both of these examples, the split occurs only if the balance factor and split crossing heuristics determine that the split would produce better results.
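
As a rough sketch of "the first set of keys whose total operations exceed the threshold", the following toy function walks the keys in order and accumulates their operation counts. The function name first_split_point and the input shape are hypothetical, and whether the real boundary falls before or after the returned key is a simplification here; the actual splitter works from sampled requests rather than exact per-key totals.

```python
def first_split_point(ops_per_key, threshold):
    """Return the key at which the running total of operations first
    exceeds the threshold, or None if it never does (hypothetical helper).

    ops_per_key: list of (key, operations_per_second) pairs in key order.
    """
    running = 0
    for key, ops in ops_per_key:
        running += ops
        if running > threshold:
            return key
    return None


# A single key whose operations alone exceed the threshold.
hot_key = first_split_point([("a", 100), ("b", 3000), ("c", 50)], threshold=2500)

# Equal access across ten keys: 400 operations each, 4000 in total.
uniform = first_split_point([(f"k{i}", 400) for i in range(10)], threshold=2500)
```

In the first case the candidate is the hot key itself. In the second, the running total first passes 2500 at the seventh key, leaving a remainder that could be split again later if it still exceeds the threshold.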

Why load-based splitting works

CockroachDB creates a relatively even distribution of leaseholders throughout your cluster. (A leaseholder is the single replica of a range that both serves reads and coordinates write operations.) However, without load-based splitting, this distribution is created without considering the load present on any set of keys. This means that even with an equitable distribution of leases throughout the cluster, some leases will generate more traffic for the node that houses them than others. Because each node can provide only so much throughput, a single node can become a bottleneck for access to a subset of data in your cluster.

With load-based splitting, the cluster can understand load on a per-range basis and split ranges that generate a significant amount of load into multiple ranges, and therefore multiple leaseholders. This lets the cluster express its load as a function of leases, so the roughly equal distribution of leases also generates a roughly equal distribution of traffic, preventing individual nodes from becoming bottlenecks for your cluster. This benefit is further amplified by load-based rebalancing, which ensures that all nodes contain replicas with a roughly equal load. Evenly distributing load throughout your cluster makes it easier to prevent bottlenecks and simplifies hardware forecasting.

Monitor load-based splitting

Log message

The log message

no split key found: ...

indicates that CockroachDB attempted to split a range due to load, but could not identify a key at which to split. You can usually ignore this log message. However, if it appears repeatedly, it may indicate a load imbalance in the cluster. A load imbalance might occur if a range cannot be split because it contains a hot key. For more information about how to reduce hotspots on your cluster, refer to the CockroachDB documentation on hotspots.

The log message ends with one of the following patterns that describe the observed load:

popular key detected, clear direction detected
popular key detected, no clear direction
no popular key, clear direction detected
no popular key, no clear direction

These patterns may indicate either or both of the following conditions:

popular key detected indicates that a significant percentage of reads or writes targets a single row within the range of data.
clear direction detected indicates that accesses within the range progress steadily in one direction, in either increasing or decreasing key order, which generally indicates an index hotspot.
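
When scanning logs for this message, it can help to turn the trailing pattern into two flags. The sketch below assumes only what is stated above: the line contains the text "no split key found:" and ends with one of the four patterns. The function name classify_no_split_key is hypothetical, and the sample lines are made up for illustration rather than copied from real log output.

```python
# The four trailing patterns, mapped to (popular_key, clear_direction).
PATTERNS = {
    "popular key detected, clear direction detected": (True, True),
    "popular key detected, no clear direction": (True, False),
    "no popular key, clear direction detected": (False, True),
    "no popular key, no clear direction": (False, False),
}


def classify_no_split_key(line):
    """Return the two conditions reported by a 'no split key found' line,
    or None if the line is not such a message (hypothetical helper)."""
    if "no split key found:" not in line:
        return None
    text = line.rstrip()
    for pattern, (popular, direction) in PATTERNS.items():
        if text.endswith(pattern):
            return {"popular_key": popular, "clear_direction": direction}
    return None


# Made-up sample lines; the real message body is elided in this page.
sample = "no split key found: (details elided), popular key detected, no clear direction"
result = classify_no_split_key(sample)
unrelated = classify_no_split_key("some other log line")
```

Counting how often each combination appears over time gives a quick view of whether repeated messages point to a single hot row, a sequential access pattern, or both.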

Metrics

The following metrics correlate with the potential conditions described in the load-based splitting log message. You can use them to monitor how often the load-based splitter fails to find a split key, and whether the failure is due to a popular key or a clear access direction.

Metric	Description
`kv.loadsplitter.nosplitkey`	Load-based splitter could not find a split key.
`kv.loadsplitter.popularkey`	Load-based splitter could not find a split key and the most popular sampled split key occurs in >= 25% of the samples.
`kv.loadsplitter.cleardirection`	Load-based splitter observed an access direction greater than 80% decreasing (left) or increasing (right) in the samples.
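
The 25% and 80% thresholds in the table can be made concrete with a small worked example. This is a toy model under stated assumptions, not the splitter's actual code: the function name describe_samples is hypothetical, and representing direction as simple counts of leftward and rightward observations is an assumption about how the samples might be summarized.

```python
from collections import Counter

POPULAR_KEY_FRACTION = 0.25      # ">= 25% of the samples", from the table above
CLEAR_DIRECTION_FRACTION = 0.80  # "greater than 80%", from the table above


def describe_samples(sampled_keys, left_count, right_count):
    """Apply the two thresholds to a set of samples (hypothetical helper).

    sampled_keys: list of sampled candidate split keys.
    left_count / right_count: assumed counts of accesses observed moving in
    decreasing (left) or increasing (right) key order.
    """
    top_fraction = 0.0
    if sampled_keys:
        _, top_count = Counter(sampled_keys).most_common(1)[0]
        top_fraction = top_count / len(sampled_keys)
    moves = left_count + right_count
    direction_fraction = max(left_count, right_count) / moves if moves else 0.0
    return {
        "popular_key": top_fraction >= POPULAR_KEY_FRACTION,
        "clear_direction": direction_fraction > CLEAR_DIRECTION_FRACTION,
    }


# One key appears in 5 of 20 samples (25%), and 90% of accesses move right.
skewed = describe_samples(["k1"] * 5 + [f"k{i}" for i in range(2, 17)], 10, 90)

# Twenty distinct keys and no dominant direction.
even = describe_samples([f"k{i}" for i in range(20)], 50, 50)
```

Note the different comparisons: a key at exactly 25% of the samples counts as popular, while a direction at exactly 80% does not count as clear, following the wording of the metric descriptions.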
