[CASSANDRA-13215] Cassandra nodes startup time 20x more after upgarding to 3.x - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 3.11.2, 4.0-alpha1, 4.0
Component/s: Legacy/Core
Labels:
None
Environment:

Cluster setup: two datacenters (dc-main, dc-backup).
dc-main - 9 servers, no vnodes
dc-backup - 6 servers, vnodes

Description

CompactionStrategyManage.getCompactionStrategyIndex is called on each sstable at startup. And this function calls StorageService.getDiskBoundaries. And getDiskBoundaries calls AbstractReplicationStrategy.getAddressRanges.
It appears that last function can be really slow. In our environment we have 1545 tokens and with NetworkTopologyStrategy it can make 1545*1545 computations in worst case (maybe I'm wrong, but it really takes lot's of cpu).

Also this function can affect runtime later, cause it is called not only during startup.

I've tried to implement simple cache for getDiskBoundaries results and now startup time is about one minute instead of 25m, but I'm not sure if it's a good solution.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

simple-cache.patch
13/Feb/17 13:22
3 kB
Viktor Kuzmin

Issue Links

duplicates

CASSANDRA-13923 Flushers blocked due to many SSTables

Resolved

CASSANDRA-13937 Cassandra node's startup time increased after increase count of big tables

Resolved

Activity

People

Assignee:: Marcus Eriksson

Reporter:: Viktor Kuzmin

Authors:: Marcus Eriksson

Reviewers:: Paulo Motta

Votes:: 3 Vote for this issue

Watchers:: 12 Start watching this issue

Dates

Created:: 13/Feb/17 13:22

Updated:: 15/May/20 08:04

Resolved:: 24/Nov/17 13:24