Fix TokenAwarePolicy to call child.make_query_plan once #634

Copilot · 2025-12-27T10:44:02Z

Pre-review checklist

I have split my patch into logically separate commits.
All commit messages clearly explain what they change and why.
I added relevant tests for new features and bug fixes.
All commits compile, pass static checks and pass test.
PR description sums up the changes and reasons why they should be introduced.
I have provided docstrings for the public items that I want to introduce.
I have adjusted the documentation in ./docs/source/.

Description

Optimizes TokenAwarePolicy.make_query_plan to call child.make_query_plan only once instead of up to 3 times in some scenarios, improving performance by eliminating redundant query plan generation.

Problem

In the original implementation, TokenAwarePolicy.make_query_plan could call child.make_query_plan up to three times:

Line 509: When query/routing_key/keyspace is None (early return)
Line 520: When using tablet-based routing
Line 538: To get the rest of the cluster hosts

This redundant computation was unnecessary and could impact performance, especially with complex child policies.

Changes Made

cassandra/policies.py

Modified TokenAwarePolicy.make_query_plan to call child.make_query_plan once at the beginning of the method
Converted the generator result to a list for reuse throughout the method
Reused this list for early returns, tablet replica filtering, and yielding the rest of the cluster
Added clear comment explaining the optimization

tests/unit/test_policies.py

Added a new comprehensive test test_child_make_query_plan_called_once that validates the optimization works in all scenarios:
- With routing key and keyspace
- Without routing key
- Without keyspace
- With tablets enabled
Updated existing test assertion in _assert_shuffle to expect one call for both tablet and vnode scenarios

Performance Impact

Before: Up to 3 calls to child.make_query_plan in worst-case scenarios
After: Always exactly 1 call to child.make_query_plan

The optimization converts the generator to a list for reuse. This is an acceptable trade-off as query plans are typically small (< 100 hosts even in large clusters) and the generator was already being fully materialized in the worst-case scenario.

Testing

✅ All 83 tests in tests/unit/test_policies.py pass
✅ New test specifically validates the optimization
✅ No security vulnerabilities detected by CodeQL scanner

Original prompt

This section details on the original issue you should resolve

<issue_title>Call child.make_query_plan in TokenAwarePolicy.make_query_plan only once</issue_title>
<issue_description>In some scenarios TokenAwarePolicy.make_query_plan can call child.make_query_plan three times.
Let's make sure it is called only once.

python-driver/cassandra/policies.py

Lines 506 to 539 in 2739b29

def make_query_plan(self, working_keyspace=None, query=None):

keyspace = query.keyspace if query and query.keyspace else working_keyspace

child = self._child_policy

if query is None or query.routing_key is None or keyspace is None:

for host in child.make_query_plan(keyspace, query):

yield host

return

replicas = []

if self._tablets_routing_v1:

tablet = self._cluster_metadata._tablets.get_tablet_for_key(

keyspace, query.table, self._cluster_metadata.token_map.token_class.from_key(query.routing_key))

if tablet is not None:

replicas_mapped = set(map(lambda r: r[0], tablet.replicas))

child_plan = child.make_query_plan(keyspace, query)

replicas = [host for host in child_plan if host.host_id in replicas_mapped]

if not replicas:

replicas = self._cluster_metadata.get_replicas(keyspace, query.routing_key)

if self.shuffle_replicas:

shuffle(replicas)

for replica in replicas:

if replica.is_up and child.distance(replica) in [HostDistance.LOCAL, HostDistance.LOCAL_RACK]:

yield replica

for host in child.make_query_plan(keyspace, query):

# skip if we've already listed this host

if host not in replicas or child.distance(host) == HostDistance.REMOTE:

yield host

</issue_description>

Comments on the Issue (you are @copilot in this section)

Fixes Call child.make_query_plan in TokenAwarePolicy.make_query_plan only once #358

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

mykaul · 2025-12-29T17:16:10Z

@copilot please complete the work on this PR.

Co-authored-by: mykaul <4655593+mykaul@users.noreply.github.com>

Copilot · 2025-12-29T17:24:17Z

@copilot please complete the work on this PR.

Completed! The optimization ensures child.make_query_plan is called only once in all scenarios. See commit 507b252.

Summary of changes:

Modified TokenAwarePolicy.make_query_plan to call child.make_query_plan once and reuse the result
Added comprehensive test coverage to verify the optimization
All 83 existing tests pass
No security issues detected

tests/unit/test_policies.py

+        keyspace = 'keyspace'
+        routing_key = 'routing_key'
+        query = Statement(routing_key=routing_key, keyspace=keyspace)
+        qplan = list(policy.make_query_plan(keyspace, query))


tests/unit/test_policies.py

+        # Test case 2: Without routing key (should call once)
+        child_policy.reset_mock()
+        query = Statement(routing_key=None, keyspace=keyspace)
+        qplan = list(policy.make_query_plan(keyspace, query))


tests/unit/test_policies.py

+        # Test case 3: Without keyspace (should call once)
+        child_policy.reset_mock()
+        query = Statement(routing_key=routing_key, keyspace=None)
+        qplan = list(policy.make_query_plan(None, query))


tests/unit/test_policies.py

+
+        child_policy.reset_mock()
+        query = Statement(routing_key=routing_key, keyspace=keyspace, table='test_table')
+        qplan = list(policy.make_query_plan(keyspace, query))


Initial plan

3481055

Copilot AI assigned Copilot and mykaul Dec 27, 2025

Copilot started work on behalf of mykaul December 27, 2025 10:44 View session

Copilot AI requested a review from mykaul December 27, 2025 10:45

Copilot started work on behalf of mykaul December 29, 2025 17:16 View session

Optimize TokenAwarePolicy to call child.make_query_plan only once

507b252

Co-authored-by: mykaul <4655593+mykaul@users.noreply.github.com>

github-code-quality bot found potential problems Dec 29, 2025

View reviewed changes

Copilot AI changed the title ~~[WIP] Fix TokenAwarePolicy to call child.make_query_plan once~~ Fix TokenAwarePolicy to call child.make_query_plan once Dec 29, 2025

Copilot finished work on behalf of mykaul December 29, 2025 17:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix TokenAwarePolicy to call child.make_query_plan once #634

Fix TokenAwarePolicy to call child.make_query_plan once #634

Uh oh!

Copilot AI commented Dec 27, 2025 •

edited

Loading

Uh oh!

mykaul commented Dec 29, 2025

Uh oh!

Copilot AI commented Dec 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	def make_query_plan(self, working_keyspace=None, query=None):
	keyspace = query.keyspace if query and query.keyspace else working_keyspace

	child = self._child_policy
	if query is None or query.routing_key is None or keyspace is None:
	for host in child.make_query_plan(keyspace, query):
	yield host
	return

	replicas = []
	if self._tablets_routing_v1:
	tablet = self._cluster_metadata._tablets.get_tablet_for_key(
	keyspace, query.table, self._cluster_metadata.token_map.token_class.from_key(query.routing_key))

	if tablet is not None:
	replicas_mapped = set(map(lambda r: r[0], tablet.replicas))
	child_plan = child.make_query_plan(keyspace, query)

	replicas = [host for host in child_plan if host.host_id in replicas_mapped]

	if not replicas:
	replicas = self._cluster_metadata.get_replicas(keyspace, query.routing_key)

	if self.shuffle_replicas:
	shuffle(replicas)

	for replica in replicas:
	if replica.is_up and child.distance(replica) in [HostDistance.LOCAL, HostDistance.LOCAL_RACK]:
	yield replica

	for host in child.make_query_plan(keyspace, query):
	# skip if we've already listed this host
	if host not in replicas or child.distance(host) == HostDistance.REMOTE:
	yield host

Fix TokenAwarePolicy to call child.make_query_plan once #634

Are you sure you want to change the base?

Fix TokenAwarePolicy to call child.make_query_plan once #634

Uh oh!

Conversation

Copilot AI commented Dec 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pre-review checklist

Description

Problem

Changes Made

cassandra/policies.py

tests/unit/test_policies.py

Performance Impact

Testing

Comments on the Issue (you are @copilot in this section)

Uh oh!

mykaul commented Dec 29, 2025

Uh oh!

Copilot AI commented Dec 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Dec 27, 2025 •

edited

Loading