Use the correct version of PreparedMetadata for each node #815

cvybhu · 2023-09-20T20:03:36Z

Each unprepared statement has to be prepared on all nodes.
The driver sends a PREPARE request to every node, and in response it gets PreparedMetadata which contains information about this prepared statement.

Currently we assume that every node returns the same PreparedMetadata; after all, we prepare the same statement on all of them. It turns out that this assumption isn't always true. During a rolling update different nodes might run different versions of Scylla, and different versions of Scylla might generate different PreparedMetadata for the same statement.
The driver ignores the fact that PreparedMetadata differs on some nodes, and because of this it might send a malformed request to them.

For example, we could have the following table and query:

CREATE TABLE tks.tab (p int, c int, PRIMARY KEY (p, c))
SELECT p FROM tks.tab WHERE p = :x AND c = :x

scylladb/scylladb@19a6e69 changed what prepared queries look like when the same named bind variable is used multiple times.

For the above example, here's what the prepared metadata looks like on 2021.1.16:

PreparedMetadata {
    flags: 1,
    col_count: 2,
    pk_indexes: [
        PartitionKeyIndex {
            index: 0,
            sequence: 0,
        },
    ],
    col_specs: [
        ColumnSpec {
            table_spec: TableSpec {
                ks_name: "tks",
                table_name: "tab",
            },
            name: "x",
            typ: Int,
        },
        ColumnSpec {
            table_spec: TableSpec {
                ks_name: "tks",
                table_name: "tab",
            },
            name: "x",
            typ: Int,
        },
    ],
}

And here's how it looks on 2022.2.12:

PreparedMetadata {
    flags: 1,
    col_count: 1,
    pk_indexes: [],
    col_specs: [
        ColumnSpec {
            table_spec: TableSpec {
                ks_name: "tks",
                table_name: "tab",
            },
            name: "x",
            typ: Int,
        },
    ],
}

Note that the number of bind variables has changed: on 2021.1.16 Scylla expects two bind variables, but on 2022.2.12 it expects only one.
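
To make the difference concrete, here is a minimal sketch of binding the named value `x` against each version of the metadata. `SimpleMetadata`, `bind_by_name`, `old_metadata` and `new_metadata` are hypothetical names made up for this illustration; they are not the driver's types, and only the bind marker names from `col_specs` are modelled.

```rust
use std::collections::HashMap;

/// Hypothetical, simplified stand-in for PreparedMetadata:
/// only the bind marker names from `col_specs` are kept.
#[derive(Debug, Clone, PartialEq)]
pub struct SimpleMetadata {
    pub bind_names: Vec<String>,
}

/// Metadata shaped like the 2021.1.16 response above: `x` is listed twice.
pub fn old_metadata() -> SimpleMetadata {
    SimpleMetadata {
        bind_names: vec!["x".to_string(), "x".to_string()],
    }
}

/// Metadata shaped like the 2022.2.12 response above: `x` is listed once.
pub fn new_metadata() -> SimpleMetadata {
    SimpleMetadata {
        bind_names: vec!["x".to_string()],
    }
}

/// Produce one value per bind marker listed in the metadata, looking each
/// marker up by name. Returns None if a named value is missing.
pub fn bind_by_name(meta: &SimpleMetadata, named: &HashMap<&str, i32>) -> Option<Vec<i32>> {
    meta.bind_names
        .iter()
        .map(|name| named.get(name.as_str()).copied())
        .collect()
}
```

With `x = 7`, binding against the old metadata yields two values and binding against the new metadata yields one, so a request serialized with one node's metadata has the wrong number of values for the other.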

If we try to do a rolling update from 2021.1.16 to 2022.2.12 and run the example query, the following will happen:

1. All nodes are on the old version
2. Driver starts up and prepares the query, it receives PreparedMetadata generated by the old version
3. The rolling update starts
4. One node goes down and comes back up, but it now runs the new version of Scylla
5. Driver tries to execute a prepared statement on this node, but receives UNPREPARED, as this node's cache has been cleared during the restart
6. Driver sends PREPARE request, and receives the PreparedMetadata generated by the new version of Scylla
7. Driver ignores that this node responded with a different PreparedMetadata
8. Driver tries to send queries to the new node, but it uses the old PreparedMetadata
9. The queries fail because the driver sends two bind variables, but the new Scylla expects only one

The driver should be aware that some nodes require the old PreparedMetadata, and some require the new one. It should keep all of the required versions around and use the correct one for each node.
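
One possible shape for this, as a rough sketch only: keep a map from node to the metadata that node last returned from PREPARE, and consult it when serializing a request for that node. `NodeId`, `SimpleMetadata` and `PerNodePrepared` are hypothetical names for this illustration and do not reflect the driver's actual types.

```rust
use std::collections::HashMap;

/// Hypothetical node identifier; the real driver identifies nodes differently.
pub type NodeId = u32;

/// Hypothetical, simplified stand-in for PreparedMetadata.
#[derive(Debug, Clone, PartialEq)]
pub struct SimpleMetadata {
    pub bind_names: Vec<String>,
}

/// Hypothetical prepared statement that remembers metadata per node
/// instead of assuming that all nodes agree.
#[derive(Debug, Default)]
pub struct PerNodePrepared {
    per_node: HashMap<NodeId, SimpleMetadata>,
}

impl PerNodePrepared {
    /// Record the metadata a node returned from PREPARE,
    /// replacing whatever that node returned earlier.
    pub fn record_prepare_response(&mut self, node: NodeId, meta: SimpleMetadata) {
        self.per_node.insert(node, meta);
    }

    /// Metadata to use when serializing a request for `node`.
    /// None means the statement has not been prepared on that node yet.
    pub fn metadata_for(&self, node: NodeId) -> Option<&SimpleMetadata> {
        self.per_node.get(&node)
    }
}
```

After a restarted node answers a re-PREPARE, calling `record_prepare_response` for that node replaces only its entry, so requests to nodes still on the old version keep using the old metadata.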

Refs: #575


mykaul · 2023-09-21T07:02:13Z

I wonder if it's a Rust-specific issue - do other drivers behave correctly?

avelanarius · 2024-01-16T09:57:52Z

I don't think this issue is solvable in the case of two clients.

Let's suppose that client A does this:

1. All nodes are on the old version
2. Driver starts up and prepares the query, it receives PreparedMetadata generated by the old version
3. The rolling update starts
4. One node goes down and comes back up, but it now runs the new version of Scylla
5. Driver tries to execute a prepared statement on this node, but receives UNPREPARED, as this node's cache has been cleared during the restart
6. Driver sends PREPARE request, and receives the PreparedMetadata generated by the new version of Scylla
7. Driver ignores that this node responded with a different PreparedMetadata
8. Driver tries to send queries to the new node, but it uses the old PreparedMetadata
9. The queries fail because the driver sends two bind variables, but the new Scylla expects only one

and in the meantime client B:

1. All nodes are on the old version
2. Driver starts and prepares the query, it receives PreparedMetadata generated by the old version
3. In the meantime, client A goes through all of its steps 1-9
4. Now, if client B tries to execute the previously prepared query, it won't have to reprepare it, because client A already did. Client B therefore has no idea that anything changed, and it will try to use the old prepared metadata.

I'm therefore closing this issue: I don't think there is a good solution for this without changing the protocol (protocol V5 solved this problem).
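
The two-client scenario can be sketched as a toy simulation. Everything here is an assumption made for illustration: `Node`, `Version` and `Response` are hypothetical, a statement is identified by its query string, and the only thing a node checks is the number of bound values (two on the old version, one on the new, as in the example from the issue description).

```rust
use std::collections::HashSet;

#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Version {
    Old,
    New,
}

#[derive(Debug, PartialEq)]
pub enum Response {
    Ok,
    Unprepared,
    WrongValueCount,
}

/// Hypothetical model of a node with a server-side prepared statement cache.
pub struct Node {
    pub version: Version,
    prepared: HashSet<String>,
}

impl Node {
    pub fn new(version: Version) -> Self {
        Node { version, prepared: HashSet::new() }
    }

    /// Number of bound values this node expects for the example query.
    fn expected_values(&self) -> usize {
        match self.version {
            Version::Old => 2,
            Version::New => 1,
        }
    }

    /// PREPARE: cache the statement and report how many values to bind.
    pub fn prepare(&mut self, stmt: &str) -> usize {
        self.prepared.insert(stmt.to_string());
        self.expected_values()
    }

    /// EXECUTE: UNPREPARED if the statement is not cached, otherwise
    /// accept the request only if the value count matches.
    pub fn execute(&self, stmt: &str, value_count: usize) -> Response {
        if !self.prepared.contains(stmt) {
            Response::Unprepared
        } else if value_count != self.expected_values() {
            Response::WrongValueCount
        } else {
            Response::Ok
        }
    }

    /// Restart on the new version; the prepared statement cache is cleared.
    pub fn upgrade(&mut self) {
        self.version = Version::New;
        self.prepared.clear();
    }
}
```

In this model, once client A has re-prepared the statement on the upgraded node, client B's next `execute` with its old value count gets `WrongValueCount` rather than `Unprepared`, so B never receives the signal that would trigger a re-PREPARE.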

Lorak-mmk · 2024-01-16T11:24:51Z

If protocol V5 solved this, shouldn't we keep this open until we and Scylla introduce support for it?

mykaul · 2024-01-16T11:31:19Z

> If protocol V5 solved this, shouldn't we keep this open until we and Scylla introduce support for it?

That's going to take some time...

Lorak-mmk · 2024-01-16T11:34:15Z

Having this issue open doesn't hurt anybody, and it describes a bug that users might encounter

This was referenced Sep 22, 2023

cql3/prepare_context: fix generating pk_indexes for duplicate named bind variables scylladb/scylladb#15526

Merged

Cassandra-incompatible behavior of prepared statement with same-named bind markers scylladb/scylladb#15559

Closed

Lorak-mmk self-assigned this Nov 15, 2023

avelanarius closed this as completed Jan 16, 2024

avelanarius closed this as not planned Jan 16, 2024

Lorak-mmk reopened this Jan 16, 2024

Lorak-mmk removed their assignment Jul 30, 2024
