-
Notifications
You must be signed in to change notification settings - Fork 329
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compacted SST file reappears in region #2013
Comments
We suspect it may related to some checkpoint issue:
We can see two checkpoint ran with same |
The reason of this bug is the dirty The previous checkpoint deleted all actions between 211~253, while the latter checkpoint losts these actions, which includes those removing SST files. |
fix: dirty last checkpoint metadata file when enable object store caching, #2013
fix: dirty last checkpoint metadata file when enable object store caching, #2013
fix: dirty last checkpoint metadata file when enable object store caching, #2013
Looks like this problem reappeared.
|
fix: dirty last checkpoint metadata file when enable object store caching, GreptimeTeam#2013
What type of bug is this?
Unexpected error
What subsystems are affected?
Datanode
What happened?
pod_name:greptimedb-frontend-5db6cc4db4-2sgpm
pod_namespace:greptimedb
message:2023-07-14T08:35:21.681888Z ERROR client::database: Failed to do Flight get, addr: greptimedb-datanode-0.greptimedb-datanode.greptimedb:4001, code: Internal error, source: Failed to do Flight get, code: Internal error, source: NotFound (persistent) at read, context: { response: Parts { status: 404, version: HTTP/1.1, headers: {"x-amz-request-id": "[...]", "x-amz-id-2": "[...]", "x-amz-delete-marker": "true", "x-amz-version-id": "[...]", "content-type": "application/xml", "transfer-encoding": "chunked", "date": "Fri, 14 Jul 2023 08:35:21 GMT", "server": "AmazonS3"} }, service: s3, path: data/[...]/public/1968/1968_0000000000/4caac3ad-4ad7-4b3b-8a97-f00b39bfe57a.parquet, range: 0- } => S3Error { code: "NoSuchKey", message: "The specified key does not exist.", resource: "", request_id: "RG8E0ZM8E94P029B" }
Datanode logs:
Looks like the compacted SST file
4caac3ad-4ad7-4b3b-8a97-f00b39bfe57a
is deleted, but after a week, it reappears in region's SST file list.What operating system did you use?
NA
Relevant log output and stack trace
No response
How can we reproduce the bug?
NA
The text was updated successfully, but these errors were encountered: