Small(?) multipart upload has errors when determining mime type. #3658

Closed
chrisvanrun opened this issue Oct 25, 2024 · 3 comments · Fixed by #3661

Comments

@chrisvanrun
Contributor

chrisvanrun commented Oct 25, 2024

Sentry issue: An error occurred (InvalidRange) when calling the GetObject operation: The requested range is not satisfiable.

mimetype_from_file() uses the boto client to fetch the initial byte range; however, it seems the upload may be less than 2048 bytes long. Not sure why it is a multipart upload; I'd expect multipart uploads to have a cutoff at a certain size. Need to investigate.

A quick fix would be wrapping it in a try-except as follows:

try:
    header = self._client.get_object(
        Bucket=self.bucket,
        Key=self.key,
        Range="bytes=0-2047",
    )["Body"].read()
except self._client.exceptions.InvalidRange as e:
    # Fallback if range is out of bounds
    header = self._client.get_object(Bucket=self.bucket, Key=self.key)["Body"].read()
@jmsmkn
Member

jmsmkn commented Oct 28, 2024

The suggested error handling there is not correct; you would need to catch botocore.exceptions.ClientError. It would be something like:

try:
    header = self._client.get_object(
        Bucket=self.bucket, Key=self.key, Range="bytes=0-2047"
    )["Body"].read()
except botocore.exceptions.ClientError as error:
    if error.response["Error"]["Code"] == "InvalidRange":
        header = self._client.get_object(Bucket=self.bucket, Key=self.key)["Body"].read()
    else:
        raise error

However, I wonder if that could lead to a DoS if, in some odd partial-upload situation, the first bytes are missing. No idea here, but maybe? Either way, we can be a bit more defensive:

object_head = self._client.head_object(Bucket=self.bucket, Key=self.key)
object_size = int(object_head["ContentLength"])
max_bytes = min(2047, object_size)
header = self._client.get_object(
    Bucket=self.bucket,
    Key=self.key,
    Range=f"bytes=0-{max_bytes}",
)["Body"].read()

@jmsmkn
Member

jmsmkn commented Oct 28, 2024

Actually, I don't think this explanation makes sense. We have a test with a small file introduced in #3350.

@jmsmkn
Member

jmsmkn commented Oct 28, 2024

It is a problem with a zero bytes file.
