Refrain from transcoding SBE field names in snake_case #79

salsferrazza · 2022-12-20T01:54:39Z

I believe there is an option in the SBE decoding library that snake-cases all of the decoded field names. This should be suppressed, and the default should be verbatim transcoding of each field name as specified in the schema.

mservidio · 2022-12-22T03:06:27Z

@salsferrazza Yes, field names are converted using this:

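import re

def convert_to_underscore(name):
    # Strip leading/trailing '@' and '#', then insert underscores at camelCase boundaries and lowercase.
    name = name.strip('@').strip('#')
    sub_str = re.sub('(.)([A-Z][a-z]+)', r'\1_\2', name)
    return re.sub('([a-z0-9])([A-Z])', r'\1_\2', sub_str).lower()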
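
For illustration, here is roughly how that function behaves on a few sample names. The names are made up for this example and are not taken from any particular schema. Note that two different spellings can collapse to the same output, so the original name cannot always be recovered from the converted one.

```python
import re

def convert_to_underscore(name):
    name = name.strip('@').strip('#')
    sub_str = re.sub('(.)([A-Z][a-z]+)', r'\1_\2', name)
    return re.sub('([a-z0-9])([A-Z])', r'\1_\2', sub_str).lower()

# Illustrative field names only (assumptions, not from a real schema).
samples = ['MDEntryPx', 'SecurityID', 'securityId', '@name']
converted = {s: convert_to_underscore(s) for s in samples}

for original, result in converted.items():
    print(f'{original} -> {result}')
# MDEntryPx -> md_entry_px
# SecurityID -> security_id
# securityId -> security_id
# @name -> name
```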

However, naming requirements differ per output type. For example, BigQuery won't accept a dash ('-') in a column name. So if we did something like this, we would still need some way to sanitize field names based on the requirements of each output type.

See: https://cloud.google.com/bigquery/docs/schemas
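
One possible shape for this, as a minimal sketch and not the project's actual code: keep the schema name verbatim by default and apply a sanitizer only where the output type needs one. The function names, the `SANITIZERS` registry, and the `'json'` output type are all hypothetical. The BigQuery rule used here (letters, digits, and underscores only, not starting with a digit) is an assumption that should be checked against the page linked above.

```python
import re

def sanitize_bigquery(name):
    # Assumed rule: only letters, digits, and underscores are allowed,
    # and the name must start with a letter or underscore.
    cleaned = re.sub(r'[^A-Za-z0-9_]', '_', name)
    if not re.match(r'[A-Za-z_]', cleaned):
        cleaned = '_' + cleaned
    return cleaned

def keep_verbatim(name):
    # Default: pass the schema's field name through unchanged.
    return name

# Hypothetical registry mapping an output type to its sanitizer.
SANITIZERS = {
    'bigquery': sanitize_bigquery,
    'json': keep_verbatim,
}

def output_field_name(name, output_type):
    # Unknown output types fall back to the verbatim name.
    sanitizer = SANITIZERS.get(output_type, keep_verbatim)
    return sanitizer(name)
```

With this layout, a name such as `Leg-Side` would become `Leg_Side` for BigQuery while staying unchanged for outputs that have no naming restrictions, and camelCase names from the schema would pass through untouched.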

mservidio assigned salsferrazza Dec 22, 2022

mservidio added the enhancement label Dec 22, 2022
