Column profiled as int but should be text/string #1130
Labels: Bug (Something isn't working)
Comments
And here's an even simpler reproducible example:
And the output:
General Information:
OS: Ubuntu 22.04
Python version: 3.10.12
Library version: 0.10.9
Describe the bug:
I have a parquet file column, org_number, that should be treated as text but is being profiled as an int.
Pandas info reports it as an object:
When I use Pandas describe(), it doesn't show any numeric statistics like min, max, stddev, etc., which is correct.
The output from the profiler:
To Reproduce:
The code I'm using:
I can't provide the raw data, but I can test things. The data is interesting in that the values look almost like integers, but many of the entries have leading zeros, as you can see in the samples.
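To illustrate why the leading zeros matter, here is a minimal sketch in plain Python. It is not the profiler's actual inference logic. The helper names survives_int_round_trip and should_stay_text are hypothetical, and the sample values are made up rather than taken from the real data. The idea is that a value like "0012345" changes when it is cast to int and back, so a column containing such values loses information if it is typed as an integer.

```python
def survives_int_round_trip(value: str) -> bool:
    """Return True if str(int(value)) reproduces the original string exactly."""
    try:
        return str(int(value)) == value
    except ValueError:
        # Not parseable as an integer at all, so it cannot round-trip.
        return False


def should_stay_text(values: list[str]) -> bool:
    """Hypothetical rule: keep the column as text if any value changes when cast to int."""
    return any(not survives_int_round_trip(v) for v in values)


# Made-up sample values (assumption), not the reporter's data.
samples = ["0012345", "9876543", "0000042"]
print(should_stay_text(samples))        # True: leading zeros would be lost
print(should_stay_text(["12", "345"]))  # False: every value round-trips
```

Under this assumed rule, a single zero-padded entry is enough to keep the column as text, which matches the behavior I would expect for identifier-like columns.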
Expected behavior:
I would expect the type to be string/text.
Screenshots:
Additional context: