🐛 Fix a source of spurious data differences in datadiff
larsyencken committed Dec 14, 2023
1 parent 9a8df70 commit 736c992
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions etl/tempcompare.py
@@ -137,8 +137,8 @@ def series_equals(
     # then compare the rest of non-nan values (we cannot compare the entire series because pandas
     # complains with `TypeError: boolean value of NA is ambiguous`)
     ix = ~s1.isnull() & ~s2.isnull()
-    eq[ix] |= s1[ix].eq(s2[ix])
-    return eq
+    eq[ix] |= s1[ix].eq(s2[ix])  # NOTE: this can change dtype of eq to object
+    return eq.astype(bool)


 def df_equals(df1: pd.DataFrame, df2: pd.DataFrame, **kwargs) -> pd.DataFrame:
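
The effect of the change can be illustrated with a minimal sketch. Only the last three statements of the function body come from the diff above. The name `series_equals_sketch`, the way `eq` is initialised, and the sample data are assumptions made for illustration, not the repository's code. Whether the in-place `|=` actually changes the dtype of `eq` depends on the pandas version and the input dtypes, so the sketch only relies on the final `astype(bool)` to give a plain boolean result.

```python
import pandas as pd


def series_equals_sketch(s1: pd.Series, s2: pd.Series) -> pd.Series:
    # Assumption: positions where both values are missing count as equal.
    # The real initialisation of `eq` is not visible in the diff.
    eq = (s1.isnull() & s2.isnull()).astype(bool)

    # From the diff: compare only positions where both values are present.
    ix = ~s1.isnull() & ~s2.isnull()
    eq[ix] |= s1[ix].eq(s2[ix])  # may leave eq with a non-bool dtype

    # From the diff: normalise the result back to a plain bool dtype.
    return eq.astype(bool)


# Nullable integer series: .eq() on these returns a nullable "boolean" result.
s1 = pd.Series([1, 2, None], dtype="Int64")
s2 = pd.Series([1, 3, None], dtype="Int64")

result = series_equals_sketch(s1, s2)
print(result.dtype)
print(result.tolist())
```

Casting at the end keeps the return type stable for callers regardless of which dtype the intermediate assignment produced.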