- For `clean_factor()`: switched the names and values of `levels`
- Fix CRAN check error
- For `clean_Date()` and `clean_POSIXct()`: allow argument `max_date` to be the same length as `x`
- Fix for `digits` in `format()`
- Fix for currency symbols
- Fix CRAN check error
- CRAN fix for macOS (r-release-macos-arm64)
- Fix for latest R-devel, which does not allow `digits = 0` for `format()`
* `clean_Date()` now supports month-year format, for which it sets the day as 1:

  ```r
  clean_Date("March")
  #> (assuming format 'mmmm')
  #> [1] "2021-03-01"
  clean_Date("March 2020")
  #> (assuming format 'mmmm yyyy')
  #> [1] "2020-03-01"
  ```
* `freq()` now contains a `wt` argument to set the weights. The default (`NULL`) yields the old behaviour.
* Fixed a bug in `clean_POSIXct()` that led to the warning `Incompatible methods ("Ops.POSIXt", "Ops.Date") for ">"`
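
To illustrate what weighting a frequency table means, here is a minimal base R sketch. It is not the package's implementation, and the vectors `x` and `w` are made-up example data. Each value's count is replaced by the sum of the weights of its observations:

```r
# Hypothetical example data: observed values and one weight per observation
x <- c("a", "b", "a", "c", "a")
w <- c(2, 1, 0.5, 3, 1.5)

# Unweighted counts: every observation counts once
unweighted <- table(x)

# Weighted counts: sum the weights within each distinct value
weighted <- tapply(w, x, sum)

unweighted
weighted
```

With all weights equal to 1, the weighted counts match the unweighted ones, in line with the default (`NULL`) keeping the old behaviour.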
# cleaner 1.5.1
* New function `format_p_value()` to format raw p values according to the APA guideline
* `clean_Date()` now works with POSIX standards:
  ```r
  clean_Date("2020-11-12 12:24:12")
  clean_Date(c("2020-11-12 12:24:12", "2020-11-13"), guess_each = TRUE)
  ```
- Currency now prints and formats without symbols by default; use `as_symbol = TRUE` to print/format with currency symbols
- Support for older versions of R (v3.2)
- New function `format_names()` to quickly and easily change names of `data.frame` columns, `list`s or `character` vectors.

  ```r
  df <- data.frame(old.name = "test1", value = "test2")
  format_names(df, snake_case = TRUE)
  format_names(df, camelCase = TRUE)
  format_names(df, c(old.name = "new_name", value = "measurement"))

  library(dplyr)
  starwars %>%
    format_names(camelCase = TRUE) %>% # column names
    mutate(name = name %>%
             format_names(snake_case = TRUE)) # values in column
  ```
- New generic function `na_replace()` to replace `NA` values in any data type. Its default replacement value depends on the data type given as input: `0` for numeric values and class `matrix`, `FALSE` for class `logical`, today for class `Date`, and `""` otherwise.

  ```r
  na_replace(c(1, 2, NA, NA))
  #> [1] 1 2 0 0
  na_replace(c(1, 2, NA, NA), replacement = -1)
  #> [1]  1  2 -1 -1

  library(dplyr)
  starwars %>%
    na_replace(hair_color) # only replace NAs in this column

  starwars %>%
    na_replace() # replace NAs in all columns ("" for hair_color and 0 for birth_year)
  ```
- Support for the upcoming R 4.1.0
- New function `rdate()` to generate random dates (in analogy to e.g. `runif()`)
- Frequency tables (`freq()`):
  - Added availability of data to header
  - Fix for using `na.rm`
  - Fix for transforming to a visual histogram with `hist()`
  - New method for using `format()` on a frequency table
  - New method for transforming the values of a frequency table to a vector with `as.vector()`, which also supports dates

    ```r
    library(dplyr)
    library(cleaner)
    data.frame(dates = rdate(100)) %>%
      freq(dates) %>%
      as.vector()
    ```
- Fix for `clean_Date()` not accepting input that is already `POSIX` or `Date`
- When using `clean_Date(..., guess_each = TRUE)`, it now accepts the `format` parameter as a vector of options to let it choose from
- `clean_Date()` and `clean_POSIXct()` gained a parameter `max_date` (that defaults to today), so that they will never return years beyond a specified date:

  ```r
  # old
  clean_Date("23-01-67")
  #> [1] "2067-01-23"

  # new
  clean_Date("23-01-67")
  #> [1] "1967-01-23"
  #> Warning: Some years were decreased by 100 to not exceed today.
  #> Use clean_Date(..., max_date = Inf) to prevent this.

  clean_Date("23-01-67", max_date = Inf)
  #> [1] "2067-01-23"
  ```
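
The `max_date` rule can be pictured with a short base R sketch. This is an assumption-laden illustration, not the package's code: `decrease_century()` is a hypothetical helper that subtracts 100 years once from every date lying beyond `max_date`.

```r
# Hypothetical helper (not part of cleaner): move dates that lie beyond
# max_date back by one century, leaving all other dates untouched
decrease_century <- function(x, max_date = Sys.Date()) {
  x <- as.Date(x)
  too_late <- !is.na(x) & x > max_date
  if (any(too_late)) {
    lt <- as.POSIXlt(x[too_late])
    lt$year <- lt$year - 100 # POSIXlt stores years as an offset from 1900
    x[too_late] <- as.Date(lt)
  }
  x
}

# Fixed max_date so that the result does not depend on the current date
result <- decrease_century(
  as.Date(c("2067-01-23", "2019-05-01")),
  max_date = as.Date("2021-01-01")
)
format(result)
#> [1] "1967-01-23" "2019-05-01"
```

The sketch only shifts a date once, so a date more than a century beyond `max_date` would still exceed it.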
- Cleaned all code using the `lintr` package
- Fixed a bug when passing a `percentage` class into the `percentage()` function, i.e. `percentage(as.percentage(1))` would fail
- Fixed extremely small percentages, like `as.percentage(2.5e-14)`
- Added functions `clean_double()` and `clean_integer()`
- Added a method for `median()` in percentages
- Fixed a bug where `NA` in percentages would not be formatted correctly
- Fixed a bug in frequency tables where sometimes the number of digits used for percentages would be astronomical
- DUE TO CRAN POLICY: RENAMED THE PACKAGE TO `cleaner`
- Added support for percentages as a new class: `as.percentage()` and `clean_percentage()`. They also come with 'S3 methods' for `print`, `format`, `sum`, `min` and `max`.
- More robust coercing of dates
- Support for negative values for `clean_numeric()`, `clean_percentage()` and `clean_currency()`
- Fix for `clean_character()` on R v3.5 and lower
- Fix for digits in frequency tables for numeric values
- Added support for currency as a new class: `as.currency()` and `clean_currency()`. They also come with 'S3 methods' for `print`, `format`, `sum`, `min` and `max`.
- Added `clean_POSIXct()` to clean date/time objects
- `top_freq()` now correctly selects bottom items when using a negative number for `n`
- `freq.default()` is now exported for use in other packages
- All numeric calculations in the header of frequency tables now use the same algorithm as used by Minitab and SPSS (see 'Type 6' on `stats::quantile()`)
- More robust results for `clean_character()`; it also keeps in-between spaces now
- `clean_numeric()` now supports currency
- Fix for `freq()` where the percentage of NAs in the header was not calculated correctly
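
For readers unfamiliar with the 'Type 6' algorithm mentioned above, this small sketch compares it with R's default (type 7) using `stats::quantile()`. The vector `x` is made-up example data, and the sketch says nothing about how the frequency table header calls it internally.

```r
# Hypothetical example data
x <- c(1, 2, 4, 8, 16)

# R's default quantile algorithm is type 7
q_type7 <- unname(stats::quantile(x, probs = 0.25))

# Type 6 is the definition used by Minitab and SPSS
q_type6 <- unname(stats::quantile(x, probs = 0.25, type = 6))

q_type7
#> [1] 2
q_type6
#> [1] 1.5
```

The two types interpolate between order statistics at different positions, so they can disagree noticeably on small samples like this one.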
- First release