Skip to content

Commit

Permalink
update news
Browse files Browse the repository at this point in the history
  • Loading branch information
assignUser committed Jul 20, 2024
1 parent ed67a42 commit 8b4d81d
Showing 1 changed file with 24 additions and 5 deletions.
29 changes: 24 additions & 5 deletions r/NEWS.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,12 +21,31 @@

# arrow 17.0.0

* R functions that users write that use functions that Arrow supports in dataset queries now can be used in queries too. Previously, only functions that used arithmetic operators worked. For example, `time_hours <- function(mins) mins / 60` worked, but `time_hours_rounded <- function(mins) round(mins / 60)` did not; now both work. These are automatic translations rather than true user-defined functions (UDFs); for UDFs, see `register_scalar_function()`. (#41223)
## New features

* R functions that users write that use functions that Arrow supports in dataset
queries now can be used in queries too. Previously, only functions that used
arithmetic operators worked.
For example, `time_hours <- function(mins) mins / 60` worked,
but `time_hours_rounded <- function(mins) round(mins / 60)` did not;
now both work. These are automatic translations rather than true user-defined
functions (UDFs); for UDFs, see `register_scalar_function()`. (#41223)
* `mutate()` expressions can now include aggregations, such as `x - mean(x)`. (#41350)
* `summarize()` supports more complex expressions, and correctly handles cases where column names are reused in expressions.
* The `na_matches` argument to the `dplyr::*_join()` functions is now supported. This argument controls whether `NA` values are considered equal when joining. (#41358)
* R metadata, stored in the Arrow schema to support round-tripping data between R and Arrow/Parquet, is now serialized and deserialized more strictly. This makes it safer to load data from files from unknown sources into R data.frames. (#41969)
* The minimum version of the Arrow C++ library the Arrow R package can be built with has been bumped to 15.0.0 (#42241)
* `summarize()` supports more complex expressions, and correctly handles cases
where column names are reused in expressions.
* The `na_matches` argument to the `dplyr::*_join()` functions is now supported.
This argument controls whether `NA` values are considered equal when joining. (#41358)
* R metadata, stored in the Arrow schema to support round-tripping data between
R and Arrow/Parquet, is now serialized and deserialized more strictly.
This makes it safer to load data from files from unknown sources into R data.frames. (#41969)

## Minor improvements and fixes
* Turn on the S3 and ZSTD features by default for macOS. (#42210)
* Fix bindings in Math group generics. (#43162)
* Fix a bug in our implementation of `pull` on grouped datasets, it now
returns the expected column. (#43172)
* The minimum version of the Arrow C++ library the Arrow R package can be built
with has been bumped to 15.0.0 (#42241)

# arrow 16.1.0

Expand Down

0 comments on commit 8b4d81d

Please sign in to comment.