From 8b4d81d552a46faf9292c51baf8f8ce6d65a9118 Mon Sep 17 00:00:00 2001 From: Jacob Wujciak-Jens Date: Sat, 20 Jul 2024 05:41:26 +0200 Subject: [PATCH] update news --- r/NEWS.md | 29 ++++++++++++++++++++++++----- 1 file changed, 24 insertions(+), 5 deletions(-) diff --git a/r/NEWS.md b/r/NEWS.md index 6159f3863c375..0e6e4634a0af8 100644 --- a/r/NEWS.md +++ b/r/NEWS.md @@ -21,12 +21,31 @@ # arrow 17.0.0 -* R functions that users write that use functions that Arrow supports in dataset queries now can be used in queries too. Previously, only functions that used arithmetic operators worked. For example, `time_hours <- function(mins) mins / 60` worked, but `time_hours_rounded <- function(mins) round(mins / 60)` did not; now both work. These are automatic translations rather than true user-defined functions (UDFs); for UDFs, see `register_scalar_function()`. (#41223) +## New features + +* R functions that users write that use functions that Arrow supports in dataset + queries now can be used in queries too. Previously, only functions that used + arithmetic operators worked. + For example, `time_hours <- function(mins) mins / 60` worked, + but `time_hours_rounded <- function(mins) round(mins / 60)` did not; + now both work. These are automatic translations rather than true user-defined + functions (UDFs); for UDFs, see `register_scalar_function()`. (#41223) * `mutate()` expressions can now include aggregations, such as `x - mean(x)`. (#41350) -* `summarize()` supports more complex expressions, and correctly handles cases where column names are reused in expressions. -* The `na_matches` argument to the `dplyr::*_join()` functions is now supported. This argument controls whether `NA` values are considered equal when joining. (#41358) -* R metadata, stored in the Arrow schema to support round-tripping data between R and Arrow/Parquet, is now serialized and deserialized more strictly. This makes it safer to load data from files from unknown sources into R data.frames. (#41969) -* The minimum version of the Arrow C++ library the Arrow R package can be built with has been bumped to 15.0.0 (#42241) +* `summarize()` supports more complex expressions, and correctly handles cases + where column names are reused in expressions. +* The `na_matches` argument to the `dplyr::*_join()` functions is now supported. + This argument controls whether `NA` values are considered equal when joining. (#41358) +* R metadata, stored in the Arrow schema to support round-tripping data between + R and Arrow/Parquet, is now serialized and deserialized more strictly. + This makes it safer to load data from files from unknown sources into R data.frames. (#41969) + +## Minor improvements and fixes +* Turn on the S3 and ZSTD features by default for macOS. (#42210) +* Fix bindings in Math group generics. (#43162) +* Fix a bug in our implementation of `pull` on grouped datasets, it now + returns the expected column. (#43172) +* The minimum version of the Arrow C++ library the Arrow R package can be built + with has been bumped to 15.0.0 (#42241) # arrow 16.1.0