From b05f76613878d16c674069b3d677a983e4720bf2 Mon Sep 17 00:00:00 2001 From: Katie Lamb Date: Mon, 2 Dec 2024 13:46:39 -0500 Subject: [PATCH] update release notes --- docs/release_notes.rst | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/docs/release_notes.rst b/docs/release_notes.rst index d5e86fc28..19d7abac7 100644 --- a/docs/release_notes.rst +++ b/docs/release_notes.rst @@ -57,11 +57,17 @@ EPA CEMS ~~~~~~~~ * Added 2024 Q3 of CEMS data. See :issue:`3943` and :pr:`3948`. -FERC to EIA Record Linkage +Record Linkage ^^^^^^^^^^^^^^^^^^^^^^^^^^ * Updated the ``splink`` FERC to EIA development notebook to be compatible with the latest version of ``splink``. This notebook is not run in production but is helpful for visualizing model weights and what is happening under the hood. +* Updated ``pudl.analysis.record_linkage.name_cleaner`` company name cleaning + module to be more efficient by removing all ``.apply`` and instead use + ``pd.Series.replace`` to make regex replacement rules vectorized. Also removed + some of the allowed replacement rules to make the cleaner simpler and more + effective. This module runs approximately 3x faster now when cleaning a + string Series. .. _release-v2024.10.0: