Commit

[#141] Add a docstring for core.transforms.generate_transforms()
riley-harper committed Aug 27, 2024
1 parent ada39ea commit 378125e
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions hlink/linking/core/transforms.py
@@ -336,6 +336,16 @@ def generate_transforms(
    is_a: bool,
    id_col: str,
) -> DataFrame:
    """Generate feature selection columns and return the input dataframe with these new columns attached.

    Args:
        spark: the Spark session
        df_selected: the input Spark DataFrame
        feature_selections: a list of feature selections to compute
        link_task: the current LinkTask
        is_a: whether this is dataset A (True) or dataset B (False)
        id_col: the name of the identifier column in the input data frame
    """
    not_skipped_feature_selections = [
        c
        for c in feature_selections
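
The docstring added in this commit follows the "summary line, blank line, Args:" layout, so tools such as `help()` and `inspect.getdoc()` can display it with the indentation cleaned up. The sketch below shows this with a hypothetical stand-in, `generate_transforms_stub`, which copies only the parameter names and docstring from the diff. Its body is a placeholder, not hlink's implementation, and the small `Args:` parser is an illustration rather than part of hlink.

```python
import inspect


def generate_transforms_stub(
    spark, df_selected, feature_selections, link_task, is_a, id_col
):
    """Generate feature selection columns and return the input dataframe with these new columns attached.

    Args:
        spark: the Spark session
        df_selected: the input Spark DataFrame
        feature_selections: a list of feature selections to compute
        link_task: the current LinkTask
        is_a: whether this is dataset A (True) or dataset B (False)
        id_col: the name of the identifier column in the input data frame
    """
    # Placeholder body (assumption): the real function computes new columns with Spark.
    return df_selected


# inspect.getdoc() strips the common leading indentation from the docstring.
doc = inspect.getdoc(generate_transforms_stub)
lines = doc.splitlines()
summary = lines[0]

# Collect "name: description" pairs from the lines that follow "Args:".
documented_args = {}
in_args = False
for line in lines[1:]:
    stripped = line.strip()
    if stripped == "Args:":
        in_args = True
    elif in_args and ":" in stripped:
        name, description = stripped.split(":", 1)
        documented_args[name.strip()] = description.strip()

# Compare the documented names with the names in the signature.
signature_params = list(inspect.signature(generate_transforms_stub).parameters)
print(summary)
print(list(documented_args) == signature_params)
```

A check like the final comparison is one way to notice when a docstring drifts out of sync with the signature it describes.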