Enable federated XGBoost using bootstrap aggregation in Task Runner #1151

kta-intel · 2024-11-15T19:40:33Z

This PR enables TaskRunner-based federated XGBoost using bootstrap aggregation.

Specifically, this PR:

- creates an xgb_higgs task runner workspace to train on the higgs dataset [ref], with all required code (i.e. src/taskrunner.py, src/dataloader.py, plan/*.yaml, etc.)
- adds a tasks_xgb.yaml to enable the new FedBaggingXGBoost aggregation when running xgb training workloads
- adds a delta_updates parameter to Aggregator in order to bypass delta updating (for deep learning models, sending weight deltas makes sense because the model size should stay relatively consistent; for tree-based algorithms it makes less sense because more trees are added over time)
  - delta_updates is set to true by default to preserve normal behavior; the xgboost taskrunner explicitly sets it to false to bypass it
- introduces a new loader_xgb.py as the backend / superclass to src/dataloader.py
- introduces a new runner_xgb.py as the backend / superclass to src/taskrunner.py
- introduces a new federated bootstrap aggregation algorithm for xgboost in aggregation_function.fed_bagging, which bags the latest trees into a global model, consistent with currently accepted federated xgboost algorithms in the industry
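
To illustrate the bagging idea described above, here is a minimal sketch in plain Python. It is not the code in aggregation_function.fed_bagging: the dict-based model layout, the `latest_trees` helper, and the `trees_per_round` parameter are all hypothetical stand-ins, chosen only to show how the newest trees from each collaborator could be appended to a global model.

```python
from typing import Dict, List

# Hypothetical stand-ins: a "tree" is a plain dict, a "model" is a dict
# holding a list of trees. Real XGBoost models are far richer than this.
Tree = Dict[str, object]
Model = Dict[str, List[Tree]]


def latest_trees(local_model: Model, trees_per_round: int = 1) -> List[Tree]:
    """Return the trees a collaborator added in its most recent round."""
    return local_model["trees"][-trees_per_round:]


def fed_bagging(global_model: Model, local_models: List[Model],
                trees_per_round: int = 1) -> Model:
    """Append each collaborator's newest trees to a copy of the global model."""
    bagged: Model = {"trees": list(global_model["trees"])}
    for local in local_models:
        for tree in latest_trees(local, trees_per_round):
            new_tree = dict(tree)
            # Re-index so tree ids stay unique in the bagged model.
            new_tree["id"] = len(bagged["trees"])
            bagged["trees"].append(new_tree)
    return bagged


# Simulate two rounds with two collaborators (names are illustrative).
global_model: Model = {"trees": []}
for round_num in range(2):
    local_models = []
    for name in ("col1", "col2"):
        # Each collaborator starts from the global model and boosts one tree.
        local: Model = {"trees": list(global_model["trees"])}
        local["trees"].append(
            {"id": len(local["trees"]), "owner": name, "round": round_num}
        )
        local_models.append(local)
    global_model = fed_bagging(global_model, local_models)

print(len(global_model["trees"]))
```

In this sketch the global model grows from 0 to 2 to 4 trees across rounds, which is the reason given above for bypassing delta updates: an element-wise difference between two rounds is not well defined when the model changes size.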

Signed-off-by: kta-intel <[email protected]>


kta-intel added 15 commits November 8, 2024 09:05

initial xgboost workspace commit

33d304f

Signed-off-by: kta-intel <[email protected]>

updating taskrunner and aggregation function

93dc8b4

Signed-off-by: kta-intel <[email protected]>

runner updates

52fea84

Signed-off-by: kta-intel <[email protected]>

logic for loader

1275fd6

Signed-off-by: kta-intel <[email protected]>

enabling work

49f5cdf

Signed-off-by: kta-intel <[email protected]>

further enabling work

ddece36

Signed-off-by: kta-intel <[email protected]>

fix first round local validation

c7e2d76

Signed-off-by: kta-intel <[email protected]>

remove need to convert to float64

9d385a7

Signed-off-by: kta-intel <[email protected]>

fix model save

ce4b34f

Signed-off-by: kta-intel <[email protected]>

remove set_trace and fix spacing

70e4171

Signed-off-by: kta-intel <[email protected]>

rename workspace and fix plan

3d2df78

Signed-off-by: kta-intel <[email protected]>

fix lint

54cdc5e

Signed-off-by: kta-intel <[email protected]>

more formatting fixes

51a0afa

Signed-off-by: kta-intel <[email protected]>

revert space removal

d3937ef

Signed-off-by: kta-intel <[email protected]>

Revert "revert space removal"

dd2027c

This reverts commit d3937ef. Signed-off-by: kta-intel <[email protected]>

kta-intel force-pushed the xgboost-fedbagging branch from 6c79178 to dd2027c Compare November 15, 2024 21:50

kta-intel added 3 commits November 15, 2024 13:53

revert changes on interface.plan

e008e4a

Signed-off-by: kta-intel <[email protected]>

remove from history. unchanged

3cbd5e5

Signed-off-by: kta-intel <[email protected]>

reverting back to fresh state for interface.plan

051d8fc

Signed-off-by: kta-intel <[email protected]>

kta-intel changed the title ~~[WIP] Enable federated XGBoost using bootstrap aggregation in Task Runner~~ Enable federated XGBoost using bootstrap aggregation in Task Runner Nov 15, 2024

kta-intel marked this pull request as ready for review November 15, 2024 22:14

kta-intel requested review from MasterSkepticista, teoparvanov and psfoley November 15, 2024 22:14

kta-intel and others added 4 commits November 15, 2024 17:18

Merge branch 'securefederatedai:develop' into xgboost-fedbagging

58172c1

move delta_updates below assigner in args

a8d9b59

Signed-off-by: kta-intel <[email protected]>

add delta_update default to True, remove from yaml

5f1d909

Signed-off-by: kta-intel <[email protected]>

enable modin pandas

3670bd0

Signed-off-by: kta-intel <[email protected]>
