Add basic imputation for measurement values. #461

egillax · 2024-06-10T10:53:52Z

I think a basic functionality that the package should have built in is basic imputation. Imputing based on mean, median or some other simply calculated single value. This would primarily be useful for measurement values. Would also be nice to have a threshold for including a certain measurement, i.e. if missing values are more than 50% don't include them. Or a threshold based on absolute number of measurements, since otherwise the imputation would be to noisy.

We have today the new age stratified imputation but I think that is maybe more for advanced use cases.

This should be very straightforward to add using the featureEngineering api.

The text was updated successfully, but these errors were encountered:

egillax · 2024-11-08T10:36:20Z

See scikit-learn for inspiration: https://scikit-learn.org/1.5/modules/impute.html

egillax added the enhancement label Nov 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add basic imputation for measurement values. #461

Add basic imputation for measurement values. #461

egillax commented Jun 10, 2024

egillax commented Nov 8, 2024

Add basic imputation for measurement values. #461

Add basic imputation for measurement values. #461

Comments

egillax commented Jun 10, 2024

egillax commented Nov 8, 2024