Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add basic imputation for measurement values. #461

Open
egillax opened this issue Jun 10, 2024 · 1 comment
Open

Add basic imputation for measurement values. #461

egillax opened this issue Jun 10, 2024 · 1 comment

Comments

@egillax
Copy link
Collaborator

egillax commented Jun 10, 2024

I think a basic functionality that the package should have built in is basic imputation. Imputing based on mean, median or some other simply calculated single value. This would primarily be useful for measurement values. Would also be nice to have a threshold for including a certain measurement, i.e. if missing values are more than 50% don't include them. Or a threshold based on absolute number of measurements, since otherwise the imputation would be to noisy.

We have today the new age stratified imputation but I think that is maybe more for advanced use cases.

This should be very straightforward to add using the featureEngineering api.

@egillax
Copy link
Collaborator Author

egillax commented Nov 8, 2024

See scikit-learn for inspiration: https://scikit-learn.org/1.5/modules/impute.html

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant