Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Palantir Data Characterization: Quantitative Data #62

Open
hlehmann17 opened this issue Jul 13, 2020 · 0 comments
Open

Palantir Data Characterization: Quantitative Data #62

hlehmann17 opened this issue Jul 13, 2020 · 0 comments
Labels
Harmonization & Analytics Issues which involve both Data Ingestion & Harmonization & Analytics workstreams

Comments

@hlehmann17
Copy link
Collaborator

hlehmann17 commented Jul 13, 2020

Generalization: Normalizing Values In Measurement, Observation Where Implemented
For all numerical concept IDs [ OR set of given concept IDs] List of target concept ids   Concept ID white list 
Get frequency distributions of units   Data Quality Portal
...Unmapped, Mapped to 0, some other information
...Are we allowed to impute units? (from population? from other measurements within patient)
Determine significance of the distribution    
....Is there problematic data partner? Look for units = 0  
....Do different source data models (even tables within CDMs) behave differently? e.g., Obs_Clin
Articulate a formula for normalizing values and units    
Fill the (new) column, Normalized_value, Normalized_unit
@DaveraGabriel DaveraGabriel added the Harmonization & Analytics Issues which involve both Data Ingestion & Harmonization & Analytics workstreams label Jul 15, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Harmonization & Analytics Issues which involve both Data Ingestion & Harmonization & Analytics workstreams
Projects
None yet
Development

No branches or pull requests

2 participants