Andromeda object with JSON files instead of RDS files #22

lhjohn · 2021-06-29T10:28:41Z

Hi,

I am looking into the possibility of writing and reading Andromeda data objects directly from other programming languages or web interfaces such as C++ or Python. Currently, we can read and write the covariate data in Python or C++ using SQLite (example).

However, there are a number of RDS files in the Andromeda object (cohort.rds, metaData.rds, outcomes.rds, etc..), which can be read exclusively by R.

How feasible would it be to natively support a JSON version of these RDS files in the Andromeda object.

This would allow us to implement our "own version of Andromeda" using C++ or Python but still keep compatibility with OHDSI's R eco system.

msuchard · 2021-06-29T18:10:58Z

An alternative to re-engineering Andromeda to support yet another language, maybe we could try:

https://github.com/ofajardo/pyreadr

ablack3 · 2021-07-01T18:47:26Z

I certainly like the idea of being able to read covariate data in python. Are there any OHDSI python packages where a loadAndromeda function could live? I guess it would be in https://github.com/OHDSI/DeepPatientLevelPrediction.

lhjohn · 2021-07-01T19:22:47Z

We are working on an initial implementation in OHDSI/DeepPatientLevelPrediction, as this package will be build primarily using PyTorch.

I am using @msuchard suggested package pyreadr. It seems to work fine for unnested, standard dataframes.

schuemie · 2021-07-06T07:52:07Z

It might be important to separate two things:

On the one hand we have Andromeda objects that are mainly a SQLite database (zipped when stored) and some R attributes (typically an R list object with some meta-data).

On the other hand there are more complex objects that include Andromeda objects. I think the PlpData object is actually an R list where one member is an Andromeda object.

In packages that I've been involved in (FeatureExtraction, CohortMethod, SelfControlledCaseSeries) the data objects inherit from Andromeda objects, and are therefore still just a SQLite database with some meta data as attributes. For example, the covariateData object created by FeatureExtraction is an Andromeda object with several tables in the SQLite database and a single 'metaData' attribute that is a list with two members: the populationSize (numeric) and a cohortId vector of numeric.

ablack3 · 2021-10-25T23:00:52Z

I think Andromeda is responsible for saving and restoring user defined attributes which could be any R object. @schuemie Would it be reasonable to restrict what types of attributes can be assigned to an Andromeda object? For example is fitted model a valid andromeda attribute?

library(Andromeda)
and <- andromeda(cars = cars)
attr(and, "model") <- lm(speed ~ dist, and$cars)

# I dont think I can convert a fitted model to json easily
jsonlite::toJSON(and$model)

schuemie · 2021-10-26T04:58:11Z

Yes, I don't see an issue with restricting the attributes to objects that can be converted to JSON.

One annoyance I've found when converting R objects to JSON is that the object class attribute is lost. I've written some code that preserves object attributes like these in the JSON, as you can see here. I recommend also using that here as well.

lhjohn mentioned this issue Jun 29, 2021

Read Andromeda object directly from Python OHDSI/DeepPatientLevelPrediction#7

Closed

This was referenced Nov 20, 2022

Switch to Arrow backend #40

Merged

v1.0 release candidate #42

Closed

lhjohn mentioned this issue Jul 3, 2024

Code base refactor - Data class OHDSI/PatientLevelPrediction#469

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Andromeda object with JSON files instead of RDS files #22

Andromeda object with JSON files instead of RDS files #22

lhjohn commented Jun 29, 2021

msuchard commented Jun 29, 2021

ablack3 commented Jul 1, 2021 •

edited

Loading

lhjohn commented Jul 1, 2021

schuemie commented Jul 6, 2021

ablack3 commented Oct 25, 2021

schuemie commented Oct 26, 2021

Andromeda object with JSON files instead of RDS files #22

Andromeda object with JSON files instead of RDS files #22

Comments

lhjohn commented Jun 29, 2021

msuchard commented Jun 29, 2021

ablack3 commented Jul 1, 2021 • edited Loading

lhjohn commented Jul 1, 2021

schuemie commented Jul 6, 2021

ablack3 commented Oct 25, 2021

schuemie commented Oct 26, 2021

ablack3 commented Jul 1, 2021 •

edited

Loading