Adding Completeness Module #141

johnsam7 · 2024-02-29T16:56:51Z

Added completeness as a separate module

LukasAdamowicz

Overall, this is a good start, but there are a couple of main points that should be addressed:

To match the rest of SKDH, ideally this gets refactored so that the completeness module can be used with any incoming data - and can be used in any pipeline to produce an output file indicating the completeness
This will need unit tests

Minor, but you should also run python-black on these files to ensure consistent formatting

requirements.txt

src/skdh/completeness/__init__.py

src/skdh/completeness/complete.py

src/skdh/completeness/parse.py

src/skdh/completeness/utils.py

LukasAdamowicz · 2024-03-01T12:48:38Z

src/skdh/completeness/utils.py

Also, in general, some of these might need to be broken down into smaller functions so that unit testing is easier. Right now a lot of functions have many code branches (i.e. if statements), which makes it hard to achieve full unit test coverage

Let's discuss this

src/skdh/utility/__init__.py

LukasAdamowicz · 2024-03-18T15:27:14Z

test notification

LukasAdamowicz

Overall, it's looking a bit better. Main comments:

There is a lot of code here that needs comments, both in-line comments explaining what blocks/lines of code are supposed to do and function documentation strings
Functions likely still need to be broken down further so that they don't have so much branching logic, which will make them easier to test. Either that or find ways to write them without if statements
I still want to remove any study-specific information from this
Data loading should be removed and handled with specific functions in DHEAP extensions. The module should also be written in such a way that it can work with default loaded data from single CSVs, etc.
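
As one illustration of the branching point above, a lookup table can sometimes replace an if/elif chain, which leaves a single error path to cover in tests. This is only a sketch: `PERIOD_LENGTHS` and `period_length` are hypothetical names, and apart from `'daily'` (which appears in the test setup in this PR) the accepted values are assumptions, not the module's actual options.

```python
from datetime import timedelta

# Hypothetical mapping; the period names are assumed for illustration only.
PERIOD_LENGTHS = {
    "hourly": timedelta(hours=1),
    "daily": timedelta(days=1),
    "weekly": timedelta(weeks=1),
}


def period_length(name):
    """Return the length of a named period, or raise ValueError if unknown."""
    try:
        return PERIOD_LENGTHS[name]
    except KeyError:
        # One error branch instead of one branch per supported value.
        raise ValueError(
            f"Unknown time period {name!r}; expected one of {sorted(PERIOD_LENGTHS)}"
        ) from None
```

With this shape, a unit test only needs one case per table entry plus one case for the error.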

src/skdh/completeness/__init__.py

src/skdh/completeness/complete.py

LukasAdamowicz · 2024-04-01T16:35:29Z

src/skdh/completeness/complete.py

+            time_periods=None,
+            timescales=None):
+
+        check_hyperparameters_init(


This is a good candidate for a method on the class.

I would also put this after the super() call, so that the signature recorded for the class matches how it was called by the user
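
A rough sketch of that ordering follows. The `BaseProcess` class here is a simplified stand-in, not the real SKDH base class, the argument list is reduced to the two parameters visible in the diff, and the checks inside `_check_hyperparameters` are invented placeholders for whatever `check_hyperparameters_init` actually enforces.

```python
class BaseProcess:
    """Stand-in for the real base class, simplified for this sketch."""

    def __init__(self, **kwargs):
        # Record the arguments exactly as the user passed them.
        self._kw = kwargs


class AssessCompleteness(BaseProcess):
    def __init__(self, time_periods=None, timescales=None):
        # Call super() first so the stored signature reflects the user's call.
        super().__init__(time_periods=time_periods, timescales=timescales)

        # Validation lives on the class rather than in a free function.
        self._check_hyperparameters(time_periods, timescales)

        self.time_periods = time_periods
        self.timescales = timescales

    @staticmethod
    def _check_hyperparameters(time_periods, timescales):
        # Hypothetical checks, for illustration only.
        if time_periods is not None and not isinstance(
            time_periods, (str, list, tuple)
        ):
            raise ValueError("time_periods must be a string, list, tuple, or None")
        if timescales is not None and len(timescales) == 0:
            raise ValueError("timescales must not be empty")
```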

LukasAdamowicz · 2024-04-01T16:37:24Z

src/skdh/completeness/complete.py

+        self.time_periods = time_periods
+        self.timescales = timescales
+
+    def load_subject_data(self, subject_folder, subject, measures):


I really want to get away from loading data in anything other than the io module, since that is where data ingestion is supposed to happen. If we need to write another reader class, we can discuss that. But the new MultiReader class might be able to handle everything that's needed for this module to run

LukasAdamowicz · 2024-04-01T16:38:42Z

src/skdh/completeness/complete.py

+            fname = subject_folder + '/' + measure + '.csv'
+            df_raw = pd.read_csv(fname)
+
+            assert 'Time Unix (ms)' in df_raw.keys(), '"Time Unix (ms)" column is missing from file ' + fname


In general, asserts are really only supposed to be used for testing. Better practice is to raise actual errors that are at least somewhat descriptive of what the error is. For most of these, ValueError would be appropriate
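
For example, the column check above could be written roughly as below. `require_column` is a hypothetical helper name, not something in the PR. One further reason to avoid `assert` for input validation is that Python skips assert statements entirely when run with the `-O` flag.

```python
def require_column(columns, name, source):
    """Raise ValueError if `name` is not among `columns`."""
    if name not in columns:
        raise ValueError(f'"{name}" column is missing from file {source}')


# With a DataFrame this would be called as, e.g.:
# require_column(df_raw.columns, "Time Unix (ms)", fname)
```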

LukasAdamowicz · 2024-04-01T16:58:09Z

src/skdh/completeness/visualizations.py

+                             gap_size_mins=5):
+    """
+    Version of plot overview that plots several data streams for only one device/subject.
+    :param data: list where each element is a df with one column and a time stamp index. Each df will be plotted in one


will fix documentation for all functions and styling in the end when we're happy with the code

src/skdh/completeness/visualizations.py

LukasAdamowicz · 2024-04-01T17:03:37Z

test/completeness/conftest.py

+def completeness_sub_data():
+    cwd = str(Path.cwd())
+    if cwd.split('/')[-1] == "completeness":
+        subject_folder = cwd + '/data/'


cwd is not going to work since these tests can be run from multiple places. Please see other tests that load data (i.e. gait) for how to handle this properly
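
One common way to make the data path independent of the working directory is to resolve it relative to the test file itself. This is a generic sketch and not necessarily the approach the gait tests use; `data_dir` is a hypothetical helper name.

```python
from pathlib import Path


def data_dir(test_file):
    """Return the `data` directory that sits next to `test_file`."""
    return Path(test_file).resolve().parent / "data"


# In conftest.py this would typically be called as data_dir(__file__).
```

Because the path is derived from the file location, it gives the same result whether pytest is started from the repository root or from inside the test folder.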

LukasAdamowicz · 2024-04-01T17:04:00Z

test/completeness/conftest.py

+    data_gaps = np.array([np.timedelta64(10, 'm'), np.timedelta64(30, 'm'), np.timedelta64(1, 'h')])
+    subject = 'test_sub'
+    time_periods = 'daily'
+    pipe = skdh.completeness.AssessCompleteness(ranges, data_gaps, time_periods, timescales)


your actual runs that are being tested need to be in the test_completeness.py file

misread, but setup should be in the actual tests.

LukasAdamowicz · 2024-04-01T17:05:51Z

test/completeness/test_completeness_pipe.py

+
+
+# Test input check on real data
+def test_1_load_data(data_dic):


if using conftest, the arguments need to be the same as the function name, and the value is the value returned
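
A minimal sketch of that naming rule, with the fixture and test shown in one file for brevity (the fixture would normally live in conftest.py). The dictionary returned here is placeholder content, not the real test data.

```python
import pytest


@pytest.fixture
def data_dic():
    # Placeholder content; the real fixture would return the loaded test data.
    return {"test_sub": {"measure": [1, 2, 3]}}


def test_1_load_data(data_dic):
    # pytest matches the argument name `data_dic` to the fixture above and
    # passes in the fixture's return value.
    assert "test_sub" in data_dic
```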


johnsam7 added 2 commits February 5, 2024 13:44

completeness module

315a538

fixes and updated parsing

a1f822d

johnsam7 requested a review from LukasAdamowicz February 29, 2024 16:56

LukasAdamowicz requested changes Mar 1, 2024

View reviewed changes

LukasAdamowicz and others added 4 commits March 20, 2024 08:05

adding second start for implementation of completeness as an option

1950322

refactoring and adding unit tests

d4f8390

refactoring and added unit tests

f73b8fb

merge in updates from main branches

2dd5e75

LukasAdamowicz reviewed Apr 1, 2024

View reviewed changes

johnsam7 added 2 commits May 3, 2024 18:02

updated code based on review, removed some functions that were redund…

3bee5bd

…ant and cleaned up

updated meson.build

f562dd1

LukasAdamowicz changed the base branch from main to development May 9, 2024 19:37

johnsam7 and others added 6 commits July 29, 2024 12:24

fixed bugs in summary_metrics

53132e1

fixed a bug

2dace34

Merge branch 'development' into completeness

6394950

fixed an edge case bug in plot_overview

4f6ef86

updated viz

eaeb6a1

fixed bug in assertion

1a399ec




