fix: make variable naming consistent with ISA standard #13

sellth · 2023-06-07T13:06:54Z

Nicolai-vKuegelgen

While I agree with harmonising this overal, I'm not sure that a blanket change to all variable names is the right approach here:

for one thing, the stem cell core templates actually ask for sample names, since we use the cellline names as source
From what I've seen so far many templates use the same name for sample & source. While this is of course not ideal/desribale, it is often not quite avoidable without a specific experimental design (that the cookiecutter templates can't really build upon since the have to be very general). Therefore the cookiecutter json may conceptually only ask for a single 'name' for a sample and then use that for both source and sample name. For most experimental people (or just people not accustomed to ISA) sample name is much more intuitive description than source name in these cases.

...es/isatab-stem_cell_core_sc/{{cookiecutter.__output_dir}}/s_{{cookiecutter.s_file_name}}.txt

cubi_isa_templates/isatab-stem_cell_core_sc/cookiecutter.json

...{cookiecutter.__output_dir}}/a_{{cookiecutter.assay_prefix}}_{{cookiecutter.assay_name}}.txt

Nicolai-vKuegelgen · 2023-06-15T07:56:27Z

Another thing (I almost forgot):

This only seems to the address the variable names, however many templates also use Sample Name as the first column in the s-file instead of Source Name, so maybe this should/needs to be changed as well? Otherwise the variable renaming makes little sense.
If we do change this I'm not sure what the effects on pipelines that might depend in these templates will be.

sellth · 2023-06-15T08:43:37Z

Thanks Nicolai for looking into this.

for one thing, the stem cell core templates actually ask for sample names, since we use the cellline names as source

That was indeed an oversight in the stem_cell_core_sc template which is fixed now. I left _bulk unchanged because of this.

From what I've seen so far many templates use the same name for sample & source. […] For most experimental people (or just people not accustomed to ISA) sample name is much more intuitive description than source name in these cases.

Most templates derive their Source Names from the Sample Names, but I would agree with Mikko that this is a bit confusing in the context of ISA-tabs and also experimentally. I would expect Sample Names to be derived from the Source Names plus a suffix (optionally). That is how I defined it in for the MC template, there is source_names and sample_suffix in the cookiecutter.json.

This only seems to the address the variable names, however many templates also use Sample Name as the first column in the s-file instead of Source Name, so maybe this should/needs to be changed as well? Otherwise the variable renaming makes little sense.

Not sure what you mean by this. s_ files need to start with a Source Name column to be standard compliant and all do so right now.

Nicolai-vKuegelgen · 2023-06-15T14:33:04Z

Most templates derive their Source Names from the Sample Names, but I would agree with Mikko that this is a bit confusing in the context of ISA-tabs and also experimentally. I would expect Sample Names to be derived from the Source Names plus a suffix (optionally). That is how I defined it in for the MC template, there is source_names and sample_suffix in the cookiecutter.json.

The templates might do indeed do this, but I would argue that most users generally do not, since they only come up with source names when they start entering things into sodar (they will always have some sort of sample name ready).
Maybe the more important questions to answer for is: who will use these templates or rather who do we want to use them?
For larger projects (with inevitably closer cubi collaboration), someone will probably figure out a good way to organise and derive sample and source names. But smaller projects that - maybe one day? - some can just create & fill the samplehseet from within sodar this is not the case, and these people likely will come with a list of samples and names, but not source names.

Not sure what you mean by this. s_ files need to start with a Source Name column to be standard compliant and all do so right now.

Ah you're right I must have confused some (older?) things here or maybe I just remebered the start of some a-files ...

Nicolai-vKuegelgen

Code wise the changes all look good to me.

I have some concerns about the idea/purpose of this change (see my other comment), but that's not easily addressed either way.

sellth · 2023-06-20T09:11:03Z

As this is not really urgent, let's not do anything hastily and talk once I'm back in Berlin.

fix: make variable naming consistent with ISA standard

6291386

sellth requested review from mikkonie and Nicolai-vKuegelgen June 7, 2023 13:06

Nicolai-vKuegelgen requested changes Jun 15, 2023

View reviewed changes

revert changes to stem_cell_core_sc template

9272559

sellth requested a review from Nicolai-vKuegelgen June 15, 2023 08:51

Nicolai-vKuegelgen approved these changes Jun 15, 2023

View reviewed changes

sellth mentioned this pull request Jul 21, 2023

Add support for cookiecutter prompts in sheet templates bihealth/sodar-server#1726

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: make variable naming consistent with ISA standard #13

fix: make variable naming consistent with ISA standard #13

sellth commented Jun 7, 2023

Nicolai-vKuegelgen left a comment

Nicolai-vKuegelgen commented Jun 15, 2023

sellth commented Jun 15, 2023 •

edited

Loading

Nicolai-vKuegelgen commented Jun 15, 2023

Nicolai-vKuegelgen left a comment

sellth commented Jun 20, 2023

fix: make variable naming consistent with ISA standard #13

Are you sure you want to change the base?

fix: make variable naming consistent with ISA standard #13

Conversation

sellth commented Jun 7, 2023

Nicolai-vKuegelgen left a comment

Choose a reason for hiding this comment

Nicolai-vKuegelgen commented Jun 15, 2023

sellth commented Jun 15, 2023 • edited Loading

Nicolai-vKuegelgen commented Jun 15, 2023

Nicolai-vKuegelgen left a comment

Choose a reason for hiding this comment

sellth commented Jun 20, 2023

sellth commented Jun 15, 2023 •

edited

Loading