Model with no population split? #94

Open
kristinaleilani opened this issue Jul 11, 2024 · 3 comments

Comments

@kristinaleilani

Hello,
Does GADMA consider models with no population split?

I am comparing two fish populations from distant locations that show little genetic differentiation, and I'm not actually sure whether they are divergent populations. When I run GADMA, the best model shows a recent split with migration. I've tried running with only Initial structure [1,1], and with both Initial structure [1,1] and Final structure [2,1], and both outputs show a recent split with migration. I'm wondering whether a model with no split was considered. Or is my best model better than one with no split?

Thanks in advance!

@noscode
Collaborator

noscode commented Jul 15, 2024

Hi @kristinaleilani,

That is a very good question. By default, GADMA does not consider a history without a population split for two populations. However, you can easily do it yourself: remake your data as data for one population and run demographic inference on it. If you have VCF data, remake the popmap file with population labels so that all samples belong to one population. If you have an SFS file generated for dadi, you can call the marginalize([0]) function (see the dadi manual). Once the data is ready, you can run GADMA inference for one population. Where you used the [1, 1] and [2, 1] structures for two populations, you can use the [1], [2] and [3] structures for one population.
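The popmap step described above can be sketched in a few lines of Python. This is only an illustration: it assumes the popmap is a plain text file with one `sample<TAB>population` pair per line, and the function name `relabel_popmap`, the label `merged`, and the sample names are hypothetical, not part of GADMA.

```python
def relabel_popmap(lines, label="merged"):
    """Return popmap lines with every sample assigned to the same population label."""
    relabeled = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        sample = fields[0]  # first column is assumed to be the sample name
        relabeled.append(f"{sample}\t{label}")
    return relabeled


# Hypothetical two-population popmap content, for illustration only.
original = ["fish01\tnorth", "fish02\tnorth", "fish03\tsouth"]
merged = relabel_popmap(original)
print("\n".join(merged))
```

Writing the returned lines to a new file and pointing GADMA at it instead of the original popmap would then treat all samples as a single population.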

However, it is unclear to me how best to determine which model is better, because you will basically have two models fitted to two different datasets (1D and 2D). You say that the 2D model has a very recent split event, and I agree that this indicates the two populations are probably one population. Unfortunately, I do not know of any statistical way to choose between 1D and 2D models. You can use neither likelihood nor AIC values to compare them, since they are computed on different data.
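To make the restriction concrete, here is a minimal sketch of an AIC comparison that refuses to compare fits obtained on different datasets. It uses the standard definition AIC = 2k - 2 ln L, where k is the number of free parameters and ln L is the maximized log-likelihood. The `Fit` class, the dataset labels, and all numbers are hypothetical placeholders, not GADMA output.

```python
from dataclasses import dataclass


@dataclass
class Fit:
    dataset: str           # label of the data the model was fitted to (hypothetical)
    log_likelihood: float  # maximized log-likelihood
    n_params: int          # number of free parameters


def aic(fit):
    """Akaike information criterion: 2k - 2 ln L."""
    return 2 * fit.n_params - 2 * fit.log_likelihood


def delta_aic(a, b):
    """AIC(a) - AIC(b); only meaningful when both fits use the same data."""
    if a.dataset != b.dataset:
        raise ValueError("AIC values from different datasets are not comparable")
    return aic(a) - aic(b)


# Placeholder values, for illustration only.
fit_2d = Fit(dataset="2D SFS", log_likelihood=-100.0, n_params=3)
fit_1d = Fit(dataset="1D SFS", log_likelihood=-40.0, n_params=2)

try:
    delta_aic(fit_2d, fit_1d)
except ValueError as err:
    print(err)
```

A 1D and a 2D spectrum have different numbers of entries, so their likelihoods are not on a common scale, which is why the guard raises here.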

Best regards,
Ekaterina

@z0on

z0on commented Jul 15, 2024 via email

@noscode
Collaborator

noscode commented Jul 15, 2024

Hi Misha,

Nice to hear from you. Hmm, I see, you want the epoch after the split to have zero duration, right? This makes sense to me. I think that right now the only way to achieve this is to use a custom model. I guess you can still perform an automatic inference for one population using structures, and then add an additional split to the model code for the AIC comparison.
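A split followed by an epoch of zero duration could look roughly like the dadi sketch below. It is an illustration under assumptions, not code generated by GADMA: the `model_func(params, ns, pts)` signature is assumed to be what a custom dadi model file expects, and the two parameters `nu` and `T` are hypothetical stand-ins for whatever the inferred one-population history contains.

```python
def model_func(params, ns, pts):
    """One-population history followed by a split with no time after it."""
    # Imported inside the function so the file can be loaded without dadi installed.
    import dadi

    # Hypothetical parameters: relative population size and duration of one epoch.
    nu, T = params

    xx = dadi.Numerics.default_grid(pts)
    # Equilibrium ancestral population.
    phi = dadi.PhiManip.phi_1D(xx)
    # Single-population epoch of size nu lasting time T.
    phi = dadi.Integration.one_pop(phi, xx, T, nu=nu)
    # Split into two populations; no integration follows, so the
    # epoch after the split has zero duration.
    phi = dadi.PhiManip.phi_1D_to_2D(xx, phi)
    # Two-dimensional spectrum for the sample sizes in ns.
    return dadi.Spectrum.from_phi(phi, ns, (xx, xx))
```

Because such a model is fitted to the same 2D spectrum as the split-with-migration model, its likelihood and AIC are computed on the same data and can be compared directly.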

I can think about allowing GADMA to infer an [X, 0] structure (which does the same thing), but I am not sure how difficult it would be or how much time it would take to implement.

Best regards,
Ekaterina
