Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Revisit how to trim years/strata for Gulf of Mexico #124

Open
mpinsky opened this issue Apr 10, 2020 · 4 comments
Open

Revisit how to trim years/strata for Gulf of Mexico #124

mpinsky opened this issue Apr 10, 2020 · 4 comments

Comments

@mpinsky
Copy link
Collaborator

mpinsky commented Apr 10, 2020

We currently trim to strata sampled at least 10 times between 2008 and 2018, and we only use data from >2008. This ends up losing a lot of data (about 25 years).

Can we instead trim to strata sampled consistently through time?

@df511
Copy link
Contributor

df511 commented Mar 30, 2021

If we trim from 2008 onward and to strata with more than 10 years coverage, we lose ~6% of the data (plot 1, top). If we only trim by strata surveyed for more than 15 years, leaving pre-2008 data in, we lose ~18% of data and coverage is still fairly sparse (plot 2, bottom).

image
image

@df511
Copy link
Contributor

df511 commented Apr 6, 2021

@mpinsky I was just re-running the full script to make sure there are no lingering bugs, and found that the code used to trim the Gulf of Mexico data and generate the above plot no longer produces the same plot. I must not have been overwriting the lower plots, because all of the percentages removed are consistent with the old plot (20.3% of data removed), but this makes more sense in looking at the original data. What do you think about removing the whole eastern half of the Gulf of Mexico data? Is it still worth it? I would likely still have to trim more to get more consistent temporal coverage. Showing this plot for consistency, but I'll drop a plot with better temporal coverage below.
image

@df511
Copy link
Contributor

df511 commented Apr 6, 2021

Here's a better filter. We lose 23% of the original data, but get decent temporal coverage. @mpinsky
image

@mpinsky
Copy link
Collaborator Author

mpinsky commented Apr 6, 2021 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants