Skip to content

Commit

Permalink
update data tab
Browse files Browse the repository at this point in the history
  • Loading branch information
naustica committed Feb 5, 2024
1 parent f3aca08 commit d25ead8
Show file tree
Hide file tree
Showing 4 changed files with 28 additions and 28 deletions.
12 changes: 6 additions & 6 deletions data.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -74,11 +74,11 @@ An overview of our data warehouse including procedures to load the data into Big

| Snapshot | Directory | Table | Schema | Procedure | Last Changed | Coverage | Number of rows |
|------------|---------------|-----------------------|-----------------------------------|-----------|--------------|-----------|----------------------|
| 2023-12-20 | authors/ | openalex.authors | schema_openalex_author.json | [Repo](https://github.com/naustica/openalex) | 04.01.2024 | All | 89.840.202 |
| 2023-12-21 | institutions/ | openalex.institutions | schema_openalex_institutions.json | [Repo](https://github.com/naustica/openalex) | 04.01.2024 | All | 107.538 |
| 2023-12-21 | sources/ | openalex.sources | schema_openalex_sources.json | [Repo](https://github.com/naustica/openalex) | 04.01.2024 | All | 249.407 |
| 2023-12-20 | works/ | openalex.works | schema_openalex_work.json | [Repo](https://github.com/naustica/openalex) | 04.01.2024 | All | 246.880.876 |
| 2023-12-21 | funders/ | openalex.funders | schema_openalex_funders.json | [Repo](https://github.com/naustica/openalex) | 04.01.2024 | All | 32.437 |
| 2023-12-21 | publishers/ | openalex.publishers | schema_openalex_publishers.json | [Repo](https://github.com/naustica/openalex) | 04.01.2024 | All | 10.249 |
| 2024-01-24 | authors/ | openalex.authors | schema_openalex_author.json | [Repo](https://github.com/naustica/openalex) | 02.02.2024 | All | 90.079.708 |
| 2024-01-25 | institutions/ | openalex.institutions | schema_openalex_institutions.json | [Repo](https://github.com/naustica/openalex) | 02.02.2024 | All | 106.631 |
| 2024-01-25 | sources/ | openalex.sources | schema_openalex_sources.json | [Repo](https://github.com/naustica/openalex) | 02.02.2024 | All | 250.686 |
| 2024-01-24 | works/ | openalex.works | schema_openalex_work.json | [Repo](https://github.com/naustica/openalex) | 02.02.2024 | All | 248.072.830 |
| 2024-01-25 | funders/ | openalex.funders | schema_openalex_funders.json | [Repo](https://github.com/naustica/openalex) | 02.02.2024 | All | 32.433 |
| 2024-01-25 | publishers/ | openalex.publishers | schema_openalex_publishers.json | [Repo](https://github.com/naustica/openalex) | 02.02.2024 | All | 10.249 |

:::
34 changes: 17 additions & 17 deletions docs/data.html
Original file line number Diff line number Diff line change
Expand Up @@ -2470,62 +2470,62 @@ <h2 id="status-openalex">Status Openalex</h2>
</thead>
<tbody>
<tr class="odd">
<td>2023-12-20</td>
<td>2024-01-24</td>
<td>authors/</td>
<td>openalex.authors</td>
<td>schema_openalex_author.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.01.2024</td>
<td>02.02.2024</td>
<td>All</td>
<td>89.840.202</td>
<td>90.079.708</td>
</tr>
<tr class="even">
<td>2023-12-21</td>
<td>2024-01-25</td>
<td>institutions/</td>
<td>openalex.institutions</td>
<td>schema_openalex_institutions.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.01.2024</td>
<td>02.02.2024</td>
<td>All</td>
<td>107.538</td>
<td>106.631</td>
</tr>
<tr class="odd">
<td>2023-12-21</td>
<td>2024-01-25</td>
<td>sources/</td>
<td>openalex.sources</td>
<td>schema_openalex_sources.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.01.2024</td>
<td>02.02.2024</td>
<td>All</td>
<td>249.407</td>
<td>250.686</td>
</tr>
<tr class="even">
<td>2023-12-20</td>
<td>2024-01-24</td>
<td>works/</td>
<td>openalex.works</td>
<td>schema_openalex_work.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.01.2024</td>
<td>02.02.2024</td>
<td>All</td>
<td>246.880.876</td>
<td>248.072.830</td>
</tr>
<tr class="odd">
<td>2023-12-21</td>
<td>2024-01-25</td>
<td>funders/</td>
<td>openalex.funders</td>
<td>schema_openalex_funders.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.01.2024</td>
<td>02.02.2024</td>
<td>All</td>
<td>32.437</td>
<td>32.433</td>
</tr>
<tr class="even">
<td>2023-12-21</td>
<td>2024-01-25</td>
<td>publishers/</td>
<td>openalex.publishers</td>
<td>schema_openalex_publishers.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.01.2024</td>
<td>02.02.2024</td>
<td>All</td>
<td>10.249</td>
</tr>
Expand Down
8 changes: 4 additions & 4 deletions docs/search.json
Original file line number Diff line number Diff line change
Expand Up @@ -5,21 +5,21 @@
"title": "About us",
"author": [],
"contents": "\n\nContents\nAbout this Blog!\nJournal publications\nTheses\nSoftware\nThird-party funded projects\n\nAbout this Blog!\nWelcome to our Blog! Here, you’ll find insights from our work as Data Analysts in the domain of scholarly communication. With this blog, we want to engage with the broader community about how to support data-driven workflows and decision-making around scholarly communication with R.\nWe are based at the Göttingen State and University Library, one of the largest academic libraries in Germany. We are using R-based tools in our everyday work and contribute to R package developments and training activities. In this blog, you’ll find news and case-studies around:\nOpen Access and Open Science Analytics\nR Packages making use of open databases and helping us in our work\nR Tools for interactive visualizations and dashboard developments\nR-related training and outreach activities\nWe want to thank Maëlle Salmon for encouraging us to start a blog about our work. As a technical framework for the blog, we are using Distill for R Markdown, a new web publishing format optimized for scientific and technical writing.\nDr. Anne Hobert, Nick Haupka, Najko Jahn\nJournal publications\nWe also publish in scholarly journals about our work.\nFraser, N., Hobert, A., Jahn, N., Mayr, P., & Peters, I. (2023). No deal: German researchers’ publishing and citing behaviors after Big Deal negotiations with Elsevier.\nQuantitative Science Studies, 4(2), 325–352. https://doi.org/10.1162/qss_a_00255\nTaubert, N., Hobert, A., Jahn, N., Bruns, A., & Iravani, E. (2023). Understanding differences of the OA uptake within the German university landscape (2010–2020): part 1—journal-based OA. Scientometrics, 128(6), 3601–3625. https://doi.org/10.1007/s11192-023-04716-3\nHaupka, N., Jahn, N., & Hobert, N. (2022). Praxisbericht Big Scholarly Data an der SUB Göttingen. LIBREAS. Library Ideas, 41 (2022). https://libreas.eu/ausgabe41/haupka/\nJahn, N., Matthias, L., & Laakso, M. (2022). Toward transparency of\nhybrid open access through publisher‐provided metadata: An article‐level\nstudy of Elsevier. Journal of the Association for Information Science\nand Technology, 73(1), 104–118. https://doi.org/10.1002/asi.24549\nJahn, N., Held, M., Walter, H., Haupka, N., & Hillenkötter, K. (2022).\nHOAD: Data Analytics für mehr Transparenz bei\nOpen-Access-Transformationsverträgen. ABI Technik, 42(1), 64–69.\nhttps://doi.org/10.1515/abitech-2022-0007\nStisser, A., Jahn, N., & Schmidt, B. (2022). Stand und Perspektiven bibliometriegestützter Open-Access-Services an Universitäten in Deutschland. Bibliothek Forschung und Praxis, 46(2), 275–283. https://doi.org/10.1515/bfp-2021-0098\nHobert, A., Jahn, N., Mayr, P., Schmidt, B., & Taubert, N. (2021). Open\naccess uptake in Germany 2010–2018: adoption in a diverse research\nlandscape. Scientometrics, 126(12), 9751–9777.\nhttps://doi.org/10.1007/s11192-021-04002-0\nLaakso, M., Matthias, L., & Jahn, N. (2021). Open is not forever: A\nstudy of vanished open access journals. Journal of the Association for\nInformation Science and Technology, 72(9), 1099–1112.\nhttps://doi.org/10.1002/asi.24460 (Featured in Nature, Science and CNN)\nJahn, N., Hobert, A., & Haupka, N. (2021). Entwicklung und Typologie des\nDatendiensts Unpaywall. Bibliothek Forschung und Praxis, 45(2),\n293–303. https://doi.org/10.1515/bfp-2020-0115\nMatthias, L., Jahn, N., & Laakso, M. (2019). The Two-Way Street of Open\nAccess Journal Publishing: Flip It and Reverse It. Publications, 7(2),\n23. https://doi.org/10.3390/publications7020023\nTheses\nHaupka, N. (2021). Analyse der Entwicklung des Open Access-Discovery-Services Unpaywall seit 2018 [Bachelor Thesis, Hochschule Hannover]. https://doi.org/10.25968/opus-1899\nSoftware\nR-Packages (selection):\nJahn, N. europepmc: R Interface to the Europe PubMed Central RESTful Web Service. https://CRAN.R-project.org/package=europepmc | https://docs.ropensci.org/europepmc/\nChamberlain, S., Zhu, H., Jahn, N., Boettiger, C., Ram, K. rcrossref: Client for Various ‘CrossRef’ ‘APIs’. https://CRAN.R-project.org/package=rcrossref https://docs.ropensci.org/rcrossref/\nJahn, N (2022). roadoi: Find Free Versions of Scholarly Publications via Unpaywall. https://CRAN.R-project.org/package=roadoi | https://docs.ropensci.org/roadoi/.\nDashboards (selection):\nHybrid Open Access Dashboard (HOAD). See our blog post: https://www.coalition-s.org/blog/introducing-the-hybrid-open-access-dashboard-hoad/\nmetacheck: Open Access Metadata Compliance Checker\nOpen Access uptake in Germany 2010-2018: Interactive Supplement\nThird-party funded projects\nBMBF\nKompetenznetzwerk Bibliometrie: Komparative Analyse und Kuratierung Deutscher Metadaten in Offenen Bibliometriedaten, Teilprojekt: Bereitstellung und Analyse Dokumenttypen\nKompetenznetzwerk Bibliometrie, Teilprojekt: Datenergänzung: Open-Access-Nachweise\nindi:oa - Verantwortungsbewusste Bewertung und Qualitätssicherung von Open-Access Publikationen mittels bibliometrischer Indikatoren (concluded)\nOAUNI - Entwicklung und Einflussfaktoren des Open-Access-Publizierens an Universitäten in Deutschland (concluded)\nDFG\nHybrid OA Dashboards: Mehrwertorientierte Analytics-Anwendungen zur Förderung der Kostentransparenz bei Transformationsverträgen (concluded)\nEuropean Commission\nOn-Merrit (concluded)\nOpenAIRE Nexus (concluded)\n\n\n\n",
"last_modified": "2024-01-15T10:21:22+01:00"
"last_modified": "2024-02-05T09:56:34+01:00"
},
{
"path": "data.html",
"title": "Open Scholarly Data @ SUB Göttingen - Overview",
"author": [],
"contents": "\n\nContents\nStatus Crossref\nStatus Unpaywall\nStatus Openalex\n\nWe use Google Big Query to work with large open scholarly data. Our main data sources are Unpaywall, Crossref and OpenAlex.\nAn overview of our data warehouse including procedures to load the data into BigQuery can be found below.\nStatus Crossref\nCurrent Snapshot (cr_instant)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2023/12\nall.json.tar.gz\ncr_instant.snapshot\nschema_crossref.json\nRepo\n15.01.2024\n2013-2023\n45.680.245\n\nHistorical Snapshots (cr_history)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2018/04\nall.json.tar.gz\ncr_history.cr_apr18\nschema_crossref.json\nRepo\n20.02.2022\n2013-2018\n16.766.035\n2019/04\nall.json.tar.gz\ncr_history.cr_apr19\nschema_crossref.json\nRepo\n29.10.2021\n2013-2019\n20.715.644\n2020/04\nall.json.tar.gz\ncr_history.cr_apr20\nschema_crossref.json\nRepo\n29.10.2021\n2013-2020\n25.334.525\n2021/04\nall.json.tar.gz\ncr_history.cr_apr21\nschema_crossref.json\nRepo\n29.10.2021\n2013-2021\n30.579.119\n2022/04\nall.json.tar.gz\ncr_history.cr_apr22\nschema_crossref.json\nRepo\n14.05.2022\n2013-2022\n35.939.195\n2023/04\nall.json.tar.gz\ncr_history.cr_apr23\nschema_crossref.json\nRepo\n07.05.2023\n2013-2023\n41.767.461\n\nStatus Unpaywall\nCurrent Snapshot (upw_instant)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2022/03\nunpaywall_snapshot_2022-03-09T083001.jsonl.gz\nupw_instant.snapshot\nbq_schema_mar22.json\nRepo\n14.03.2022\n2008-2022\n67.424.819\n\nHistorical Snapshots (upw_history)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2018/03\nunpaywall_snapshot_2018-03-29T113154.jsonl.gz\nupw_history.upw_Mar18_08_20\nbq_schema_mar18.json\nRepo\n29.10.2021\n2008-2018\n36.557.043\n2019/02\nunpaywall_snapshot_2019-02-21T031509.jsonl.gz\nupw_history.upw_Feb19_08_19\nbq_schema_feb19.json\nRepo\n10.11.2021\n2008-2019\n42.143.979\n2020/02\nunpaywall_snapshot_2020-02-25T115244.jsonl.gz\nupw_history.upw_Feb20_08_20\nbq_schema_feb20.json\nRepo\n30.10.2021\n2008-2020\n49.717.710\n2021/02\nunpaywall_snapshot_2021-02-18T160139.jsonl.gz\nupw_history.upw_Feb21_08_21\nbq_schema_feb21.json\nRepo\n29.10.2021\n2008-2021\n58.437.927\n2022/03\nunpaywall_snapshot_2022-03-09T083001.jsonl.gz\nupw_history.upw_Mar22_08_22\nbq_schema_mar22.json\nRepo\n14.03.2022\n2008-2022\n67.424.819\n\nStatus Openalex\n\nSnapshot\nDirectory\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2023-12-20\nauthors/\nopenalex.authors\nschema_openalex_author.json\nRepo\n04.01.2024\nAll\n89.840.202\n2023-12-21\ninstitutions/\nopenalex.institutions\nschema_openalex_institutions.json\nRepo\n04.01.2024\nAll\n107.538\n2023-12-21\nsources/\nopenalex.sources\nschema_openalex_sources.json\nRepo\n04.01.2024\nAll\n249.407\n2023-12-20\nworks/\nopenalex.works\nschema_openalex_work.json\nRepo\n04.01.2024\nAll\n246.880.876\n2023-12-21\nfunders/\nopenalex.funders\nschema_openalex_funders.json\nRepo\n04.01.2024\nAll\n32.437\n2023-12-21\npublishers/\nopenalex.publishers\nschema_openalex_publishers.json\nRepo\n04.01.2024\nAll\n10.249\n\n\n\n\n",
"last_modified": "2024-01-15T10:21:22+01:00"
"contents": "\n\nContents\nStatus Crossref\nStatus Unpaywall\nStatus Openalex\n\nWe use Google Big Query to work with large open scholarly data. Our main data sources are Unpaywall, Crossref and OpenAlex.\nAn overview of our data warehouse including procedures to load the data into BigQuery can be found below.\nStatus Crossref\nCurrent Snapshot (cr_instant)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2023/12\nall.json.tar.gz\ncr_instant.snapshot\nschema_crossref.json\nRepo\n15.01.2024\n2013-2023\n45.680.245\n\nHistorical Snapshots (cr_history)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2018/04\nall.json.tar.gz\ncr_history.cr_apr18\nschema_crossref.json\nRepo\n20.02.2022\n2013-2018\n16.766.035\n2019/04\nall.json.tar.gz\ncr_history.cr_apr19\nschema_crossref.json\nRepo\n29.10.2021\n2013-2019\n20.715.644\n2020/04\nall.json.tar.gz\ncr_history.cr_apr20\nschema_crossref.json\nRepo\n29.10.2021\n2013-2020\n25.334.525\n2021/04\nall.json.tar.gz\ncr_history.cr_apr21\nschema_crossref.json\nRepo\n29.10.2021\n2013-2021\n30.579.119\n2022/04\nall.json.tar.gz\ncr_history.cr_apr22\nschema_crossref.json\nRepo\n14.05.2022\n2013-2022\n35.939.195\n2023/04\nall.json.tar.gz\ncr_history.cr_apr23\nschema_crossref.json\nRepo\n07.05.2023\n2013-2023\n41.767.461\n\nStatus Unpaywall\nCurrent Snapshot (upw_instant)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2022/03\nunpaywall_snapshot_2022-03-09T083001.jsonl.gz\nupw_instant.snapshot\nbq_schema_mar22.json\nRepo\n14.03.2022\n2008-2022\n67.424.819\n\nHistorical Snapshots (upw_history)\n\nSnapshot\nFile\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2018/03\nunpaywall_snapshot_2018-03-29T113154.jsonl.gz\nupw_history.upw_Mar18_08_20\nbq_schema_mar18.json\nRepo\n29.10.2021\n2008-2018\n36.557.043\n2019/02\nunpaywall_snapshot_2019-02-21T031509.jsonl.gz\nupw_history.upw_Feb19_08_19\nbq_schema_feb19.json\nRepo\n10.11.2021\n2008-2019\n42.143.979\n2020/02\nunpaywall_snapshot_2020-02-25T115244.jsonl.gz\nupw_history.upw_Feb20_08_20\nbq_schema_feb20.json\nRepo\n30.10.2021\n2008-2020\n49.717.710\n2021/02\nunpaywall_snapshot_2021-02-18T160139.jsonl.gz\nupw_history.upw_Feb21_08_21\nbq_schema_feb21.json\nRepo\n29.10.2021\n2008-2021\n58.437.927\n2022/03\nunpaywall_snapshot_2022-03-09T083001.jsonl.gz\nupw_history.upw_Mar22_08_22\nbq_schema_mar22.json\nRepo\n14.03.2022\n2008-2022\n67.424.819\n\nStatus Openalex\n\nSnapshot\nDirectory\nTable\nSchema\nProcedure\nLast Changed\nCoverage\nNumber of rows\n2024-01-24\nauthors/\nopenalex.authors\nschema_openalex_author.json\nRepo\n02.02.2024\nAll\n90.079.708\n2024-01-25\ninstitutions/\nopenalex.institutions\nschema_openalex_institutions.json\nRepo\n02.02.2024\nAll\n106.631\n2024-01-25\nsources/\nopenalex.sources\nschema_openalex_sources.json\nRepo\n02.02.2024\nAll\n250.686\n2024-01-24\nworks/\nopenalex.works\nschema_openalex_work.json\nRepo\n02.02.2024\nAll\n248.072.830\n2024-01-25\nfunders/\nopenalex.funders\nschema_openalex_funders.json\nRepo\n02.02.2024\nAll\n32.433\n2024-01-25\npublishers/\nopenalex.publishers\nschema_openalex_publishers.json\nRepo\n02.02.2024\nAll\n10.249\n\n\n\n\n",
"last_modified": "2024-02-05T09:56:35+01:00"
},
{
"path": "index.html",
"title": "Blog | Scholarly Communication Analytics with R",
"author": [],
"contents": "\n\n\n\n",
"last_modified": "2024-01-15T10:21:23+01:00"
"last_modified": "2024-02-05T09:56:35+01:00"
}
],
"collections": ["posts/posts.json"]
Expand Down
2 changes: 1 addition & 1 deletion docs/sitemap.xml
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@
</url>
<url>
<loc>https://subugoe.github.io/scholcomm_analytics/data.html</loc>
<lastmod>2024-01-15T10:21:04+01:00</lastmod>
<lastmod>2024-02-05T09:56:27+01:00</lastmod>
</url>
<url>
<loc>https://subugoe.github.io/scholcomm_analytics/</loc>
Expand Down

0 comments on commit d25ead8

Please sign in to comment.