update data tab
naustica committed Jan 10, 2025
1 parent 871fc26 commit 84d9d48
Showing 17 changed files with 1,109 additions and 1,104 deletions.
18 changes: 9 additions & 9 deletions data.Rmd
@@ -25,7 +25,7 @@ Anyone can view and query our publicly available [Open Scholarly Data warehouse

| Snapshot | File | Table | Schema | Procedure | Last Changed | Coverage | Number of rows |
|-----------------|-----------------|---------------------|----------------------|-----------|--------------|-----------|--------------------|
| 2024/11 | all.json.tar.gz | [cr_instant.snapshot](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2scr_instant) | schema_crossref.json | [Repo](https://github.com/naustica/crossref_bq) | 09.12.2024 | 2013-2024 | 51.488.082 |
| 2024/12 | all.json.tar.gz | [cr_instant.snapshot](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2scr_instant) | schema_crossref.json | [Repo](https://github.com/naustica/crossref_bq) | 08.01.2025 | 2013-2024 | 52.114.759 |

:::

@@ -88,13 +88,13 @@ Anyone can view and query our publicly available [Open Scholarly Data warehouse

| Snapshot | Directory | Table | Schema | Procedure | Last Changed | Coverage | Number of rows |
|------------|---------------|-----------------------|-----------------------------------|-----------|--------------|-----------|----------------------|
| 2024-11-25 | authors/ | [openalex.authors](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_author.json | [Repo](https://github.com/naustica/openalex) | 04.12.2024 | All | 101.310.502 |
| 2024-11-25 | funders/ | [openalex.funders](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_funders.json | [Repo](https://github.com/naustica/openalex) | 04.12.2024 | All | 32.437 |
| 2024-11-25 | institutions/ | [openalex.institutions](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_institutions.json | [Repo](https://github.com/naustica/openalex) | 04.12.2024 | All | 110.138 |
| 2024-11-25 | publishers/ | [openalex.publishers](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_publishers.json | [Repo](https://github.com/naustica/openalex) | 04.12.2024 | All | 10.376 |
| 2024-11-25 | sources/ | [openalex.sources](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_sources.json | [Repo](https://github.com/naustica/openalex) | 04.12.2024 | All | 258.125 |
| 2024-11-25 | topics/ | [openalex.topics](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_topics.json | [Repo](https://github.com/naustica/openalex) | 04.12.2024 | All | 4.516 |
| 2024-11-25 | works/ | [openalex.works](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_work.json | [Repo](https://github.com/naustica/openalex) | 04.12.2024 | All | 261.381.162 |
| 2024-12-31 | authors/ | [openalex.authors](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_author.json | [Repo](https://github.com/naustica/openalex) | 07.01.2025 | All | 101.693.809 |
| 2025-01-01 | funders/ | [openalex.funders](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_funders.json | [Repo](https://github.com/naustica/openalex) | 07.01.2025 | All | 32.437 |
| 2025-01-01 | institutions/ | [openalex.institutions](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_institutions.json | [Repo](https://github.com/naustica/openalex) | 07.01.2025 | All | 110.553 |
| 2025-01-01 | publishers/ | [openalex.publishers](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_publishers.json | [Repo](https://github.com/naustica/openalex) | 07.01.2025 | All | 10.741 |
| 2025-01-01 | sources/ | [openalex.sources](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_sources.json | [Repo](https://github.com/naustica/openalex) | 07.01.2025 | All | 260.811 |
| 2024-12-30 | topics/ | [openalex.topics](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_topics.json | [Repo](https://github.com/naustica/openalex) | 07.01.2025 | All | 4.516 |
| 2024-12-31 | works/ | [openalex.works](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex) | schema_openalex_work.json | [Repo](https://github.com/naustica/openalex) | 07.01.2025 | All | 262.630.159 |

:::

@@ -104,6 +104,6 @@ Anyone can view and query our publicly available [Open Scholarly Data warehouse

| Snapshot | Directory | Table | Schema | Procedure | Last Changed | Coverage | Number of rows |
|------------|--------------|----------------------|----------------------|-----------|--------------|-----------|-----------------|
| 2024-11-25 | works/ | [resources.classification_article_reviews_november_2024](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sresources) | schema_document_types.json | [Repo](https://github.com/naustica/openalex_doctypes/tree/classifier/classifier) | 04.12.2024 | All | 152.162.123 |
| 2024-12-31 | works/ | [resources.classification_article_reviews_december24](https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sresources) | schema_document_types.json | [Repo](https://github.com/naustica/openalex_doctypes/tree/classifier/classifier) | 10.01.2025 | 2014-2024 | 58.240.262 |

:::
29 changes: 16 additions & 13 deletions docs/about.html
@@ -20,8 +20,9 @@

<style type="text/css">code{white-space: pre;}</style>
<style type="text/css" data-origin="pandoc">
html { -webkit-text-size-adjust: 100%; }
pre > code.sourceCode { white-space: pre; position: relative; }
pre > code.sourceCode > span { line-height: 1.25; }
pre > code.sourceCode > span { display: inline-block; line-height: 1.25; }
pre > code.sourceCode > span:empty { height: 1.2em; }
.sourceCode { overflow: visible; }
code.sourceCode > span { color: inherit; text-decoration: inherit; }
@@ -32,7 +33,7 @@
}
@media print {
pre > code.sourceCode { white-space: pre-wrap; }
pre > code.sourceCode > span { display: inline-block; text-indent: -5em; padding-left: 5em; }
pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
}
pre.numberSource code
{ counter-reset: source-line 0; }
@@ -2188,25 +2189,29 @@ <h1>About us</h1>
<h3>Contents</h3>
<ul>
<li><a href="#about-this-blog" id="toc-about-this-blog">About this Blog!</a></li>
<li><a href="#journal-publications" id="toc-journal-publications">Journal publications</a></li>
<li><a href="#preprints" id="toc-preprints">Preprints</a></li>
<li><a href="#journal-publications" id="toc-journal-publications">Journal publications</a></li>
<li><a href="#theses" id="toc-theses">Theses</a></li>
<li><a href="#software" id="toc-software">Software</a></li>
<li><a href="#third-party-funded-projects" id="toc-third-party-funded-projects">Third-party funded projects</a></li>
</ul>
</nav>
</div>
<h2 id="about-this-blog">About this Blog!</h2>
<p>Welcome to our Blog! Here, you’ll find insights from our work as Data Analysts in the domain of scholarly communication. With this blog, we want to engage with the broader community about how to support data-driven workflows and decision-making around scholarly communication with R.</p>
<p>We are based at the <a href="https://www.sub.uni-goettingen.de/en/news/">Göttingen State and University Library</a>, one of the largest academic libraries in Germany. We are using R-based tools in our everyday work and contribute to R package developments and training activities. In this blog, you’ll find news and case-studies around:</p>
<p>Welcome to our Blog! Here, you’ll find insights from our work as Data Analysts in the domain of scholarly communication. With this blog, we want to engage with the broader community about how to support data-driven workflows and decision-making around scholarly communication.</p>
<p>We are based at the <a href="https://www.sub.uni-goettingen.de/en/news/">Göttingen State and University Library</a>, one of the largest academic libraries in Germany. We are using various data analytics tools in our everyday work and contribute to R and Python package developments and training activities. In this blog, you’ll find news and case-studies around:</p>
<ul>
<li>Open Access and Open Science Analytics</li>
<li>R Packages making use of open databases and helping us in our work</li>
<li>R Tools for interactive visualizations and dashboard developments</li>
<li>R-related training and outreach activities</li>
<li>Packages making use of open databases and helping us in our work</li>
<li>Tools for interactive visualizations and dashboard developments</li>
<li>Training and outreach activities</li>
</ul>
<p>We want to thank <a href="https://masalmon.eu/">Maëlle Salmon</a> for encouraging us to start a blog about our work. As a technical framework for the blog, we are using <a href="https://rstudio.github.io/distill/">Distill for R Markdown</a>, a new web publishing format optimized for scientific and technical writing.</p>
<p><em>Dr. Anne Hobert</em>, <em>Nick Haupka</em>, <em>Sophia Dörner</em>, <em>Najko Jahn</em></p>
<h2 id="preprints">Preprints</h2>
<p>Haupka, N., Culbert, J., Schniedermann, A., Jahn, N., Mayr, P. (2024). Analysis of the Publication and Document Types in OpenAlex, Web of Science, Scopus, Pubmed and Semantic Scholar. <a href="https://arxiv.org/abs/2406.15154" class="uri">https://arxiv.org/abs/2406.15154</a></p>
<p>Jahn, N. (2024). How open are hybrid journals included in transformative agreements? <a href="https://arxiv.org/abs/2402.18255" class="uri">https://arxiv.org/abs/2402.18255</a></p>
<p>Culbert, J., Hobert, A., Jahn, N., Haupka, N., Schmidt, M., Donner, P., Mayr, P. (2024). Reference Coverage Analysis of OpenAlex compared to Web of Science and Scopus. <a href="https://arxiv.org/abs/2401.16359" class="uri">https://arxiv.org/abs/2401.16359</a></p>
<h2 id="journal-publications">Journal publications</h2>
<p>We also publish in scholarly journals about our work.</p>
<p>Haupka, N. (2024). Analyse der Abdeckung wissenschaftlicher Publikationen auf Semantic Scholar im Kontext von Open Access. <em>Bibliothek Forschung und Praxis</em>, 48(2), 362–373. <a href="https://doi.org/10.1515/bfp-2023-0057" class="uri">https://doi.org/10.1515/bfp-2023-0057</a></p>
@@ -2231,8 +2236,10 @@ <h2 id="journal-publications">Journal publications</h2>
<p>Laakso, M., Matthias, L., &amp; Jahn, N. (2021). Open is not forever: A
study of vanished open access journals. <em>Journal of the Association for
Information Science and Technology</em>, 72(9), 1099–1112.
<a href="https://doi.org/10.1002/asi.24460" class="uri">https://doi.org/10.1002/asi.24460</a> (<a href="https://www.asist.org/2022/06/15/2022-best-jasist-paper-award/">JASIST Best Paper Award 2022</a>. Featured in <a href="https://www.nature.com/articles/d41586-020-02610-z">Nature</a>,
<a href="https://doi.org/10.1002/asi.24460" class="uri">https://doi.org/10.1002/asi.24460</a> (<a href="https://www.asist.org/2022/06/15/2022-best-jasist-paper-award/">JASIST Best Paper Award 2022</a>. Featured in
<a href="https://www.nature.com/articles/d41586-020-02610-z">Nature</a>,
<a href="https://www.nature.com/articles/d41586-024-00616-5">Nature</a>,
<a href="https://doi.org/10.1038/d41586-024-03842-z">Nature</a>,
<a href="https://doi.org/10.1126/science.abe6998">Science</a>, <a href="https://edition.cnn.com/2020/09/15/us/vanished-open-access-journals-trnd-scn/index.html">CNN</a>,
<a href="https://www.deutschlandfunknova.de/nachrichten/wissenschaft-millionen-arbeiten-sind-nicht-richtig-archiviert">DLF</a>)</p>
<p>Jahn, N., Hobert, A., &amp; Haupka, N. (2021). Entwicklung und Typologie des
@@ -2241,10 +2248,6 @@ <h2 id="journal-publications">Journal publications</h2>
<p>Matthias, L., Jahn, N., &amp; Laakso, M. (2019). The Two-Way Street of Open
Access Journal Publishing: Flip It and Reverse It. <em>Publications</em>, 7(2),
23. <a href="https://doi.org/10.3390/publications7020023" class="uri">https://doi.org/10.3390/publications7020023</a></p>
<h2 id="preprints">Preprints</h2>
<p>Haupka, N., Culbert, J., Schniedermann, A., Jahn, N., Mayr, P. (2024). Analysis of the Publication and Document Types in OpenAlex, Web of Science, Scopus, Pubmed and Semantic Scholar. <a href="https://arxiv.org/abs/2406.15154" class="uri">https://arxiv.org/abs/2406.15154</a></p>
<p>Jahn, N. (2024). How open are hybrid journals included in transformative agreements? <a href="https://arxiv.org/abs/2402.18255" class="uri">https://arxiv.org/abs/2402.18255</a></p>
<p>Culbert, J., Hobert, A., Jahn, N., Haupka, N., Schmidt, M., Donner, P., Mayr, P. (2024). Reference Coverage Analysis of OpenAlex compared to Web of Science and Scopus. <a href="https://arxiv.org/abs/2401.16359" class="uri">https://arxiv.org/abs/2401.16359</a></p>
<h2 id="theses">Theses</h2>
<p>Haupka, N. (2021). Analyse der Entwicklung des Open Access-Discovery-Services Unpaywall seit 2018 [Bachelor Thesis, Hochschule Hannover]. <a href="https://doi.org/10.25968/opus-1899" class="uri">https://doi.org/10.25968/opus-1899</a></p>
<h2 id="software">Software</h2>
59 changes: 30 additions & 29 deletions docs/data.html
@@ -20,8 +20,9 @@

<style type="text/css">code{white-space: pre;}</style>
<style type="text/css" data-origin="pandoc">
html { -webkit-text-size-adjust: 100%; }
pre > code.sourceCode { white-space: pre; position: relative; }
pre > code.sourceCode > span { line-height: 1.25; }
pre > code.sourceCode > span { display: inline-block; line-height: 1.25; }
pre > code.sourceCode > span:empty { height: 1.2em; }
.sourceCode { overflow: visible; }
code.sourceCode > span { color: inherit; text-decoration: inherit; }
@@ -32,7 +33,7 @@
}
@media print {
pre > code.sourceCode { white-space: pre-wrap; }
pre > code.sourceCode > span { display: inline-block; text-indent: -5em; padding-left: 5em; }
pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
}
pre.numberSource code
{ counter-reset: source-line 0; }
@@ -2226,14 +2227,14 @@ <h3 id="current-snapshot-cr_instant">Current Snapshot (cr_instant)</h3>
</thead>
<tbody>
<tr class="odd">
<td>2024/11</td>
<td>2024/12</td>
<td>all.json.tar.gz</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2scr_instant">cr_instant.snapshot</a></td>
<td>schema_crossref.json</td>
<td><a href="https://github.com/naustica/crossref_bq">Repo</a></td>
<td>09.12.2024</td>
<td>08.01.2025</td>
<td>2013-2024</td>
<td>51.488.082</td>
<td>52.114.759</td>
</tr>
</tbody>
</table>
@@ -2532,74 +2533,74 @@ <h2 id="status-openalex">Status Openalex</h2>
</thead>
<tbody>
<tr class="odd">
<td>2024-11-25</td>
<td>2024-12-31</td>
<td>authors/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex">openalex.authors</a></td>
<td>schema_openalex_author.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.12.2024</td>
<td>07.01.2025</td>
<td>All</td>
<td>101.310.502</td>
<td>101.693.809</td>
</tr>
<tr class="even">
<td>2024-11-25</td>
<td>2025-01-01</td>
<td>funders/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex">openalex.funders</a></td>
<td>schema_openalex_funders.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.12.2024</td>
<td>07.01.2025</td>
<td>All</td>
<td>32.437</td>
</tr>
<tr class="odd">
<td>2024-11-25</td>
<td>2025-01-01</td>
<td>institutions/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex">openalex.institutions</a></td>
<td>schema_openalex_institutions.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.12.2024</td>
<td>07.01.2025</td>
<td>All</td>
<td>110.138</td>
<td>110.553</td>
</tr>
<tr class="even">
<td>2024-11-25</td>
<td>2025-01-01</td>
<td>publishers/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex">openalex.publishers</a></td>
<td>schema_openalex_publishers.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.12.2024</td>
<td>07.01.2025</td>
<td>All</td>
<td>10.376</td>
<td>10.741</td>
</tr>
<tr class="odd">
<td>2024-11-25</td>
<td>2025-01-01</td>
<td>sources/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex">openalex.sources</a></td>
<td>schema_openalex_sources.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.12.2024</td>
<td>07.01.2025</td>
<td>All</td>
<td>258.125</td>
<td>260.811</td>
</tr>
<tr class="even">
<td>2024-11-25</td>
<td>2024-12-30</td>
<td>topics/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex">openalex.topics</a></td>
<td>schema_openalex_topics.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.12.2024</td>
<td>07.01.2025</td>
<td>All</td>
<td>4.516</td>
</tr>
<tr class="odd">
<td>2024-11-25</td>
<td>2024-12-31</td>
<td>works/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sopenalex">openalex.works</a></td>
<td>schema_openalex_work.json</td>
<td><a href="https://github.com/naustica/openalex">Repo</a></td>
<td>04.12.2024</td>
<td>07.01.2025</td>
<td>All</td>
<td>261.381.162</td>
<td>262.630.159</td>
</tr>
</tbody>
</table>
@@ -2631,14 +2632,14 @@ <h2 id="status-openalex-document-type-classification-by-sub-göttingen">Status O
</thead>
<tbody>
<tr class="odd">
<td>2024-11-25</td>
<td>2024-12-31</td>
<td>works/</td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sresources">resources.classification_article_reviews_november_2024</a></td>
<td><a href="https://console.cloud.google.com/bigquery?ws=!1m4!1m3!3m2!1ssubugoe-collaborative!2sresources">resources.classification_article_reviews_december24</a></td>
<td>schema_document_types.json</td>
<td><a href="https://github.com/naustica/openalex_doctypes/tree/classifier/classifier">Repo</a></td>
<td>04.12.2024</td>
<td>All</td>
<td>152.162.123</td>
<td>10.01.2025</td>
<td>2014-2024</td>
<td>58.240.262</td>
</tr>
</tbody>
</table>
5 changes: 3 additions & 2 deletions docs/index.html
@@ -21,8 +21,9 @@

<style type="text/css">code{white-space: pre;}</style>
<style type="text/css" data-origin="pandoc">
html { -webkit-text-size-adjust: 100%; }
pre > code.sourceCode { white-space: pre; position: relative; }
pre > code.sourceCode > span { line-height: 1.25; }
pre > code.sourceCode > span { display: inline-block; line-height: 1.25; }
pre > code.sourceCode > span:empty { height: 1.2em; }
.sourceCode { overflow: visible; }
code.sourceCode > span { color: inherit; text-decoration: inherit; }
@@ -33,7 +34,7 @@
}
@media print {
pre > code.sourceCode { white-space: pre-wrap; }
pre > code.sourceCode > span { display: inline-block; text-indent: -5em; padding-left: 5em; }
pre > code.sourceCode > span { text-indent: -5em; padding-left: 5em; }
}
pre.numberSource code
{ counter-reset: source-line 0; }