ORION-DBs
  • Collections
    • Overview
    • CWTS Leiden Datasets
    • Dimensions Open Datasets
    • InSySPo Campinas Datasets
    • KB OPENBIB
    • MultiObs Campinas Datasets
    • Sesame Open Science Datasets
    • SUB Göttingen Datasets
  • News & Tutorials
  • About
  • Contribute

Datasets

  • butler_apcs
  • crossref
  • datacite
  • doaj
  • goa
  • make_data_count
  • openaire
  • openapc
  • opencitations
  • openeditors
  • pkp
  • ror
  • truthtables

Sesame Open Science

Collection maintained by Bianca Kramer (Sesame Open Science)

Info and documentation: https://github.com/bmkramer/metadata_ingest

…

butler_apcs

Description

Butler, Leigh-Ann; Hare, Madelaine; Schönfelder, Nina; Schares, Eric; Alperin, Juan Pablo; Haustein, Stefanie, 2024, “Open dataset of annual Article Processing Charges (APCs) of gold and hybrid journals published by Elsevier, Frontiers, MDPI, PLOS, Springer-Nature and Wiley 2019-2023.

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#butler-apcs
Created: Aug 26, 2025 15:17 | Location: US | View in BigQuery Console

crossref

Description

Crossref public data file (sample) and Crossref API data for members and journals

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#crossref-public-data-file
Created: Jul 08, 2025 07:53 | Location: US | View in BigQuery Console

datacite

Description

DataCite monthly data file and DataCite clients (data underlying the DataCite API clients endpoint)

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#datacite-monthly-data-file
Created: Feb 07, 2026 15:27 | Location: US | View in BigQuery Console

doaj

Description

DOAJ journal metadata

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#doaj
Created: Aug 03, 2025 20:57 | Location: US | View in BigQuery Console

goa

Description

Walt Crawford - Gold Open Access datasets: GOA5 (2014-2019) to GOA10 (2020-2024)

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#goa
Created: Aug 03, 2025 21:22 | Location: US | View in BigQuery Console

make_data_count

Description

Make Data Count - Data Citation Corpus

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#make-data-count-data-citation-corpus

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#make-data-count-data-citation-corpus
Created: Jun 04, 2026 06:25 | Location: US | View in BigQuery Console

openaire

Description

OpenAIRE Graph Dataset

Documentation of data ingest:

https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#openaire

Created: Mar 28, 2025 16:48 | Location: US | View in BigQuery Console

openapc

Description

Data from the OpenAPC initiative

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#openapc
Created: Aug 03, 2025 21:02 | Location: US | View in BigQuery Console

opencitations

Description

OpenCitations Meta - bibliographic metadata for all publications included in the OpenCitations Index

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#opencitations-meta
Created: Dec 29, 2024 11:23 | Location: US | View in BigQuery Console

openeditors

Description

Open Editors collects data about scholarly journals’ editors and editorial board members.

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest/src/branch/main/README.md#openeditors
Created: Mar 19, 2026 13:11 | Location: US | View in BigQuery Console

pkp

Description

PKP Beacon dataset - details of publications using software by the Public Knowledge Project

Documentation on data ingest: https://codeberg.org/TwoBirds/metadata_ingest#pkp
Created: Jun 05, 2025 23:52 | Location: US | View in BigQuery Console

ror

Description

Full ROR dataset

Documentation of data ingest: https://codeberg.org/TwoBirds/metadata_ingest#ror
Created: Jul 31, 2025 20:49 | Location: US | View in BigQuery Console

truthtables

Description

Processed tables indicating presence (TRUE/FALSE) and count of several metadata elemements for each record in a data source.

Documentation of data processing: https://codeberg.org/TwoBirds/metadata_ingest#truthtables Created to facilitate comparison of metadata coverage across sources
Created: Mar 28, 2025 17:39 | Location: US | View in BigQuery Console
  • The content on this website is licensed under CC0.
  • Privacy

  • Contact

  • Built with Quarto