DataCite PID Links Data File 2023

DOI:
 
Description:

The DataCite PID Links Data File contains records in JSON format for relationships between PIDs within the DataCite PIDGraph, up to the end of 2023.

Records have a core triple of an object, a subject, and the relationship between the two, and also include a selection of metadata about the relationship, including the creation date and source of the assertion, when the relationship occurred, and the types of object involved.

The PID Links data file is split into 1679 individual files, each containing 100,000 records as JSON objects, one per line (a variety of JSON known as JSON Lines), to enable easy parallel processing and extraction of data. The files are then stored in a tarball and compressed with gzip. Further guidance and tutorials for working with the PID Links data file will be available soon. In the meantime, please contact support@datacite.org if you have questions or need assistance.

Use of the DataCite PID Links Data File is subject to the DataCite Data File Use Policy.

Part of the development work for this file was supported by the FAIRCORE4EOSC project. FAIRCORE4EOSC has received funding from the EU’s Horizon Europe research and innovation programme under Grant Agreement no. 101057264.

Size:
167,844,248 records
11GiB compressed
95GiB decompressed

Get access

Please provide an email address to receive a download link to the data file.