Skip to main content

DOI Check

1. Overview and Context

This procedure checks the DOI extracted from a PDF against the PDF.

2. Triggers

The execution of this procedure is usually triggered by

3. Steps to Be Performed

  • Open the CSV file generated by the DOI Extraction procedure. The file contains a column with ordinals and a column with DOIs.
  • The first line will contain the DOI of the publication itself. Remove the DOI number and keep the empty place for the first DOI missed by the script (see below).
  • Search the PDF for the string DOI: and compare each instance against the CSV file.
  • The DOI script will have generated the following errors:
    • DOIs cut with a line break after a slash /: these are not recognized and should be added manually with the correct ordinal.
    • DOIs cut with a line break after a period .: these are only recognized up to the period, the rest (including the period) should be added to the entry.
    • DOIs cut with a line break after a dash -: these are only recognized up to and including the slash, the rest should be added to the entry.
    • DOIs at the end of a page: the script will recognize the page number as part of the DOI, and will have to be removed.
  • Once the CSV file has been corrected make sure each DOI is correctly numbered and there are no empty lines or spaces.
  • Save the CSV file.

4. Additional Information

5. Document Control

Document ID PRO-004
Document Owner Vincent
Version 1.0
Last Date of Change October 2, 2025
Next Review Due Date
Version & Change Tracking