Skip to main content

DOI Check

1. Overview and Context

This procedure checks the DOI extracted from a PDF against the PDF.

2. Triggers

The execution of this procedure is usually triggered by

3. Steps to Be Performed

  • Open the CSV file generated by the DOI Extraction procedure. The file contains a column with ordinals and a column with DOIs.
  • The first line will contain the DOI of the publication itself. Remove the DOI number and keep the empty place for the first DOI missed by the script (see below).
  • Search the PDF for the string DOI: and compare each instance against the CSV file.
  • The DOI script will have generated the following errors:
    • DOIs cut with a line break after a slash /: these are not recognized and should be added manually with the correct ordinal.
    • DOIs cut with a line break after a period .: these are only recognized up to the period, the rest (including the period) should be added to the entry.
  • DOIs cut with a line break after a dash -: these are only recognized up to and including the slash, the rest should be added to the entry.
  • DOIs at the end of a page: the script will recognize the page number as part of the DOI, and will have to be removed.
Once the CSV file has been corrected make sure each DOI is correctly numbered and there are no empty lines or spaces. Save the CSV file.

4. Additional Information

5. Document Control

Document ID PRO-004
Document Owner Vincent
Version 1.0
Last Date of Change October 2, 2025
Next Review Due Date
Version & Change Tracking