DOI Check
1. Overview and Context
This procedure checks the DOI extracted from a PDF against the PDF.
2. Triggers
The execution of this procedure is usually triggered by
3. Steps to Be Performed
- Open the CSV file generated by the DOI Extraction procedure. The file contains a column with ordinals and a column with DOIs.
- The first line will contain the DOI of the publication itself. Remove the DOI number and keep the empty place for the first DOI missed by the script (see below).
- Search the PDF for the string
DOI:
and compare each instance against the CSV file. - The DOI script will have generated the following errors:
- DOIs cut with a line break after a slash
/
: these are not recognized and should be added manually with the correct ordinal. - DOIs cut with a line break after a period
.
: these are only recognized up to the period, the rest (including the period) should be added to the entry. - DOIs cut with a line break after a dash
-
: these are only recognized up to and including the slash, the rest should be added to the entry. - DOIs at the end of a page: the script will recognize the page number as part of the DOI, and will have to be removed.
- DOIs cut with a line break after a slash
- Once the CSV file has been corrected make sure each DOI is correctly numbered and there are no empty lines or spaces.
- Save the CSV file.
4. Additional Information
5. Document Control
Document ID | PRO-004 |
Document Owner | Vincent |
Version | 1.0 |
Last Date of Change | October 2, 2025 |
Next Review Due Date | |
Version & Change Tracking |