DOI Check
1. Overview and Context
This procedure verifies the DOIs extracted from a PDF against the DOIs as they appear in the PDF itself.
2. Triggers
The execution of this procedure is usually triggered by
3. Steps to Be Performed
- Open the CSV file generated by the DOI Extraction procedure. The file contains a column with ordinals and a column with DOIs.
- The first line contains the DOI of the publication itself. Delete this DOI but keep the row, leaving the DOI cell empty so it can hold the first DOI missed by the script (see below).
- Search the PDF for the string `DOI:` and compare each instance against the CSV file.
- The DOI script will have generated the following errors:
  - DOIs cut with a line break after a slash `/`: these are not recognized and should be added manually with the correct ordinal.
  - DOIs cut with a line break after a period `.`: these are only recognized up to the period; the rest (including the period) should be added to the entry.
  - DOIs cut with a line break after a dash `-`: these are only recognized up to and including the dash; the rest should be added to the entry.
  - DOIs at the end of a page: the script treats the page number as part of the DOI, so the page number has to be removed from the entry.
- Once the CSV file has been corrected, make sure each DOI is correctly numbered and that there are no empty lines or spaces.
- Save the CSV file.
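
The final check (correct numbering, no empty lines, no spaces) can be partly automated. The following is a minimal sketch, not part of the DOI Extraction script. It assumes a comma-separated file with no header row, exactly two columns (ordinal, DOI), and ordinals that start at 1 and increase by 1. The function name `check_doi_rows` and the sample data are hypothetical.

```python
import csv
import io


def check_doi_rows(rows):
    """Return a list of problems found in (ordinal, DOI) rows.

    Assumptions (not confirmed by the procedure): no header row,
    two columns per row, ordinals counting up from 1.
    """
    problems = []
    expected = 1  # next ordinal we expect to see
    for line_no, row in enumerate(rows, start=1):
        # csv.reader yields an empty list for a blank line
        if not row or all(cell.strip() == "" for cell in row):
            problems.append(f"line {line_no}: empty line")
            continue
        if len(row) != 2:
            problems.append(f"line {line_no}: expected 2 columns, found {len(row)}")
            expected += 1
            continue
        ordinal, doi = row
        if ordinal != str(expected):
            problems.append(f"line {line_no}: ordinal {ordinal!r}, expected {expected}")
        if doi == "":
            problems.append(f"line {line_no}: missing DOI")
        elif any(ch.isspace() for ch in doi):
            problems.append(f"line {line_no}: whitespace in DOI {doi!r}")
        expected += 1
    return problems


# Made-up sample: a space inside a DOI, a blank line, and a skipped ordinal.
sample = "1,10.1000/abc123\n2,10.1000/def 456\n\n4,10.1000/ghi789\n"
for problem in check_doi_rows(csv.reader(io.StringIO(sample))):
    print(problem)
```

To run the check on a real file, pass `csv.reader(f)` for a file opened with `open(path, newline="")`. The sketch only reports problems; corrections are still made by hand against the PDF.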
4. Additional Information
5. Document Control
| Document ID | PRO-004 |
| Document Owner | Vincent |
| Version | 1.0 |
| Last Date of Change | October 2, 2025 |
| Next Review Due Date | |
| Version & Change Tracking |