DOI Extraction
1. Overview and Context
This procedure extracts the DOIs from a PDF by means of a custom Python script and returns a CSV file.
2. Triggers
The execution of this procedure is usually triggered by
3. Steps to Be Performed
- Save a copy of the DOI Extraction Script to your hard drive as
doiextract.py - Move the PDF to the same folder
- Open the script file in a code editor
- Change
pdf_pathto the file name of the PDF (incl..pdf) - Open a command line tool like Terminal
- Navigate to the folder containing the script and the PDF
- Run
python3 doiextract.py - The script will generate a CSV file of the PDF file suffixed with
_dois.csv
4. Additional Information
5. Document Control
| Document ID | PRO-003 |
| Document Owner | Vincent |
| Version | 1.0 |
| Last Date of Change | October 2, 2025 |
| Next Review Due Date | |
| Version & Change Tracking |