Skip to main content

DOI Extraction

1. Overview and Context

This procedure extracts the DOIs from a PDF by means of a custom Python script.script and returns a CSV file.

2. Triggers

The execution of this procedure is usually triggered by

3. Steps to Be Performed

  • Save a copy of the DOI Extraction Script to your hard drive as doiextract.py
  • Move the PDF to the same folder
  • Open the script file in a code editor
  • Change pdf_path to the file name of the PDF (incl. .pdf)
  • Open a command line tool like Terminal
  • Navigate to the folder containing the script and the PDF
  • Run python3 doiextract.py
  • The script will generate a CSV file of the PDF file suffixed with _dois.csv

4. Additional Information

5. Document Control

Document ID PRO-003
Document Owner Vincent
Version 1.0
Last Date of Change October 2, 2025
Next Review Due Date
Version & Change Tracking