Skip to main content

Internet Archive Update

1. Overview and Context

This procedure updates the InternetArchive table in the SQL Database.

2. Triggers

The execution of this procedure is usually triggered by

3. Steps to Be Performed

3.1 Download

Usage data are directly extracted using the Internet Archive usage data API. We have written a script that extracts these data automatically and generates a CSV file, organized by Thoth workId and country per timeframe (usually a month).

In order to generate the data, make sure all workIds of recently published books are added, and select the correct timeframe.

3.2 Clean

  • Import as encoded as UTF-8
  • Delete header

3.3 Upload

  • UploadNavigate intoto phpMyAdmin table "InternetArchive"
  • select Import tab;
  • select file
  • click "Go."
archive files under Punctum Admin > Metrics > Internet Archive

4. Additional Information

5. Document Control

Document ID PRO-041
Document Owner Vincent
Version 1.0
Last Date of Change March 26, 2026
Next Review Due Date
Version & Change Tracking