Skip to main content

Internet Archive Update

1. Overview and Context

This procedure updates the GoogleBooksInternetArchive table in the SQL Database.

2. Triggers

The execution of this procedure is usually triggered by

3. Steps to Be Performed

3.1 Download

GoogleUsage Booksdata are directly extracted using the Internet Archive usage data areAPI. availableWe viahave written a script that extracts these data automatically and generates a CSV file, organized by Thoth workId and country per timeframe (usually a month).

In order to generate the Googledata, Playmake Bookssure Partnerall Center.

workIds
    of Gorecently topublished menubooks itemare Reports > Custom Reports. Select Report Type "Google Books Traffic Report," Organize by "Book,"added, and select datethe range.correct timeframe.

    Screenshot 2023-04-17 at 12.36.27.png

    3.2 Clean

    • Import as encoded as UTF-8
    Delete header
    Append 1 column to the left, add date in YYYY-MM-01 format
    save as XLS re-export as CSV

    3.3 Upload

    • Upload into phpMyAdmin table "GoogleBooks"InternetArchive"

    4. Additional Information

    5. Document Control

    Document ID PRO-040041
    Document Owner Vincent
    Version 1.0
    Last Date of Change March 26, 2026
    Next Review Due Date
    Version & Change Tracking