Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
22 commits
Select commit Hold shift + click to select a range
7cc0f7a
Initial issue commit.
bdgregg Feb 18, 2026
7075968
Adjusting Islandora Models.
bdgregg Feb 18, 2026
fb43989
A bit of cleanup.
bdgregg Feb 18, 2026
97078c7
Built out the PDF model, updated the model map, and explicitly set th…
bdgregg Feb 18, 2026
2d83e10
Added additional fields for PDFs and added minimal row_data for Simpl…
bdgregg Feb 18, 2026
0a92610
Added tables to the README.md file.
bdgregg Feb 18, 2026
e946d26
Updated the tables in the README.md file.
bdgregg Feb 18, 2026
cd956a3
Updated the tables in the README.md file.
bdgregg Feb 18, 2026
a6cce36
Updated the tables in the README.md file.
bdgregg Feb 18, 2026
05ed05c
Updated the tables in the README.md file.
bdgregg Feb 18, 2026
90a0624
Updated the tables in the README.md file.
bdgregg Feb 18, 2026
ea1316c
Fixed spacing in function call parameters.
bdgregg Feb 26, 2026
a7226d5
Adjust function documentation to correctly describe the return value …
bdgregg Feb 26, 2026
d2c1561
Added function documentation to process_file.
bdgregg Feb 26, 2026
f04de97
Adding some argument signatures to functions.
bdgregg Feb 26, 2026
4726124
Removed some unused functions.
bdgregg Feb 26, 2026
736b240
Added the missing $ in the regex.
bdgregg Feb 26, 2026
607681e
Remove function in preference for in-line code.
bdgregg Feb 26, 2026
91d8a30
Removed unused function dump_df_columns.
bdgregg Feb 26, 2026
fb5ca84
Moved skip patterns to the config file to allow for customization.
bdgregg Feb 27, 2026
e356d52
Updated the README.md file to address the \'skip\' parameter.
bdgregg Feb 27, 2026
454e125
Renaming of Models and a bit of clean up and more logging.
bdgregg Feb 27, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
91 changes: 37 additions & 54 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,81 +7,64 @@ few types of file directory layouts relating to these types of objects:

- Books

- Compound Books

- Manuscripts

- Newspaper Issues
- Newspaper/Serial Issues (Paged)

- Newspaper/Serial Issues (PDF)

- Audio (Oral Histories or General Audio files)

- Video (Oral Histories or General Videos)

- Images

Currently no other object types are addressed, but the script will
identify and add them as it finds them.

The script will ignore the following files and directories (including
contents):
- Image (Single Image)

- Directory named 'ignore'
- PDF (Single File)

- File named 'manuscript.csv'
- Page (Single Page)

- File named 'manuscript.xls'
Currently no other object types are addressed, but the script will
identify and add them as it finds them.

- File named 'manuscript.xlsx'
The script will ignore the files and directories (including contents) of the configuration file 'skip' parameter. This is a comma separated list of patterns that the script will not process. These are file patterns that you do not want to process.

## Google Sheet requirements:

Sheet Columns:

-----------------------------------------------------------------------
Required Columns Description
------------------------ ----------------------------------------------
'id' The PID of the object. This column must exist.

'file' Leave this empty but the column must exist.
This field will be updated by the script with
the full path to the file.

Optional Columns

'thumbnail' Used for A/V media. If a .jpg or .png file is
found the full path to the file will be added
to this column.

'transcript' Used for A/V media. If a .srt or .vtt file is
found the full path to the file will be added
to this column.


-----------------------------------------------------------------------

Script Parameters:

-------------------------------------------------------------------------
Required Parameters Description
------------------------- -----------------------------------------------
\--config-file Full or relative path to the configuration file
used for the script.
### Sheet Required Columns:

\--log-file Full or relative path to the log file that will
be generated.
|Required Columns | Description |
|--- |--- |
|'id' |The PID of the object. This column must exist.|
|'file' |Leave this empty but the column must exist. This field will be updated by the script with the full path to the file.|
|'model' |This column will be added. This is the Islandora Model.|
|'field_model' |This column will be added. This is the Islanodra Model Taxonomy ID.|
|'field_weight' |This column will be added. This is the order of the objects in the sheet if it can be determined.|
|'field_resource_type' |This column will be added. This is the Resource Type of the object in the sheet.|

\--directory Full path of the directory we wish to scan.
### Sheet Optional Columns:


|Optional Columns | Description |
|--- |--- |
|'thumbnail' |Used for A/V media. If a .jpg or .png file is found the full path to the file will be added to this column.|
|'transcript' |Used for A/V media. If a .srt or .vtt file is found the full path to the file will be added to this column.|

Optional Parameters
### Script Required Parameters:

\--in-google-sheet-id The ID number of the Google Sheet.
|Required Parameters | Description |
|--- |--- |
|\--config-file |Full or relative path to the configuration file used for the script.|
|\--log-file |Full or relative path to the log file that will be generated.|
|\--directory |Full path of the directory we wish to scan.|

\--in-google-sheet-name The Name of the Tab in the Google Sheet (E.g.:
Sheet1)
### Script Optional Parameters:

\--in-google-creds-file The full or relative path to the Google
Credentials File.
-------------------------------------------------------------------------
|Optional Parameters | Description |
|--- |--- |
|\--in-google-sheet-id |The ID number of the Google Sheet.|
|\--in-google-sheet-name |The Name of the Tab in the Google Sheet (E.g.: Sheet1)|
|\--in-google-creds-file |The full or relative path to the Google Credentials File.|

## Google Credentials File:

Expand Down
Loading