As we continue to digitize the unique collections at Schaffer Library, we generate new datasets of machine-readable text and metadata available for computational analysis and other digital scholarship methods. We now have a Schaffer Library Collections as Data GitHub page where you can download and analyze our collections data.
Data from our collections include text files from optical character recognition (OCR) extracts, structured metadata files (e.g., in CSV or TSV format), and XML files. Collections currently available include the Concordiensis, the OD Putnum Photographs, and the Jonathan Pearson Diary.