Digital Scholarship at Schaffer Library

Collections as Data

As we continue to digitize the unique collections at Schaffer Library, we generate new datasets of machine-readable text and metadata available for computational analysis and other digital scholarship methods. We now have a Schaffer Library Collections as Data GitHub page where you can download and analyze our collections data.

Data from our collections include text files from optical character recognition (OCR) extracts, structured metadata files (e.g., in CSV or TSV format), and XML files. Collections currently available include the Concordiensis, the OD Putnum Photographs, and the Jonathan Pearson Diary.

Using Data from Library Materials

Many of our vendors provide downloads, visualization tools, and tools to support analyses within their interfaces. These include: