Information Science

MetadataExtractor.py

2009
thumbnail image
Refactoring of an earlier Python program (TableScraper.py) to convert it from a procedural to an object-oriented style. The program follows links on collection list web pages for Ramsey Library Special Collections, extracts Dublin Core metadata from each page, and saves the metadata to an XML file for import into a database.