Enhancing the Performance and Extensibility of the XC’s MetadataServicesToolkit

  • Ben Anderson, eXtensible Catalog Organization, banderson@library.rochester.edu

Code4Lib 2011, Wednesday 9 February, 10:55 - 11:15

Learn how we increased the performance of the XC Metadata Services Toolkit (MST) by over 900%. The MST is an open-source Java application, that uses SOLR and MySQL to harvest (OAI-PMH) library metadata (MARC, DC), clean it up, convert and frbrize, and then make new metadata (RDA flavor, XC Schema) available for harvesting. Our first release performed too slowly with degrading performance with large record batches and we needed to enable the MST to process a library’s entire catalog in a reasonable amount of time on a common server. The MST was also intended to be extensible. Libraries will almost certainly want to customize this process in some way. Thus our second goal was to make it is as easy as possible for a developer to write a service which can be plugged into the MST.

slides video(about 133m in)