You can do a lot in 1,000 lines of code. A retrieval system blending a Dewey browser and an interactive â€˜liveâ€™ search is presented which is currently implemented in about 1,000 lines. As the user types, the system searches for records, classifies those records into DDC categories, and displays them using standard protocols. The data structures needed to support this are computed in parallel using a Python implementation of map-reduce.
Other topics include discussing the reasoning behind OCLCâ€™s recent shift to using version 2.0 of the Apache License to release our open source software.
--Thomas Hickey, Chief Scientist, OCLC