1,000 Lines of Code, and other topics from OCLC Research

Thomas Hickey

You can do a lot in 1,000 lines of code. This talk presents a retrieval system that blends a Dewey browser with an interactive ‘live’ search and is currently implemented in about 1,000 lines. As the user types, the system searches for records, classifies those records into Dewey Decimal Classification (DDC) categories, and displays the results using standard protocols. The data structures needed to support this are computed in parallel using a Python implementation of map-reduce.
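To make the idea concrete, here is a minimal sketch of how a map-reduce pass might build a DDC-keyed index and how a prefix search could then group hits by class. It is not the system described in the talk: the sample records, the function names (`map_record`, `reduce_pairs`, `build_index`, `live_search`), and the use of a thread pool as the parallel executor are all assumptions made for illustration.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sample records as (title, DDC number); not data from the talk.
RECORDS = [
    ("Introduction to Algorithms", "005.1"),
    ("Python Cookbook", "005.133"),
    ("A History of Libraries", "027.009"),
    ("Organic Chemistry", "547"),
    ("Calculus", "515"),
]


def map_record(record):
    """Map step: emit a (DDC main class, title) pair for one record."""
    title, ddc = record
    main_class = ddc[0] + "00"  # first digit selects the main class, e.g. "547" -> "500"
    return main_class, title


def reduce_pairs(pairs):
    """Reduce step: group titles under their DDC main class."""
    index = defaultdict(list)
    for main_class, title in pairs:
        index[main_class].append(title)
    return {key: sorted(titles) for key, titles in index.items()}


def build_index(records, workers=4):
    """Run the map step on a pool of workers, then reduce the results."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        pairs = list(pool.map(map_record, records))
    return reduce_pairs(pairs)


def live_search(index, prefix):
    """Return {main class: [titles]} for titles starting with the typed prefix."""
    prefix = prefix.lower()
    hits = {}
    for main_class, titles in index.items():
        matched = [t for t in titles if t.lower().startswith(prefix)]
        if matched:
            hits[main_class] = matched
    return hits
```

Calling `build_index(RECORDS)` once and then `live_search(index, "py")` on each keystroke mirrors the split the abstract describes: the expensive structure is computed ahead of time, and the interactive search only filters and groups. A real deployment would likely distribute the map step across processes or machines rather than threads.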

Other topics include the reasoning behind OCLC’s recent shift to releasing our open source software under version 2.0 of the Apache License.

--Thomas Hickey, Chief Scientist, OCLC

Attachment: 1000Lines.ppt (1.08 MB)