Code4Lib 2011 schedule

The schedule for the 2011 Code4Lib Conference in Bloomington, IN. If you would like this schedule in your calendering app convert the hCalendar in this page to iCal.

Pre-Conferences

Monday, February 7

08:00-09:00 - Registration / coffee
09:00-12:00 - Morning sessions
12:00-13:30 - Lunch (on your own)
13:30-16:30 - Afternoon sessions

Full Day Pre-Conferences:

CURATEcamp Hackfest (Location: Frangipani)

Videos

Morning Pre-Conferences (09:00 - 12:00)

What's New in Solr (Location: Dogwood) (slides)
Introduction to Functional Programming with Javascript (Location: Maple)
Running Cloud Servers (Location: Walnut)
Creating a New JHOVE2 Format Module (Location: Persimmon)

Afternoon Pre-Conferences (13:30 - 16:30):

Using JHOVE2 for Policy Assessment of Files (Location: Persimmon)
Publishing Historic Newspapers with NDNP Tools (Location: Oak)
VIVO Bootcamp (Location: Maple)
Islandora Repository System (Location: Walnut)
A Code4Lib Preconference Unconference (Location: Dogwood )

Conference Program

Live and archived video of the conference

Session #1 will be the activity happening at Whittenberger the morning of Tuesday Feb. 8th. The stream will go live at 8:45AM and end at 11:00AM, or terminated by the video crew when the presentations conclude. Live video. Archived stream.
Session #2 will be the activity happening at Alumni Hall for Tuesday Feb 8th. The stream will go live at 11:00AM and end at 5:30PM, or terminated by the video crew when the presentations conclude. Live video. Archived stream.
Session #3 will be the activity happening at Alumni Hall for Wednesday Feb 9th. The stream will go live at 8:50AM and end at 5:30PM, or terminated by the video crew when the presentations conclude. Live video. Archived stream (morning). Archived stream (afternoon).
Session #4 will be the activity happening at Alumni Hall for Thursday Feb 10th. The stream will go live at 8:50AM and end at 1:00PM, or terminated by the video crew when the presentations conclude. Live video. Archived stream.

(Some descriptions were abbreviated. Complete descriptions are on the individual pages for each session.)

Tuesday, February 8

The day begins at Whittenberger Auditorium. After the keynote talk, we will move to Alumni Hall for the rest of the conference.

08:00-09:00 - Registration / Breakfast
09:00-09:15 - Welcome - Brenda Johnson (Dean of Libraries, Indiana University) / Orientation / Housekeeping
09:15-09:45 - Introductory remarks - Brad Wheeler (Vice-President for Information Technology, Indiana University)
09:45-10:05 - Break
10:05-10:50 - Keynote - Diane I. Hillmann
10:50-11:10 - Move to Alumni Hall IMU
11:10-11:30 - Visualizing Library Data - Karen Coombs
Visualizations can be powerful tools to give context to library users and to provide a clear picture for data-driven decision-making in libraries. Map mashups, tag clouds and timelines can be used to show information to users in new ways and help them locate materials to meet their needs. QR codes can help link users to materials that libraries have in their collections. Charts and graphs can be used to help analyze library collections (holdings) and compare them to other libraries. This session will show prototypes which combine tools like Google Chart API, Protovis and Simile Widgets with data from WorldCat, WorldCat Registry, Classify, Terminology Services, and Dewey.info to create vivid illustrations in library user interfaces and administration tools.
11:30-11:50 - Improving the Presentation of Library Data Using FRBR and Linked Data - Anne-Lena Westrum and Asgeir Rekkavik
The Pode project at Oslo Public Library has experimented on the automated FRBRizing of catalogue records, as well as expressing bibliographic descriptions as linked data to enrich catalogue browsing with information from external sources.
11:30-11:50 - Hey, Dilbert. Where’s My Data?! - Thomas Barker
MetriDoc’s mission is to provide an open source API / tool set where users can specify dataflows and use library based services to solve data integration problems while MetriDoc worries about scalability and performance. The first half the presentation briefly discusses MetriDoc’s architecture while the remainder of the presentation will include code samples to illustrate problems it can solve. Information on how to contribute or download MetriDoc will be provided as well.
11:50-12:10 - Kuali OLE: Architecture for Diverse and Linked Data - Tim McGeary and Brad Skiles
With programming scheduled to be begin in January 2011 on the Kuali Open Library Environment (OLE), the Kuali OLE Functional Council is developing the requirements for an architecture for diverse data sets and linked data. With no frontrunner for one bibliographic data standard, and local requirements on what data will be accompanying or linked to the main record store, Kuali OLE needs to build a flexible environment for records management and access. We will present the concepts of our planned architecture, a multi-repository framework, using a document repository, a semantic repository, and a relational repository, brokered on top of the enterprise service bus of Kuali Rice. As a community source project, this is an opportunity for the Kuali OLE partners to present our plans for discussion with the community, and we look forward to feedback, questions, and comments.
12:10-12:30 - Drupal 7 as Rapid Application Development Tool - Cary Gordon
Five years ago, I discovered that the Drupal CMS had a programming framework disguised as an API, and learned that I could use it to solve problems. Drupal 7 builds on that to provide a powerful toolset for interfacing with, manipulating and presenting data. It empowers tool-builders by providing a minimal install option, along with a more powerful installation profile system makes it easier for developers to package and distribute their applications.
12:30-13:30 - Lunch (provided)
13:30-13:50 - Enhancing the Mobile Experience: Mobile Library Services at Illinois - Josh Bishoff
The University of Illinois Libraries launched a mobile interface in Spring 2010 that includes a custom mobile catalog layer built on top of VuFind. It allows patrons to request books for delivery, to browse the local and CARLI consortium catalogs, and access account information for renewal & checking hold status. This presentation will focus on new features designed to add value for the mobile user, such as adding Google map links to catalog records, offering current information for campus bus stops, and automatic device detection for users accessing the full-sized library gateway from their mobile device. I’ll discuss how developing for the mobile context, and talking to mobile users, has informed the further development & improvement of library web services overall.
13:50-14:10 - One Week, One Tool: Ultra-Rapid Open Source Development Among Strangers - Scott Hanrath
In summer 2010, the Center for History and New Media at George Mason University, supported by an NEH Summer Institute grant, gathered 12 'digital humanists' for an intense week of collaboration they dubbed 'One Week | One Tool: a digital humanities barn raising.' The group -- several of whom hang their professional hats in libraries and most of whom were previously unacquainted -- was asked to spend one week together brainstorming, specifying, building, publicizing, and releasing an open source software tool of use to the digital humanities community. The result was Anthologize, a free, open source plugin that transforms WordPress into a platform for publishing electronic texts in formats including PDF, ePub, and TEI; in other words, a "blog-to-book" tool.
14:10-14:30 - VuFind Beyond MARC: Discovering Everything Else - Demian Katz
The VuFind discovery layer has been providing a user-friendly interface to MARC records for several years now. However, library data consists of more than just MARC records, and VuFind has grown to accommodate just about anything you can throw at it. This presentation will examine the new workflows and tools that enable discovery of non-MARC resources and some of the non-traditional applications of VuFind that they make possible. Technologies covered will include OAI-PMH, XSLT, Aperture, Solr and, of course, VuFind itself.
14:30-14:50 - Letting In the light: Using Solr as an External Search Component - Jay Luker and Benoit Thiell
It’s well-established that Solr provides an excellent foundation for building a faceted search engine. But what if your application’s foundation has already been constructed? How do you add Solr as a federated, fulltext search component to an existing system that already provides a full set of well-crafted scoring and ranking mechanisms? This talk will describe a work-in-progress project at the Smithsonian/NASA Astrophysics Data System to migrate its aging search platform to Invenio, an open-source institutional repository and digital library system originally developed at CERN, while at the same time incorporating Solr as an external component for both faceting and fulltext search.
14:50-15:10 - Break
15:10-16:00 - Breakout Sessions 1 - Alumni Hall, Solarium, and Whittenberger Auditorium
16:00-17:00 - Lightning Talks 1
17:00-17:15 - Breakout reports and daily wrap-up
18:00-20:00 - Reception (sponsors DLF, Elsevier, Index Data, and OCLC)- IU Art Museum

Wednesday, February 9

Session #3 will be the activity happening at Alumni Hall for Wednesday Feb 9th. The stream will go live at 8:50AM and end at 5:30PM, or terminated by the video crew when the presentations conclude. Live video. Archived stream (morning). Archived stream (afternoon).

08:00-09:00 - Breakfast
09:00-09:15 - Housekeeping, Introductions
09:15-09:35 - A Community-Based Approach to Developing a Digital Exhibit at Notre Dame Using the Hydra Framework - Rick Johnson and Dan Brubaker Horst
We will cover: Why we chose to adopt the Hydra Framework instead of creating our own solution; Why the community based approach is so appealing; How we were welcomed into the Hydra development community; Why we chose to create something beyond basic Blacklight search and facet browse; How to create your own Digital Exhibit using Hydra including: Metadata management, Custom Browse and Search.
09:35-09:55 - Chicago Underground Library’s Community-Based Cataloging System - Margaret Heller and Nell Taylor
We have developed a unique cataloging and discovery system using Drupal, which we eventually hope to provide as a standalone module that any organization can implement as both a technical and theoretical template to start an Underground Library in its own city. Chicago Underground Library (CUL) is a replicable model for community collections. It uses the lens of an archive to examine the creative, political, and intellectual interdependencies of a city, tracing how people have worked together, who influenced whom, where ideas first developed, and how they spread from one publication to another through individuals.
09:55-10:15 - Beyond Sacrilege: A CouchApp Catalog - Gabriel Farrell
At Code4Lib 2008 Dan Scott gave us a taste of yummy CouchDB, a document-oriented database with a RESTful JSON API. Since then, CouchDB has passed the 1.0 mark and landed on desktops, the cloud, and mobile devices. With the advent of CouchApps (web apps served directly from CouchDB) applications can be built that are as easy to install as the replicating of databases (which is super easy!). I'll discuss the advantages and challenges in designing a CouchApp to be used as a catalog, repository, or directory of resources. Some things are made fairly simple, such as site templating, the outputting of documents in different formats, and the attachment of binary objects to documents. Some things, like document versioning and the modeling of data, are a little trickier, but still straightforward. And some things, such as granular authentication and the integration of search, are tangled enough to produce some head-on-wall banging. But hey, take it easy. It’s time to relax. Check out some code at http://github.com/gsf/catlg.
10:15-10:35 - Break
10:35-10:55 - Opinionated Metadata (OM): Bringing a Bit of Sanity to the World of XML Metadata - Matt Zumwalt
Opinionated Metadata (OM) grew from discussions at Code4Lib 2010. It's now an integral component in the Hydra Framework. Unlike most XML solutions, which start from schemas and build outwards, OM allows you to start from the natural vocabulary that emerges in user stories. Based on the terms that show up in those user stories, you can use OM to create a Terminology that maps each term to nodes in schema-driven XML. This Terminology then serves as a Domain Specific Language (DSL) for your code to rely on.
10:55-11:15 - Enhancing the Performance and Extensibility of the XC’s MetadataServicesToolkit - Ben Anderson
Learn how we increased the performance of the XC Metadata Services Toolkit (MST) by over 900%. The MST is an open-source Java application, that uses SOLR and MySQL to harvest (OAI-PMH) library metadata (MARC, DC), clean it up, convert and frbrize, and then make new metadata (RDA flavor, XC Schema) available for harvesting. Our first release performed too slowly with degrading performance with large record batches and we needed to enable the MST to process a library’s entire catalog in a reasonable amount of time on a common server. The MST was also intended to be extensible. Libraries will almost certainly want to customize this process in some way. Thus our second goal was to make it is as easy as possible for a developer to write a service which can be plugged into the MST.
11:15-12:00 - Ask Anything! – Facilitated by Dan Chudnov
AKA "Human Search Engine". A chance for you to ask a roomful of Code4Libbers anything that's on your mind: questions seeking answers (short or long), requests for things (hardware, software, skills, or help), or offers of things. We'll keep the pace fast, and the answers faster. Come with questions and line up at the start of the session and we'll go through as many as we can; sometimes we'll stop at finding the right person or people to answer a query and it'll be up to you to find each other after the session. Second time at Code4Lib!
12:00-13:00 - Lunch (provided)
13:00-13:20 - GIS on the Cheap - Tim Shearer (for Mike Graves)
Using a few tools that you probably already have laying around your library I'll show how OpenLayers can be used to create a dynamic interface for browsing and searching your digital collections geographically. With very little effort Solr can be made to serve up results directly into OpenLayers creating all sorts of mapping possibilities. With a little more effort a Postgres database can handle complex polygon searches. I'll talk about how we're developing a lightweight GIS framework to provide a new user experience for interacting with a number of our collections in unique ways.
13:20-13:40 - Let's Get Small: A Microservices Approach to Library Websites - Sean Hannan
Most, if not all, library websites are housed and maintained in singular, monolithic content management systems. This is fantastic if the library website is the one place your users go for library information. But it isn't. Users are going to Facebook, checking mobile applications, browsing portals as well as checking the library website. Wouldn't it be great if you could update the information on all of these sites from a single source? Why maintain the library hours in five different places? In this talk, I will show how breaking the construction of the library website into as-needed, swappable microservices can free your content to live where it needs to, as well as free you from the maintenance headaches usually involved. What kind of microservices, you ask? Well, basic templating and styling is a given, but how about a microservice that gracefully degrades your layouts for older browsers? Or enforces highfalutin typographic rules? Or optimizes your site assets to improve load times? All wonderful little black boxes that allow you to focus on the website and its content, and not the details. I promise at least one diagram. That will burn your eyes.
13:40-14:00 - fiwalk With Me: Building Emergent Pre-Ingest Workflows for Digital Archival Records using Open Source Forensic Software - Mark Matienzo
Many of the complications of born-digital records involve preparing them for transfer into a storage or preservation environment. Digital evidence of any kind is easily susceptible to unintentional and intentional modification. This presentation will describe the use of open source forensic software in pre-ingest workflows for digital archives. Digital archivists and other digital curation practitioners can develop emergent processes to prepare records for ingest and transfer using a combination of relatively simple tools. The granularity and simplicity of these tools and procedures provides the possibility for their smooth integration into a digital curation environment built on micro-services.
14:00-14:20 - (Yet Another) Home-Grown Digital Library System, Built Upon Open Source XML Technologies and Metadata Standards - David Lacy
We have recently rearchitected our homegrown digital library utilizing an all-XML framework. The system is comprised of a data repository residing in a native XML database (eXist-DB), a metadata editor constructed using a Java-based XForms processor (Orbeon Forms), and a series of services for image manipulation, OCR processing and OAI-PMH serving. In this talk, I will detail our workflow process from scanning to online publishing, demonstrate the software's flexible configuration and features, and how these steps allow rapid digital preservation and online access. Oh, and it's open source, so I'll show you where to get it as well.
14:20-14:40 - Break
14:40-15:50 - Breakout Sessions 2 - Alumni Hall, Solarium, Maple, and Walnut rooms
15:50-17:00 - Lightning Talks 2
17:00-17:15 - Breakout reports and daily wrap-up
17:15-18:00 - Tours of the IU Bloomington Data Center (2 Tours at 5:30 and 6:00 - sign-up at registration table)
18:30-20:00 - Social network dine-arounds

Thursday, February 10

Session #4 will be the activity happening at Alumni Hall for Thursday Feb 10th. The stream will go live at 8:50AM and end at 1:00PM, or terminated by the video crew when the presentations conclude. Live video. Archived stream.

08:00-09:00 - Breakfast
09:00-09:15 - Housekeeping
09:15-09:35 - Building an Open Source Staff-Facing Tablet App for Library Assessment - Jason Casden and Joyce Chapman
This talk will present a library assessment and software development perspective on the creation and utility of an open source tablet-based tool for collecting and analyzing data about the use of library physical spaces. Building on recent experience developing web-based and native-iPhone library apps, we will discuss complicating implementation-related issues such as platform dependence, intermittent network coverage (data caching), and centralized data synchronization with multiple collectors. HTML5 and co-evolving technologies (specifically, Web SQL client-side storage) can be utilized to balance the various advantages of web-based apps with the performance of native apps, but implementation choices can directly impact both the types of data that can be collected and the cost of adoption of an open source release. Finally, we will use an early prototype of this tool to demonstrate some new assessment possibilities.
09:35-09:55 - Mendeley's API and University Libraries: Three Examples to Create Value - ~~Jan Reichelt~~ Ian Mulvany
Mendeley is a technology startup that is helping to revolutionize the way research is done. Used by more than 600,000 academics and industry researchers, Mendeley enables researchers to arrange collaborative projects, work and discuss in groups, as well as share data across its web platform. Launched in London in December 2008, Mendeley is already the world’s largest research collaboration platform. Through this platform, we anonymously pools users’ research paper collections, creating a crowd-sourced research database with a unique layer of social information - each research paper is connected with socio-demographic information about its audience. Based on this platform and data, I will present three examples of how Mendeley is working to support university libraries and contribute to opening up academic research: 1) Mendeley’s integration as a workflow tool with institutional repositories with the aim of increasing IR deposit rates; 2) Application examples building on Mendeley’s API to showcase what is possible with the newly available type of usage data Mendeley is aggregating; 3) Preview of Mendeley’s library dashboard that will reveal content usage within an institution.
09:55-10:15 - Break
10:15-11:00 - Lightning Talks 3
11:00-11:20 - Practical Relevancy Testing - Naomi Dushay
Evaluating search result relevancy is difficult for any sizable amount of data, since human vetted ideal search results are essentially non-existent. This is true even for library collections, despite dedicated librarians and their familiarity with the collections. So how can we evaluate if search engine configuration changes (e.g. boosting, field analysis, search analysis settings) are an improvement? How can we ensure the results for query A don’t degrade while we try to improve results for query B? Why yes, Virginia, automatable tests are the answer. This talk will show you how you can easily write these tests from your hidden goldmine of human vetted relevancy rankings. http://www.stanford.edu/~ndushay/code4lib2011/
11:20-11:40 - Sharing Between Data Repositories - Kevin S. Clarke
Dryad is a generic subject repository that shares author submitted data with other scientific repositories. In a part "how we done it" and part "things to consider" talk, I'll discuss 1) why we chose BagIt and OAI-ORE as mechanisms for sharing our data, 2) how we've integrated with TreeBASE -- a subject repository of phylogenetic information), and 3) the possibility of this method of data sharing being adopted by other repositories within the larger DataONE community. There will be cake. (slides)
11:40-12:00 - Why (Code4) Libraries Exist - Eric Hellman
Libraries have historically delivered value to society by facilitating the sharing of books. The library "brand" is built around the building and exploitation of their collections. These collections have been acquired and owned. As ebook readers become the preferred consumption platform for books, libraries are beginning to come to terms with the fact that they don't own their digital collections, and can't share books as they'd like to. Yet libraries continue to be valuable in many ways. In this transitional period, only one thing can save libraries from irrelevance and dissipation: Code.
12:00-12:15 - Wrap-Up