You are here

code4lib 2006

Anything related to the 2006 code4lib conference to be held in Corvallis, OR, USA, in February 2006.

Anatomy of aDORe

Ryan Chute

The aDORe Archive is a write-once/read-many storage approach for Digital Objects and their constituent datastreams. First, XML-based representations of multiple Digital Objects are concatenated into a single, valid XML file named an XMLtape. Second, ARC files, as introduced by the Internet Archive, are used to contain the constituent datastreams of the Digital Objects. The software was developed by the LANL Digital Library Research & Prototyping Team and is available under GNU LGPL license.

Ryan Chute
Los Alamos National Laboratory, Research Library

Quality Metrics

Aaron Krowne

This talk will discuss the core development activities of the “Quality
Metrics” project at Emory’s Woodruff Library. This project is being
conducted under an IMLS grant to research requirements for and build
a working prototype digital library search system.

What this project is doing that is new is truly generalizing and
integrating explicit and latent quality indicators which allow
users to ascertain the fitness of digital library resources. Most
search engine components have only one indicator: content-query
similarity (“relevance”). Google only has two, adding PageRank to the
latter. Our system, QM-search, will have an unlimited number of these,
which will be customizable by the digital librarian for the target
community and collections, and even customizeable from user to user or
search to search.

Some basic examples of quality indicators that digital libraries might
be able to exploit would be activations (views online or check-outs in
circulation), selection (compilation in “bookmark” lists online or
additions to course reserves lists), extent of review (from a peer-
reviewed journal, conference, or not?), or citation-based metrics.

The ouput of QM-search will be in a completely generalized XML format,
with the search results represented as a structure based on the
structure specified in the input “organization spec”. This XML output
can be transformed into presentation HTML resembling anything from a
“linear” Google-like search results list to an A9-like column display to
more exotic groupings and breakdowns.

Requirements for QM-search are being gleaned from focus groups being
conducted at Emory (preliminary results will be shared), and development
is being conducted as a high-level layer atop the excellent Lucene open
source search engine project.

—Aaron Krowne Head of Digital Library Research Emory University General Libraries President and Founder, Office: 404-712-2810 Cell: 404-405-5766

Connecting Everything with unAPI and OPA

Dan Chudnov

unAPI is a simple-to-use, simple-to-implement API for web sites that allows rich object access and can be easily layered over existing services like Atom, OpenSearch, OAI-PMH, or SRU. OPA is a general-purpose identifier resolver that wraps API calls to heavily-used but incompatible web services like those from Amazon, Flickr, and Pubmed.

Together they will do the same thing we do every code4libcon – try to take over the world!

[Update 2006-02-28: Slides are here, in pdf.]

What Blog Applications Can Teach Us About Library Software Architecture

Casey Bisson

The number of programmers in the library world is growing and our individual efforts have shown great promise, but they exist largely as a spectacle that few libraries can enjoy. We need better means to aggregate our efforts and share solutions that can be employed by libraries without programming staff.

Looking outside libraries, we see some interesting examples in the blog world. The blog world is growing with new bloggers every day, but the most interesting aspect is how many people with limited technical skills are using (maintaining and configuring) blog applications like WordPress or Moveable Type, and how quickly the contributions of the many plugin and theme developers are implemented on those blogs. What lessons can we learn from this and how might a library application built from those lessons work? Are some software architectures better at leveraging the network effects of the growing number of developers in our community than others?

I’m working on a project that attempts to answer those questions and I hope to release a public beta shortly (update: it's WPopac, online now). I’d like to demo it and ask for participation.

Update: slides posted.

Casey Bisson
E-Learning Application Developer
Plymouth State University
Plymouth, New Hampshire

Voting on Code4Lib 2006 Presentation Proposals

Vote for the Code4lib 2006 presentations!

Please log in to participate in voting!

You may choose up to 11 proposals.

Voting closes at January 9th 11PM EST.

The 11 proposals with the most votes win. In case of a tie, we will have a "run off" election tomorrow, January 10th at 5PM - 11PM EST.

I will be deleting all votes cast before 5PM EST unless you specifically tell me that you have to vote early. So, be sure to tell me. Seriously. Send an email to

Happy voting!

one more week for proposals

The deadline for proposing a talk has been extended one week to Jan. 9. We've already got several very cool proposals but more is better, and there's plenty of room for spillover in the lightning sessions and breakouts.

To submit, read this first and then follow these instructions. Especially the 75-words-max part.

Registration is Open

Registration for Code4lib 2006 is OPEN Register early for a discount. Don't hesitate, or wait, or be late...register today!

Code4lib 2006 Call For Proposals

Call for proposals - Code4lib 2006

We are now accepting proposals for prepared talks for Code4lib 2006. Code4lib 2006 is a loosely structured conference for library technologists to commune, gather/create/share ideas and software, be inspired, and forge collaborations. It is also an outgrowth of the Access HackFest, wrapped into a conference-ish format. It is *the* event for technologists building digital libraries and digital information systems, tools, and software.

At least six time slots will be available for prepared talks. We will choose from among the proposals based on diversity of topics, usefulness, wow factor, and potential impact.

Proposals of 75 words or less are being accepted for review now. Please send your name, email address, and proposal to code4libcon at

We cannot accept every prepared talk proposal, but multiple lightning talk sessions will provide everyone who wishes to present with ample opportunity to show off. The proposal deadline is 5pm EST January 9, 2006, and proposers will be notified by Midnight January 9, 2006.

Prepared Talk Information

Prepared talks are 20 minutes, and must center on "tools" (some cool new software, software library or integration platform), "specs" (how to get the most out of some protocols, or proposals for new ones), or "challenges" (One or more big problems we should collectively address). We will evaluate proposals on criteria of usefulness, newness, geekiness, and diversity of topics.

Again, proposals should be sent to code4libcon at

code4lib Conference 2006: Schedule


Wednesday, February 15

08:30-09:30 - Registration / Continental Breakfast
09:30-09:45 - Intros, Housekeeping
09:45-10:30 - A virtual keynote with the Evergreen development team of the PINES consortium in Georgia
10:30-10:45 - Break
10:45-11:05 - ERP Options in an OSS World - Art Rhyno
11:05-11:25 - Connecting Everything with unAPI and OPA - Dan Chudnov
11:25-11:45 - WikiD - Jeff Young
11:45-13:00 - Lunch
13:00-13:20 - Lipstick on a Pig: 7 Ways to Improve the Sex Life of your OPAC - Jim Robertson
13:20-13:40 - AHAH: When Good is Better than Best - Casey Durfee
13:40-14:00 - Standards, Reusability and the Mating Habits of Learning Content - Robby Robson
14:00-14:15 - Break
14:15-15:45 - Breakout Sessions
15:45-16:00 - Break
16:00-17:15 - Lightning Talks
17:15-17:30 - Breakout Session Reports
18:30-??? - Dinner/Socializing (American Dream Pizza and Crowbar)

Thursday, February 16

08:30-09:30 - Contintental breakfast
09:30-09:45 - Housekeeping
09:45-10:30 - 1,000 Lines of Code, and other topics from OCLC Research - Thom Hickey, Chief Scientist, OCLC
10:30-10:45 - Break
10:45-11:05 - Generating Recommendations in OPACs: Initial Results and Open Areas for Exploration - Colleen Whitney
11:05-11:25 - Library Text Mining - Robert Sanderson
11:25-11:45 - Anatomy of aDORe - Ryan Chute
11:45-13:00 - Lunch (provided)
13:00-13:20 - Teaching the Library and Information Community How to Remix Information - Raymond Yee
13:20-13:40 - Two Paths to Interoperable Metadata - Devon Smith
13:40-14:00 - The Case for Code4Lib 501c(3) - Roy Tennant
14:00-14:15 - Break
14:15-15:45 - Breakout Sessions
15:45-16:00 - Break
16:00-17:15 - Lightning Talks 2
18:30-??? - Dinner/Socializing (Fox and Firkin)

Friday, February 17

08:30-09:30 - Continental Breakfast
09:30-09:45 - Housekeeping
09:45-10:05 - Quality Metrics - Aaron Krowne
10:05-10:25 - Practical Aspects of Implementing Open Source in Armenia - Tigran Zargaryan
10:25-10:45 - What Blog Applications Can Teach Us About Library Software Architecture - Casey Bisson
10:45-11:00 - Break
11:00-11:20 - Breakout Reports
11:20-11:45 - Lightning Talks 3
11:45-12:00 - Conference Awards and Wrap-Up

code4lib 2006

code4lib 2006 is...

  • a loosely structured camp/conference for library technologists to commune, gather/create/share ideas and software, be inspired, and forge collaborations
  • an outgrowth of the Access HackFest, wrapped into a conference-ish format by the folks in #code4lib
  • being planned using an ancient asynchronous messaging protocol and this site
  • open to all interested people/organizations for both planning and participation
15-17 Feb 2006
LaSells Stewart Center, Oregon State University, Corvallis, Oregon [map]

Free wireless nets are available at both the conference venue and the hotel

Conference Schedule
Presentations are being added to the Conference Schedule as we receive them.
The final list of Lightning Talks and Breakout Sessions are also available.
$125 (goes up to $175 two weeks prior to the conference)

[Registration is closed]


We have heard that the conference hotel is now booked full. There are quite a few alternative hotels in Corvallis. One recommended hotel is the Holiday Inn Express, which is located approximately a mile from the conference site. It also provides free wi-fi access.

The conference hotel is the Hilton Garden Inn, located across the street from the LaSells Stewart Center on the OSU Campus. The discounted room rate is $84/night plus tax. To make a reservation, please call 541-752-5000. Please make your reservations by January 14, 2006 in order to ensure that you receive the discounted rate and let the hotel operator know that you are with code4lib 2006.
NOTE: You must make reservations to use the shuttle services. Reservations may be made by visiting the appropriate shuttle URL below

Fly into either:

  • Portland Airport (PDX) — There is shuttle service for $43 each way, direct to the hotel. (Choose the 'Corvallis' option from the shuttle website for the shuttle schedule) (Some people have reported that the web site reservation didn't work, and they had to call through.)
  • Eugene Airport (EUG) — shuttle options
  • There is some carpooling
Types of Activites and Programs
  • Keynotes - 45 minutes, talks from the farthernmost reaches of library software development where significant real work is getting done. To provide an opportunity to see where they are up close, to learn details of their implementation choices, to hear about the challenges they're facing and how those might scale/translate to the rest of us.
  • Prepared talks - 20 minutes, which must center on "tools" (some kick-ass new software library or integration platform), "specs" (how to get the most out of some protocols, or proposals for new ones), or "challenges" (Some Big Problem we should collectively address). We will evaluate proposals on criteria of geekiness, usefulness, newness, and diversity of topics.
  • Lightning talks - 5 minutes, those interested sign up on a wiki that opens at the start of the conference. *Hard* 5 minute limit... at 4 minutes the next person gets up and starts attaching their screen, and somebody keeps a hard count.
  • Breakouts - People interested in the same project/problem can hang out in a space together for 60-90 minute blocks. Somebody near to the project/problem can coordinate discussion, set up hackers, tackle a challenge session topic, etc. Larger groups will get separate space, smaller groups can just spread out in the larger conference room space.
Planning Group
Daniel Chudnov, Edward Corrado, Andrew Forman, Jeremy Frumkin, Brad LaJeunesse, Art Rhyno, Ross Singer, Edward Summers, Roy Tennant. If you want to get involved please join and ping the email discusion list.



Subscribe to RSS - code4lib 2006