You are here

code4lib 2007

Anything having to do with the code4lib 2007 conference to be held in Athens, Georgia, Feb 28 through Mar 2.

MyResearch Portal: An XML based Catalog-Independent OPAC

Andrew Nagy

Villanova University’s Falvey Memorial Library has longed for a beautiful pig; however, we determined in early 2006 that pigs were only good at searching for truffles, so we decided to build our own OPAC.

After developing our own custom Digital Library from a Native XML Database, we quickly appreciated the ease of development with XQuery and XSLT. We then launched full speed ahead into the development of a new OPAC from scratch using XML technologies and MARCXML.

This presentation will describe the process of choosing an NXDB and optimizing it for large data set performance. Developing searches that take about 2 minutes to process and optimizing them down to about 2 seconds. I will also describe the development processes of the OPAC interface including the AJAX features we have implemented. I will share our success stories and our failures.


MyResearch Portal: An XML based Catalog-Independent OPAC (PPT - 5MB)

Application Preview
You can preview our SOLRized OPAC: MyResearch Portal


Video on Google
Download Video (Mpeg-4, 50.4 Mb. MSU Mirror)
Download Audio Only (MP3, 21.5MB, MSU Mirror)

The BibApp

Eric Larson, Nate Vack

What has your repository done for you lately? The Rails-powered BibApp is the ultimate mashup of article databases, citation management software, and digital repositories. To date, it's helped us add over 2000 articles to our repository by organizing citations, identifying articles we can archive, and generating DSpace import packages. It also lets us analyze research produced on campus, illustrating popular research topics, departmental affiliations, and collaborative relationships between researchers.

View Video

Watch at Google
Download Video (MP4, 49.4 MB, MSU Mirror)
Download Audio Only (MP3, 10.9MB, MSU Mirror)

Obstacles to Agility

Joan Starr

Academic digital libraries face serious challenges in trying to adopt agile project management techniques. While there have always been significant differences between how an academic and a corporate team might solve a problem, today, digital library and corporate offerings are often in direct competition. Time-to-market is more important than ever before. This talk will identify the most troublesome characteristics of academic bureaucracies and make suggestions for working around these obstacles.


Watch Video

Watch on Google
Download Video (MP4, 45.4 Mb, MSU Mirror)
Download Audio Only (MP3, 10MB, MSU Mirror)

Open-Source Endeca in 250 Lines or Less

Casey Durfee

I will detail how you can create an OPAC with features comparable to Endeca or AquaBrowser's search products (faceted browsing, relevancy ranking, fuzzy searching) using the open-source Apache Solr search engine and your favorite web programming language. I will present a catalog with most of Endeca's features in 250 lines of code or less and discuss performance/scalability concerns and common pitfalls when using Solr.

Presentation on the web

Watch Video

Watch at Google
Download Video (MP4, 43.2 Mb, MSU Mirror)
Download Audio Only (MP3, 8.9MB, MSU Mirror)


Bess Sadler, Tigran Zargaryan

Libraries in developing countries have difficulty implementing and supporting commercial ILS systems. Poor support for internationalized interfaces and expensive software licensing fees contribute to an increasingly unsupportable situation in libraries around the world. Electronic Information for Libraries ( is currently planning the development of "Library-in-a-Box," an open-source, fully internationalized integrated library system, designed to be easy to install and support, and with next-generation OPAC features like faceted browsing. Library-in-a-box will build on the work already done by evergreen and koha. This talk will discuss the current state and future plans of this project.

Watch Video

Watch on Google
Download Video (MP4, 42.9 Mb, MSU Mirror)
Download Audio Only (MP3, 9.5MB, MSU Mirror)

Library Data APIs Abound!

Richard Wallis

From Z39.50 to xISBN, they share the limitation of providing a single stream of data from a single source.

How to add value to data from one source with relevant data from another, and how do you orchestrate that interaction in a scalable way?

A review and practical demonstration of augmentation APIs and their orchestration in a way that would make those used to Unix Pipes principles, feel at home.


Presentation available here [PPS].

View Video

Watch on Google
Download Video - (MP4, 39.1 MB, MSU Mirror)
Download Audio Only - (MP3, 8.7MB, MSU Mirror)

Smart Subjects - Application Independent Subject Recommendations

Tito Sierra

The Smart Subjects tool attempts to increase broader user discovery of relevant library resources by serendipitously recommending library subjects related to a user's search query. The prototype tool uses large locally created subject indexes consisting of rich topical keyword content harvested from local sources. An OpenSearch interface allows this recommendation service to be integrated flexibly and easily in a variety of web applications.


Presentation [PPT - 2.4Mb]

Watch Video

Video on Google
Download Video (MPEG-4, 40.6 Mb, MSU Mirror)
Download Audio Only (MP3, 18MB, MSU Mirror)

The XQuery Exposé: Practical Experiences from a Digital Library

Kevin Clarke

Princeton University has developed their current digital collections architecture around a native XML database and the XQuery programming language. This presentation will look at XQuery's strengths and weaknesses within the context of our current development environment as well as mention other environments in which an XQuery framework would (or, perhaps, wouldn't) work. What is XQuery? Why would one want to use it? Doesn't it do the same thing as XSLT? How does fulltext searching work within, or in conjunction with, XQuery? and Will XQuery be flexible enough to adjust to changes in our backend system? are all questions that will be posed and discussed.

Slides available in HTML

View Video

Watch on Google
Download Video (MP4, 15.6 MB, MSU Mirror)
Download Audio Only (MP3, 9.5MB, MSU Mirror)

Get Groovy at Your Public Library

Amy Begg De Groff and Luis Salazar

Howard County Library (a public library system in Central Maryland) deployed an open source, Ubuntu-based system on nearly 300 computers in 6 locations.

The result..
nearly 300 computers offer customers the following:
word processing (Open Office)
web surfing (Opera and Firefox)
music and video (Real Player and MPlayer)
and communication or Instant Messaging (GAIM)

The tools..

5-7 year old Dells computers
a vendor sent right from heaven (Open Sense Solutions)
willing and carefully educated front of house staff
open minded and empowered customers
and talented, talented, talented technical staff

The cost..
$25.00 for software
$2000.00 for support
$25,000 in memory upgrades

Watch Video

Watch at Google
Download Video - (MP4, 47.9 MB, MSU Mirror)
Download Audio Only (MP3, 10.4MB, MSU Mirror)

Atom Publishing Protocol Primer

Ed Summers

The Atom Publishing Protocol is an HTTP based protocol for publishing and editing Web resources. It has direct relevance for libraries and archives that are increasingly interested in building repositories of content on the web. In this presentation I will cover why the protocol was created, how it is being developed, and how to build out and exercise a simple APP application.

[slides] [code]

Watch Video

Watch on Google
Download Video (MP4, 46.3 Mb, MSU Mirror)
Download Audio Only (MP3, 10.1MB, MSU Mirror)


Subscribe to RSS - code4lib 2007