conferences

You know, when people get together and talk about stuff.

Library Text Mining

Rob Sanderson

Using the TeraGrid[1] and the SRB DataGrid[2], we have sufficient
computational and storage facilities to run processing tasks that would
normally be prohibitively expensive. By integrating text and data mining
tools[3][4] within the Cheshire3[5] information architecture, we can
parse the natural language present in 20 million MARC records (the
University of California’s MELVYL collection) and extract information to
provide to search/retrieve applications. In this talk, we’ll discuss
the results of applying new techniques to ‘old’ data.

1: http://www.teragrid.org
2: http://www.sdsc.edu/srb
3: http://www.ailab.si/orange
4: http://www-tsujii.is.s.u-tokyo.ac.jp/
5: http://www.cheshire3.org/

Rob Sanderson (azaroth@liv.ac.uk)

Standards, Reusability, and the Mating Habits of Learning Content

Robby Robson

Digital libraries are supposed to foster reuse of digital content, but it is hard to combine content from different sources. We are building prototype software that (1) converts different types of courseware to an XML interchange format based on OpenDocument and other specs/standards, (2) enables the content to be disaggregated, recombined, re-styled, and endowed with SCORM reporting behaviors, and (3) realizes instructional design through the use of SCORM (or IMS) Simple Sequencing. I will demo and discuss the software, and am happy to talk about the bigger picture of reusability in educational digital libraries and standards if given a longer slot.

Robby Robson, robby@computer.org

Anatomy of aDORe

Ryan Chute

The aDORe Archive is a write-once/read-many storage approach for Digital Objects and their constituent datastreams. First, XML-based representations of multiple Digital Objects are concatenated into a single, valid XML file named an XMLtape. Second, ARC files, as introduced by the Internet Archive, are used to contain the constituent datastreams of the Digital Objects. The software was developed by the LANL Digital Library Research & Prototyping Team and is available under the GNU LGPL license.
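
To make the first step concrete, here is a minimal sketch of wrapping several XML representations in one well-formed file. It is not the actual XMLtape format, which the abstract does not spell out: the `tape` and `record` element names, the `build_tape` function, and the sample objects are all hypothetical, chosen only for illustration.

```python
import xml.etree.ElementTree as ET


def build_tape(object_docs):
    """Wrap several XML documents (given as strings) in a single root element."""
    tape = ET.Element("tape")  # hypothetical root element name
    for doc in object_docs:
        record = ET.SubElement(tape, "record")  # hypothetical per-object wrapper
        record.append(ET.fromstring(doc))  # parse each object and attach it
    return ET.tostring(tape, encoding="unicode")


# Two made-up Digital Object representations.
objects = [
    '<object id="a"><title>First</title></object>',
    '<object id="b"><title>Second</title></object>',
]

tape_xml = build_tape(objects)

# The result parses as one document that contains both objects.
parsed = ET.fromstring(tape_xml)
ids = [obj.get("id") for obj in parsed.iter("object")]
print(ids)
```

Because every object sits inside its own wrapper element, the combined file stays parseable as a whole while each object can still be pulled out on its own.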

—
Ryan Chute
Los Alamos National Laboratory, Research Library

Quality Metrics

Aaron Krowne

This talk will discuss the core development activities of the “Quality
Metrics” project at Emory’s Woodruff Library. This project is being
conducted under an IMLS grant to research requirements for and build
a working prototype digital library search system.

What is new in this project is that it truly generalizes and
integrates explicit and latent quality indicators, which allow
users to ascertain the fitness of digital library resources. Most
search engine components have only one indicator: content-query
similarity (“relevance”). Google has only two, adding PageRank to
relevance. Our system, QM-search, will support an unlimited number of
indicators, which will be customizable by the digital librarian for the
target community and collections, and even customizable from user to
user or search to search.

Some basic examples of quality indicators that digital libraries might
be able to exploit would be activations (views online or check-outs in
circulation), selection (compilation in “bookmark” lists online or
additions to course reserves lists), extent of review (from a peer-
reviewed journal, conference, or not?), or citation-based metrics.
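
As a rough illustration of how several indicators might be combined and re-weighted per community or per search, here is a minimal sketch. It assumes a plain weighted sum over indicator values normalized to the range 0 to 1; the abstract does not say how QM-search actually combines indicators, and the names `combined_score`, `rank`, and the sample data are hypothetical.

```python
def combined_score(indicators, weights):
    """Weighted sum of indicator values; indicators without a weight count as 0."""
    return sum(weights.get(name, 0.0) * value for name, value in indicators.items())


def rank(results, weights):
    """Order results from highest to lowest combined score."""
    return sorted(
        results,
        key=lambda r: combined_score(r["indicators"], weights),
        reverse=True,
    )


# Made-up results with three indicators each, all normalized to 0..1.
results = [
    {"id": "doc1", "indicators": {"relevance": 0.9, "views": 0.2, "peer_reviewed": 0.0}},
    {"id": "doc2", "indicators": {"relevance": 0.7, "views": 0.8, "peer_reviewed": 1.0}},
]

# Two hypothetical weighting profiles a digital librarian might configure.
relevance_only = {"relevance": 1.0}
research_profile = {"relevance": 0.5, "views": 0.1, "peer_reviewed": 0.4}

by_relevance = [r["id"] for r in rank(results, relevance_only)]
by_research = [r["id"] for r in rank(results, research_profile)]
print(by_relevance, by_research)
```

With relevance alone, doc1 ranks first; under the second profile, the peer-review and view indicators move doc2 ahead. Swapping the weights dictionary is all it takes to change the ordering from one user or search to the next.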

The output of QM-search will be in a completely generalized XML format,
with the search results represented as a structure based on the
structure specified in the input “organization spec”. This XML output
can be transformed into presentation HTML resembling anything from a
“linear” Google-like search results list to an A9-like column display to
more exotic groupings and breakdowns.

Requirements for QM-search are being gleaned from focus groups being
conducted at Emory (preliminary results will be shared), and development
is being conducted as a high-level layer atop the excellent Lucene open
source search engine project.

Aaron Krowne
Head of Digital Library Research
Emory University General Libraries
President and Founder, PlanetMath.org
Office: 404-712-2810
Cell: 404-405-5766
akrowne@emory.edu

Connecting Everything with unAPI and OPA

Dan Chudnov

unAPI is a simple-to-use, simple-to-implement API for web sites that allows rich object access and can be easily layered over existing services like Atom, OpenSearch, OAI-PMH, or SRU. OPA is a general-purpose identifier resolver that wraps API calls to heavily-used but incompatible web services like those from Amazon, Flickr, and Pubmed.

Together they will do the same thing we do every code4libcon – try to take over the world!

[Update 2006-02-28: Slides are here, in pdf.]

What Blog Applications Can Teach Us About Library Software Architecture

Casey Bisson

The number of programmers in the library world is growing and our individual efforts have shown great promise, but they exist largely as a spectacle that few libraries can enjoy. We need better means to aggregate our efforts and share solutions that can be employed by libraries without programming staff.

Looking outside libraries, we see some interesting examples in the blog world. The blog world is growing with new bloggers every day, but the most interesting aspect is how many people with limited technical skills are using (maintaining and configuring) blog applications like WordPress or Movable Type, and how quickly the contributions of the many plugin and theme developers are implemented on those blogs. What lessons can we learn from this and how might a library application built from those lessons work? Are some software architectures better at leveraging the network effects of the growing number of developers in our community than others?

I’m working on a project that attempts to answer those questions and I hope to release a public beta shortly (update: it's WPopac, online now). I’d like to demo it and ask for participation.

Update: slides posted.

Casey Bisson
E-Learning Application Developer
Plymouth State University
Plymouth, New Hampshire
http://oz.plymouth.edu/~cbisson/

Voting on Code4Lib 2006 Presentation Proposals

Vote for the Code4lib 2006 presentations!

You may choose up to 11 proposals.

Voting closes on January 9th at 11PM EST.

The 11 proposals with the most votes win. In case of a tie, we will have a "run off" election tomorrow, January 10th, from 5PM to 11PM EST.

I will be deleting all votes cast before 5PM EST unless you specifically tell me that you have to vote early. So, be sure to tell me. Seriously. Send an email to ross.singer@library.gatech.edu.

Happy voting!

code4lib card

If you have a blog please consider adding a playing card to your site:

  • http://www.code4lib.org/cards/code4lib1.png
  • http://www.code4lib.org/cards/code4lib2.png

You can use the following HTML:

<a href="http://code4lib.org/">
<img src="http://code4lib.org/cards/code4lib1.png" 
  border="0" 
  alt="code4lib" 
  title="code4lib"/>
</a>

Feel free to use the images directly from code4lib.org if you don't want to go through the effort of grabbing them. This will allow us to secretly discover your site by trawling our logs.

Having the playing card can let people know about code4lib.org who don't know about it already. For examples of how to use them check out roy or dchud.

Registration is Open

Registration for Code4lib 2006 is OPEN. Register early for a discount. Don't hesitate, or wait, or be late...register today!

Code4lib 2006 Call For Proposals

Call for proposals - Code4lib 2006

We are now accepting proposals for prepared talks for Code4lib 2006. Code4lib 2006 is a loosely structured conference for library technologists to commune, gather/create/share ideas and software, be inspired, and forge collaborations. It is also an outgrowth of the Access HackFest, wrapped into a conference-ish format. It is *the* event for technologists building digital libraries and digital information systems, tools, and software.

At least six time slots will be available for prepared talks. We will choose from among the proposals based on diversity of topics, usefulness, wow factor, and potential impact.

Proposals of 75 words or less are being accepted for review now. Please send your name, email address, and proposal to code4libcon at lists.gatech.edu.

We cannot accept every prepared talk proposal, but multiple lightning talk sessions will provide everyone who wishes to present with ample opportunity to show off. The proposal deadline is 5pm EST January 9, 2006, and proposers will be notified by Midnight January 9, 2006.

Prepared Talk Information

Prepared talks are 20 minutes, and must center on "tools" (some cool new software, software library or integration platform), "specs" (how to get the most out of some protocols, or proposals for new ones), or "challenges" (one or more big problems we should collectively address). We will evaluate proposals on criteria of usefulness, newness, geekiness, and diversity of topics.

Again, proposals should be sent to code4libcon at lists.gatech.edu.
