You are here

metadata

"data about data" -- like that helps!

Solr-powered Buddhist texts?

I'll be leading a Solr workshop this February, in Taiwan, for the EBTI/CBETA Conference. This is a synergy of several interests of mine: library-related findability, Asian culture, and Buddhist philosophy. I'm particularly encouraged by the Integrated Buddhist Archives (IBA) proposal - seems like a perfect fit for Collex.

OLA Super Conference - Collex and Blacklight

I'm honored to be speaking at the upcoming OLA Super Conference 2008. I'm presenting "Collex: Collecting and exhibiting scholarly materials" (session #406, Thursday January 31, 10:40am) and " Blacklight: the University of Virginia's Catalog on Solr" (session #1203, Friday, February 1, 2:10pm). I'm looking forward to demonstrating these projects and sharing their latest news.

OLA Super Conference 2008 banner

A Metadata Registry

Jon Phipps - NSDL Metadata Registry, Cornell University Libraries

Request for Proposals: Innovative Uses of CrossRef Metadata

CrossRef is now accepting proposals for the most original and promising use of CrossRef's article-level metadata as part of a scholarly research/authoring tool or related library service. Free access to participating publishers' metadata and DOIs will be granted to the researcher or librarian whose proposal is selected.

Deadline for proposals: July 15, 2007

What to submit:

  • A short but thorough description of the service you would like to
    create using the CrossRef metadata, including estimated implementation time
  • Links to software/services/protocols/standards that you may have
    implemented in the past
  • Your CV

To whom:

  • Amy Brand, PhD, Director of Business & Product Development
    email: abrand at crossref dot org

More details available at CrossRef

Background on Crossref Metadata and Web Services:

CrossRef is an independent membership association, founded and directed by publishers. CrossRef helps connect users to primary research content by enabling publishers to work collectively. Our citation-linking network today covers over 27 million articles and other content items from several hundred scholarly and professional publishers.

CrossRef Web Services, or CWS, offers an easy-to-use suite of tools for authorized partners to collect metadata on a cross-publisher basis to streamline their own crawling, indexing, and linking services. One way to access the 27 million+ metadata records now registered in CrossRef is via our OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) interface. The CrossRef OAI-PMH repository interface serves as the central point for the distribution of metadata from participating publishers, utilizing a robust and widely adopted technology targeted at consumers of large quantities of metadata. Access to the CrossRef’s metadata repository is controlled by IP authentication and can be tailored to provide specific content from select publishers to each authorized recipient.

The metadata in CrossRef’s database consists of basic bibliographic information for each item, including author(s), title, journal name, ISSN, volume, issue, page, and DOI-URL pair. Here are links to some sample CWS metadata: http://www.crossref.org/oai-handler_sample.xml, http://www.crossref.org/oai-pmh_sample2.xml.

CrossRef's data repository is extensive but highly variable in quality from publisher to publisher. They have recently begun to take measures to ensure higher quality metadata. Any suggestions you may have regarding how best to achieve a higher level of data quality and consistency, as part of your proposal or otherwise, would be greatly appreciated. Different but similarly basic metadata is collected for other types of content. In addition, many publishers now also deposit references as part of their metadata.

Topic: 

Stone Soup

(my first code4lib post)

Ed Summers and William Groppe have jumped in head-strong to crafting a Ruby DSL to Solr. solrb is coming along nicely thanks to our collaborations.

Another early adopter asked a question on the solr-user e-mail list, and I replied with a lot of juicy tidbits to whet your appetite.

We're moving as fast as we can in order to use this infrastructure for the basis of Solr Flare, destined to be faceted browsing plugins for Rails allowing your Rails application to easily benefit from what Solr and solrb offer. Solr Flare will debut at the code4lib pre-conference event in whatever form it happens to be in by then.

barton data

In light of the change in the README file at MIT:

What happened to the data?
--------------------------

We are currently evaluating legal issues about ownership and licensing possibilities
and hope to be able to be back online with it soon. Wish us luck.

the torrent has been temporarily (hopefully) disabled. Thanks to MIT for continuing to investigate how to make large bibliographic data sets available to the general public.

Topic: 

MIT Catalog Data

The Simile folks made MARC, MODS and RDF versions of Barton (the MIT Library Catalog) available as a test data set.

To show code4lib's support for Simile and open-library-data we've set up some torrents for the data.

Please help us seed this data far and wide.

Topic: 

unAPI version 0

Attached is the first version of the unAPI spec. Excerpting:

Topic: 

conference schedule in hCalendar

Subscribe to RSS - metadata