Feed aggregator

Library of Congress: The Signal: Collecting and Preserving Digital Art: Interview with Richard Rinehart and Jon Ippolito

planet code4lib - 4 hours 14 min ago

Jon Ippolito, Professor of New Media at the University of Maine

As artists have embraced a range of new media and forms over the last century, the work of collecting, conserving and exhibiting these works has become increasingly complex and challenging. In this space, Richard Rinehart and Jon Ippolito have been working to develop and understand approaches to ensure long-term access to digital works. In this installment of our Insights Interview Series I discuss Richard and Jon’s new book, “Re-collection: Art, New Media, and Social Memory.” The book offers an articulation of their variable media approach to thinking about works of art. I am excited to take this opportunity to explore the issues the book raises about digital art in particular and a perspective on digital preservation and social memory more broadly.

Trevor: The book takes a rather broad view of “new media”; everything from works made of rubber, to CDs, art installations made of branches, arrangements of lighting, commercial video games and hacked variations of video games. For those unfamiliar with your work more broadly, could you tell us a bit about your perspective on how these hang together as new media? Further, given that the focus of our audience is digital preservation, could you give us a bit of context for what value thinking about various forms of non-digital variable new media art offer us for understanding digital works?

Richard Rinehart, Director of the Samek Art Museum at Bucknell University.

Richard: Our book does focus on the more precise and readily-understood definition of new media art as artworks that rely on digital electronic computation as essential and inextricable. The way we frame it is that these works are at the center of our discussion, but we also discuss works that exist at the periphery of this definition. For instance, many digital artworks are hybrid digital/physical works (e.g., robotic works) and so the discussion cannot be entirely contained in the bitstream.

We also discuss other non-traditional art forms–performance art, installation art–that are not as new as “new media” but are also not that old in the history of museum collecting. It is important to put digital art preservation in an historical context, but also some of the preservation challenges presented by these works are shared with and provide precedents for digital art. These precedents allow us to tap into previous solutions or at least a history of discussion around them that could inform or aid in preserving digital art. And, vice versa, solutions for preserving digital art may aid in preserving these other forms (not least of which is shifting museum practices). Lastly, we bring non-digital (but still non-traditional) art forms into the discussion because some of the preservation issues are technological and media-based (in which case digital is distinct) but some issues are also artistic and theoretical, and these issues are not necessarily limited to digital works.

Jon: Yeah, we felt digital preservation needed a broader lens. The recorded culture of the 20th century–celluloid, vinyl LPs, slides–is a historical anomaly that’s a misleading precedent for preserving digital artifacts. Computer scientist Jeff Rothenberg argues that even JPEGs and PDF documents are best thought of as applications that must be “run” to be accessed and shared. We should be looking at paradigms that are more contingent than static files if we want to forecast the needs of 21st-century heritage.

Casting a wider net can also help preservationists jettison our culture’s implicit metaphor of stony durability in favor of one of fluid adaptability. Think of a human record that has endured and most of us picture a chiseled slab of granite in the British Museum–even though oral histories in the Amazon and elsewhere have endured far longer. Indeed, Dragan Espenschied has pointed out cases in which clay tablets have survived longer than stone because of their adaptability: they were baked as is into new buildings, while the original carvings on stones were chiseled off to accommodate new inscriptions. So Richard and I believe digital preservationists can learn from media that thrive by reinterpretation and reuse.

Trevor: The book presents technology, institutions and law as three sources of problems for the conservation of variable media art and potentially as three sources of possible solutions. Briefly, what do you see as the most significant challenges and opportunities in these three areas? Further, are there any other areas you considered incorporating but ended up leaving out?

Jon: From technology, the biggest threat is how the feverish marketing of our techno-utopia masks the industry’s planned obsolescence. We can combat this by assigning every file on our hard drives and gadget on our shelves a presumptive lifespan, and leaving room in our budgets to replace them once that expiration date passes.

From institutions, the biggest threat is that their fear of losing authenticity gets in the way of harnessing less controllable forms of cultural perseverance such as proliferative preservation. Instead of concentrating on the end products of culture, they should be nurturing the communities where it is birthed and finds meaning.

From the law, the threat is DRM, the DMCA, and other mechanisms that cut access to copyrighted works–for unlike analog artifacts, bits must be accessed frequently and openly to survive. Lawyers and rights holders should be looking beyond the simplistic dichotomy of copyright lockdown versus “information wants to be free” and toward models in which information requires care, as is the case for sacred knowledge in many indigenous cultures.

Other areas? Any in which innovative strategies of social memory are dismissed because of the desire to control–either out of greed (“we can make a buck off this!”) or fear (“culture will evaporate without priests to guard it!”).

Trevor: One of the central concepts early in the book is “social memory”; in fact, the term makes its way into the title of the book. Given its centrality, could you briefly explain the concept and discuss some of the ways this framework for thinking about the past changes or upsets other theoretical perspectives on history and memory that underpin work in preservation and conservation?

Richard: Social memory is the long-term memory of societies. It’s how civilizations persist from year to year or century to century. It’s one of the core functions of museums and libraries and the purpose of preservation. It might alternately be called “cultural heritage,” patrimony, etc. But the specific concept of social memory is useful for the purpose of our book because there is a body of literature around it and because it positions this function as an active social dynamic rather than a passive state (cultural heritage, for instance, sounds pretty frozen). It was important to understand social memory as a series of actions that take place in the real world every day as that then helps us to make museum and preservation practices tangible and tractable.

The reason to bring up social memory in the first place is to gain a bit of distance on the problem of preserving digital art. Digital preservation is so urgent that most discussions (perhaps rightfully) leap right to technical issues and problem-solving. But, in order to effect the necessary large-scale and long-term changes in, say, museum practices, standards and policies we need to understand the larger context and historic assumptions behind current practices. Museums (and every cultural heritage institution) are not just stubborn; they do things a certain way for a reason. To convince them to change, we cannot just point at ad-hoc cases and technical problematics; we have to tie it to their core mission: social memory. The other reason to frame it this way is that new media really are challenging the functions of social memory; not just in museums, but across the board and here’s one level in which we can relate and share solutions.

These are some ways in which the social memory framework allows us to approach preservation differently in the book, but here’s another, more specific one. We propose that social memory takes two forms: formal/canonical/institutional memory and informal/folkloric/personal memory (and every shade in between). We then suggest how the preservation of digital art may be aided by BOTH social memory functions.

Trevor: Many of the examples in the book focus on boundary-breaking installation art, like Flavin’s work with lighting, and conceptual art, like Nam June Paik’s work with televisions and signals, or Cory Arcangel’s interventions on Nintendo cartridges. Given that these works push the boundaries of their mediums, or focus in depth on some of the technical and physical properties of their mediums do you feel like lessons learned from them apply directly to seemingly more standardized and conventional works in new media? For instance, mass produced game cartridges or Flash animations and videos? To what extent are lessons learned about works largely intended to be exhibited art in galleries and museums applicable to more everyday mass-produced and consumed works?

Richard: That’s a very interesting question and it speaks to our premise that preserving digital art is but one form of social memory and that lessons learned therein may benefit other areas. I often feel that preserving digital art is useful for other preservation efforts because it provides an extreme case. Artists (and the art world) ensure that their media creations are about as complex as you’ll likely find; not necessarily technically (although some are technically complex and there are other complexities introduced in their non-standard use of technologies) but because what artists do is to complicate the work at every level–conceptually, phenomenologically, socially, technically; they think very specifically about the relationship between media and meaning and then they manifest those ideas in the digital object.

I fully understand that preserving artworks does not mean trying to capture or preserve the meaning of those objects (an impossible task) but these considerations must come into play when preserving art even at a material level; especially in fungible digital media. So, for just one example, preserving digital artworks will tell us a lot about HCI considerations that attend preserving other types of interactive digital objects.

Jon: Working in digital preservation also means being a bit of a futurist, especially in an age when the procession from medium to medium is so rapid and inexorable. And precisely because they play with the technical possibilities of media, today’s artists are often society’s earliest adopters. My 2006 book with Joline Blais, “At the Edge of Art,” is full of examples, whether how Google Earth came from Art+Com, Wikileaks from Antoni Muntadas, or gestural interfaces from Ben Fry and Casey Reas. Whether your metaphor for art is antennae (Ezra Pound) or antibodies (Blais), if you pay attention to artists you’ll get a sneak peek over the horizon.

Trevor: Richard suggests that the key to digital media is variability, not fixity, and that conservators should move away from “outdated notions of fixity.” Given the importance of the concept of fixity in digital preservation circles, could you unpack this a bit for us? While digital objects do indeed execute and perform, the fact that I can run a fixity check and confirm that this copy of the digital object is identical to what it was before seems to be an incredibly powerful and useful component of ensuring long-term access to them. Given that, based on the nature of digital objects, we can actually ensure fixity in a way we never could with analog artifacts, this idea of distancing ourselves from fixity seemed strange.

Richard: You hit the nail on the head with that last sentence; and we’re hitting a little bit of a semantic wall here as well–fixity as used in computer science and certain digital preservation circles does not quite have the same meaning as when used in lay text or in the context of traditional object-based museum preservation. I was using fixity in the latter sense (as the first book on this topic, we wrote for a lay audience and across professional fields as much as possible). Your last thought compares “fixity” checks on one class of analog media (electronic, reproducible: film, tape, or vinyl) with those on digital media, but in the book I was comparing fixity as applied to a different class of analog objects (physical: marble, bronze, paint) with digital objects.

If we step back from the professional jargon for a moment, I would characterize the traditional museological preservation approach for oil paintings and bronze sculptures to be one based on fixity. The kind of digital authentication that you are talking about is more like the scientific concept of repeatability; a concept based on consistency and reproduction–the opposite of fixity! I think the approach we outline in the book is in opposition to fixity of the marble-bust variety (as inappropriate for digital media) but very much in line with fixity as digital authentication (as one tool for guiding and balancing a certain level of change with a certain level of integrity). Jon may disagree here–in fact we built these dynamics of agreement/disagreement into our book too.

Jon: I’d like to be as open-minded as Richard. But I can’t, because I pull my hair out every time I hear another minion of cultural heritage fixated on fixity. Sure, it’s nifty that each digital file has a unique cryptographic signature we can confirm after each migration. The best thing about checksums is that they are straightforward, and many preservation tools (and even some operating systems) already incorporate such checks by default. But this seems to me a tiny sliver of a far bigger digital preservation problem, and to blow it out of proportion is to perpetuate the myth that mathematical replication is cultural preservation.
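
To make that mechanics concrete, here is a minimal sketch (not from the book) of what such a fixity check amounts to in practice, in Python; the file name and the recorded digest are hypothetical, and a real repository would record these values in a manifest or database.

```python
import hashlib

def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

# Hypothetical digest recorded when the file entered the repository.
recorded = "9f86d081884c7d659a2feaa0c55ad015a3bf4f1b2b0b822cd15d6c15b0f00a08"

current = sha256_of("clockwork_orange_digitization.mkv")
if current == recorded:
    print("Fixity check passed: the bitstream is unchanged.")
else:
    print("Fixity check failed: this copy differs from the recorded version.")
```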

Two files with different passages of 1s and 0s automatically have different checksums but may still offer the same experience; for example, two copies of a digitized film may differ by a few frames but look identical to the human eye. The point of digitizing a Stanley Kubrick film isn’t to create a new mathematical artifact with its own unchanging properties, but to capture for future generations the experience we old timers had of watching his cinematic genius in celluloid. As a custodian of culture, my job isn’t to ensure my DVD of A Clockwork Orange is faithful to some technician’s choices when digitizing the film; it’s to ensure it’s faithful to Kubrick’s choices as a filmmaker.

Furthermore, there’s no guarantee that born-digital files with impeccable checksums will bear any relationship to the experience of an actual user. Engineer and preservationist Bruno Bachimont gives the example of an archivist who sets a Web spider loose on a website, only to have the website’s owners update it in the middle of the crawling process. (This happens more often than you might think.) Monthly checksums will give the archivist confidence that she’s archived that website, but in fact her WARC files do not correspond to any digital artifact that has ever existed in the real world. Her chimera is a perversion caused by the capturing process–like those smartphone panoramas of a dinner where the same waiter appears at both ends of the table.

As in nearly all storage-based solutions, fixity does little to help capture context.  We can run checksums on the Riverside “King Lear” till the cows come home, and it still won’t tell us that boys played women’s parts, or that Elizabethan actors spoke with rounded vowels that sound more like a contemporary American accent than the King’s English, or how each generation of performers has drawn on the previous for inspiration. Even on a manuscript level, a checksum will only validate one of many variations of a text that was in reality constantly mutating and evolving.

The context for software is a bit more cut-and-dried, and the professionals I know who use emulators like to have checksums to go with their disk images. But checksums don’t help us decide what resolution or pace they should run at, or what to do with past traces of previous interactions, or what other contemporaneous software currently taken for granted will need to be stored or emulated for a work to run in the future.

Finally, even emulation will only capture part of the behaviors necessary to reconstruct digital creations in the networked age, which can depend on custom interfaces, environmental data or networks. You can’t just go around checksumming wearable hardware or GPS receivers or Twitter networks; the software will have to mutate to accommodate future versions of those environments.

So for a curator to run regular tests on a movie’s fixity is like a zookeeper running regular tests on a tiger’s DNA. Just because the DNA tests the same doesn’t guarantee the tiger is healthy, and if you want the species to persist in the long term, you have to accept that the DNA of individuals is certainly going to change.

We need a more balanced approach. You want to fix a butterfly? Pin it to a wall. If you want to preserve a butterfly, support an ecosystem where it can live and evolve.

Trevor: The process of getting our ideas out on the page can often play a role in pushing them in new directions. Are there any things that you brought into working on the book that changed in the process of putting it together?

Richard: A book is certainly slow media; purposefully so. I think the main change I noticed was the ability to put our ideas around preservation practice into a larger context of institutional history and social memory functions. Our previous expressions in journal articles or conference presentations simply did not allow us time to do that and, as stated earlier, I feel that both are important in the full consideration of preservation.

Jon: When Richard first approached me about writing this book, I thought, well it’s gonna be pretty tedious because it seemed we would be writing mostly about our own projects. At the time I was only aware of a single emulation testbed in a museum, one software package for documenting opinions on future states of works, and no more conferences and cross-institutional initiatives on variable media preservation than I could count on one hand.

Fortunately, it took us long enough to get around to writing the book (I’ll take the blame for that) that we were able to discover and incorporate like-minded efforts cropping up across the institutional spectrum, from DOCAM and ZKM to Preserving Virtual Worlds and JSMESS. Even just learning how many art museums now incorporate something as straightforward as an artist’s questionnaire into their acquisition process! That was gratifying and led me to think we are all riding the crest of a wave that might bear the digital flotsam of today’s culture into the future.

Trevor: The book covers a lot of ground, focusing on a range of issues and offering myriad suggestions for how various stakeholders could play a role in ensuring access to variable media works into the future. In all of that, is there one message or issue in the work that you think is the most critical or central?

Richard: After expanding our ideas in a book, it’s difficult to come back to tweet format, but I’ll try…

Change will happen. Don’t resist it; use it, guide it. Let art breathe; it will tell you what it needs.

Jon: And don’t save documents in Microsoft Word.

Open Knowledge Foundation: Congratulations to the Panton Fellows 2013-2014

planet code4lib - 10 hours 18 min ago

Samuel Moore, Rosie Graves and Peter Kraker are the 2013-2014 Open Knowledge Panton Fellows – tasked with experimenting, exploring and promoting open practices through their research over the last twelve months. They just posted their final reports, so we’d like to heartily congratulate them on an excellent job and summarise their highlights for the Open Knowledge community.

Over the last two years the Panton Fellowships have supported five early-career researchers to further the aims of the Panton Principles for Open Data in Science alongside their day-to-day research. The provision of additional funding goes some way towards this aim, but a key benefit of the programme is boosting the visibility of the Fellows’ work within the open community and introducing them to like-minded researchers and others within the Open Knowledge network.

On stage at the Open Science Panel Vienna (Photo by FWF/APA-Fotoservice/Thomas Preiss)

Peter Kraker (full report) is a postdoctoral researcher at the Know-Centre in Graz and focused his fellowship work on two facets: open and transparent altmetrics and the promotion of open science in Austria and beyond. During his Fellowship Peter released the open source visualization Head Start, which gives scholars an overview of a research field based on relational information derived from altmetrics. Head Start continues to grow in functionality, has been incorporated into Open Knowledge Labs and is soon to be made available on a dedicated website funded by the fellowship.

Peter’s ultimate goal is to have an environment where everybody can create their own maps based on open knowledge and share them with the world. You are encouraged to contribute! In addition Peter has been highly active promoting open science, open access, altmetrics and reproducibility in Austria and beyond through events, presentations and prolific blogging, resulting in some great discussions generated on social media. He has also produced a German summary of open science activities every month and is currently involved in kick-starting a German-speaking open science group through the Austrian and German Open Knowledge local groups.

Rosie with an air quality monitor

Rosie Graves (full report) is a postdoctoral researcher at the University of Leicester and used her fellowship to develop an air quality sensing project in a primary school. This wasn’t always an easy ride: the sensor was successfully installed and an enthusiastic set of schoolchildren were on board, but a technical issue meant that data collection was cut short, so Rosie plans to resume in the New Year. Further collaborations on crowdsourcing and school involvement in atmospheric science were even more successful, including a pilot rain gauge measurement project and development of a cheap, open source air quality sensor which is sure to be of interest to other scientists around the Open Knowledge network and beyond. Rosie has enjoyed her Panton Fellowship year and was grateful for the support to pursue outreach and educational work:

“This fellowship has been a great opportunity for me to kick start a citizen science project … It also allowed me to attend conferences to discuss open data in air quality which received positive feedback from many colleagues.”

Samuel Moore (full report) is a doctoral researcher in the Centre for e-Research at King’s College London and successfully commissioned, crowdfunded and (nearly) published an open access book on open research data during his Panton Year: Issues in Open Research Data. The book is still in production but publication is due during November and we encourage everyone to take a look. This was a step towards addressing Sam’s assessment of the nascent state of open data in the humanities:

“The crucial thing now is to continue to reach out to the average researcher, highlighting the benefits that open data offers and ensuring that there is a stock of accessible resources offering practical advice to researchers on how to share their data.”

Another initiative Sam launched during the fellowship was the forthcoming Journal of Open Humanities Data with Ubiquity Press, which aims to incentivise data sharing through publication credit, in turn making data citable through the usual academic citation practices. Ultimately the journal will help researchers share their data, recommending repositories and best practices in the field, and will also help them track the impact of their data through citations and altmetrics.

We believe it is vital to provide early career researchers with support to try new open approaches to scholarship and hope other organisations will take similar concrete steps to demonstrate the benefits and challenges of open science through positive action.

Finally, we’d like to thank the Computer and Communications Industry Association (CCIA) for their generosity in funding the 2013-14 Panton Fellowships.

This blog post is a cross-post from the Open Science blog; see the original here.

Hydra Project: Sufia 4.2.0 released

planet code4lib - 12 hours 8 min ago

We are pleased to announce the release of Sufia 4.2.0.

This release of Sufia includes the ability to cache usage statistics in the application database, an accessibility fix, and a number of bug fixes. Thanks to Carolyn Cole, Michael Tribone, Adam Wead, Justin Coyne, and Mike Giarlo for their work on this release.

View the upgrade notes and a complete changelog on the release page: https://github.com/projecthydra/sufia/releases/tag/v4.2.0

LibUX: Who Uses Library Mobile Websites?

planet code4lib - 16 hours 30 min ago

Almost every American owns a cell phone. More than half use a smartphone and sleep with it next to the bed. How many do you think visit their library website on their phone, and what do they do there? Heads up: this one’s totally America-centric.

Who uses library mobile websites?

Almost one in five (18%) Americans ages 16-29 have used a mobile device to visit a public library’s website or access library resources in the past 12 months, compared with 12% of those ages 30 and older. – Younger Americans’ Library Habits and Expectations (2013)

If that seems anticlimactic, consider that just about every adult in the U.S. owns a cell phone, and almost every millennial in the country is using a smartphone. This is the demographic using library mobile websites, more than half of whom already have a library card.

In 2012, the Pew Internet and American Life Project found that library website users were often young, not poor, educated, and–maybe–moms or dads.

Those who are most likely to have visited library websites are parents of minors, women, those with college educations, those under age 50, and people living in households earning $75,000 or more.

This correlates with the demographics of smartphone owners for 2014.

What do they want?

This 2013 Pew report makes the point that while digital natives still really like print materials and the library as a physical space, a non-trivial number of them said that libraries should definitely move most library services online. Future-of-the-library blather is often painted in black and white, but it is naive to think physical–or even traditional–services are going away any time soon. Rather, there is already demand for complementary or analogous online services.

Literally. When asked, 45% of Americans ages 16 – 29 wanted “apps that would let them locate library materials within the library.” They also wanted a library-branded Redbox (44%), and an “app to access library services” (42%) – by app I am sure they mean a mobile-first, responsive web site. That’s what we mean here at #libux.

For more on this non-controversy, listen to our chat with Brian Pichman about web vs native.

Eons ago (2012), the non-mobile specific breakdown of library web activities looked like this:

  • 82% searched the catalog
  • 72% looked for hours, location, directions, etc.
  • 62% put items on hold
  • 51% renewed them
  • 48% were interested in events and programs – especially old people
  • 44% did research
  • 30% sought readers’ advisory (book reviews or recommendations)
  • 30% paid fines (yikes)
  • 27% signed-up for library programs and events
  • 6% reserved a room

Still, young Americans are way more invested in libraries coordinating more closely with schools, offering literacy programs, and being more comfortable (chart). They want libraries to continue to be present in the community, do good, and have hipster decor – coffee helps.

Webbification is broadly expected, but it isn’t exactly a kudos subject. Offering comparable online services is necessary, like it is necessary that MS Word lets you save work. A library that doesn’t offer complementary or analogous online services isn’t buggy so much as it is just incomplete.

Take this away

The emphasis on the library as a physical space shouldn’t be shocking. The opportunity for the library as a hyper-locale specifically reflecting its community’s temperament isn’t one to overlook, especially for as long as libraries tally success by circulation numbers and foot traffic. The whole library-without-walls cliche that went hand-in-hand with all that Web 2.0 stuff tried to show off the library as it could be in the cloud, but “the library as physical space” isn’t the same as “the library as disconnected space.” The tangibility of the library is a feature to be exploited both for atmosphere and web services. “Getting lost in the stacks” can and should be relegated to just something people say rather than something that actually happens.

The main reason for library web traffic has been and continues to be to find content (82%) and how to get it (72%).

Bullet points
  • Mobile first: The library catalog, as well as basic information about the library, must be optimized for mobile
  • Streamline transactions: placing and removing holds, checking out, paying fines. There is a lot of opportunity here. Basic optimization of the OPAC and cart can go a long way, but you can even enable self checkout, library card registration using something like Facebook login, or payment through Apple Pay.
  • Be online: [duh] Offer every basic service available in person online
  • Improve in-house wayfinding through the web: think Google Indoor Maps
  • Exploit smartphone native services to anticipate context: location, as well as time-of-day, weather, etc., can be used to personalize service or contextually guess at the question the patron needs answered. “It’s 7 a.m. and cold outside, have a coffee on us.” – or even a simple “Yep. We’re open” on the front page (see the sketch after this list).
  • Market the good the library provides to the community to win support (or donations)
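
As a rough illustration of the contextual-greeting idea from the list above, here is a minimal server-side sketch in Python; the opening hours and the wording of the messages are made up for the example.

```python
from datetime import datetime, time

# Hypothetical opening hours keyed by weekday (Monday=0 ... Sunday=6);
# a real site would pull these from the ILS or the library's CMS.
OPEN_HOURS = {
    0: (time(9), time(21)), 1: (time(9), time(21)), 2: (time(9), time(21)),
    3: (time(9), time(21)), 4: (time(9), time(18)), 5: (time(10), time(17)),
    6: None,  # closed on Sundays
}

def front_page_banner(now=None):
    """Return a short, context-aware message for the library home page."""
    now = now or datetime.now()
    hours = OPEN_HOURS.get(now.weekday())
    if hours and hours[0] <= now.time() <= hours[1]:
        closing = hours[1].strftime("%I:%M %p").lstrip("0")
        return "Yep. We're open until {}.".format(closing)
    return "We're closed right now, but the catalog and e-resources never sleep."

print(front_page_banner())
```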

The post Who Uses Library Mobile Websites? appeared first on LibUX.

FOSS4Lib Recent Releases: Sufia - 4.2.0

planet code4lib - Tue, 2014-11-25 21:54
Package: Sufia. Release Date: Tuesday, November 25, 2014

Last updated November 25, 2014. Created by Peter Murray on November 25, 2014.

The 4.2.0 release of Sufia includes the ability to cache usage statistics in the application database, an accessibility fix, and a number of bug fixes.

Nicole Engard: Bookmarks for November 25, 2014

planet code4lib - Tue, 2014-11-25 20:30

Today I found the following resources and bookmarked them:

  • PressForward A free and open-source software project launched in 2011, PressForward enables teams of researchers to aggregate, filter, and disseminate relevant scholarship using the popular WordPress web publishing platform. Just about anything available on the open web is fair game: traditional journal articles, conference papers, white papers, reports, scholarly blogs, and digital projects.

Digest powered by RSS Digest

The post Bookmarks for November 25, 2014 appeared first on What I Learned Today....

Related posts:

  1. Code4Lib Journal
  2. Games & Meebo
  3. The Future of Bibliographic Control: A Time of Transition

District Dispatch: CopyTalk: Free Copyright Webinar

planet code4lib - Tue, 2014-11-25 19:48

Join us for our CopyTalk, our copyright webinar, on December 4 at 2pm Eastern Time. This installment of CopyTalk is entitled, “Introducing the Statement of Best Practices in Fair Use of Collections Containing Orphan Works for Libraries, Archives, and Other Memory Institutions”.

Peter Jaszi (American University, Washington College of Law) and David Hansen (UC Berkeley and UNC Chapel Hill) will introduce the “Statement of Best Practices in Fair Use of Collections Containing Orphan Works for Libraries, Archives, and Other Memory Institutions.” This Statement, the most recent community-developed best practices in fair use, is the result of intense discussion group meetings with over 150 librarians, archivists, and other memory institution professionals from around the United States to document and express their ideas about how to apply fair use to collections that contain orphan works, especially as memory institutions seek to digitize those collections and make them available online. The Statement outlines the fair use rationale for use of collections containing orphan works by memory institutions and identifies best practices for making assertions of fair use in preservation and access to those collections.

There is no need to pre-register! Just show up on December 4, at 2pm Eastern Time. http://ala.adobeconnect.com/copyright/

The post CopyTalk: Free Copyright Webinar appeared first on District Dispatch.

DPLA: From the Book Patrol: A Parade of Thanksgiving Goodness

planet code4lib - Tue, 2014-11-25 19:00

Did you know that over 2,400 items related to Thanksgiving reside at the DPLA? From Thanksgiving menus from hotels and restaurants across this great land to Thanksgiving postcards to images of the fortunate and less fortunate taking part in Thanksgiving day festivities.

Here’s just a taste of Thanksgiving at the Digital Public Library of America.

Enjoy and have a Happy Thanksgiving!

  • Thanksgiving Day, Raphael Tuck & Sons, 1907
  • Macy’s Thanksgiving Day Parade, 1932. Photograph by Alexander Alland
  • Japanese Internment Camp – Gila River Relocation Center, Rivers, Arizona. One of the floats in the Thanksgiving day Harvest Festival, 11/26/1942
  • Annual Presentation of Thanksgiving Turkey, 11/16/1967. Then President Lyndon Baines Johnson presiding
  • A man with an axe in the midst of a flock of turkeys. Greenville, North Carolina, 1965
  • Woman carries Thanksgiving turkey at Thresher & Kelley Market, Faneuil Hall in Boston, 1952. Photograph by Leslie Jones
  • Thanksgiving Dinner Menu. Hotel Scenley, Pittsburgh, PA, 1900
  • More than 100 wounded Negro soldiers, sailors, marines and Coast Guardsmen were feted by The Equestriennes, a group of Government Girls, at an annual Thanksgiving dinner at Lucy D. Slowe Hall, Washington, D. C. Photograph by Helen Levitt, 1944
  • Volunteers of America Thanksgiving, 22 November 1956. Thanksgiving dinner line in front of Los Angeles Street Post door

District Dispatch: Have questions about WIOA?

planet code4lib - Tue, 2014-11-25 18:24

To follow up on the October 27th webinar “$2.2 Billion Reasons to Pay Attention to WIOA,” the American Library Association (ALA) today releases a list of resources and tools that provide more information about the Workforce Innovation and Opportunity Act (WIOA). The Workforce Innovation and Opportunity Act allows public libraries to be considered additional One-Stop partners, prohibits federal supervision or control over selection of library resources and authorizes adult education and literacy activities provided by public libraries as an allowable statewide employment and training activity.

Subscribe to the District Dispatch, ALA’s policy blog, to be alerted to when additional WIOA information becomes available.

The post Have questions about WIOA? appeared first on District Dispatch.

FOSS4Lib Upcoming Events: Advanced DSpace Training

planet code4lib - Tue, 2014-11-25 16:45
Date: Tuesday, March 17, 2015 - 08:00 to Thursday, March 19, 2015 - 17:00. Supports: DSpace

Last updated November 25, 2014. Created by Peter Murray on November 25, 2014.

In-person, 3-day Advanced DSpace Course in Austin March 17-19, 2015. The total cost of the course is being underwritten with generous support from the Texas Digital Library and DuraSpace. As a result, the registration fee for the course for DuraSpace Members is only $250 and $500 for Non-Members (meals and lodging not included). Seating will be limited to 20 participants.

For more details, see http://duraspace.org/articles/2382

David Rosenthal: Dutch vs. Elsevier

planet code4lib - Tue, 2014-11-25 16:00
The discussions between libraries and major publishers about subscriptions have only rarely been actual negotiations. In almost all cases the libraries have been unwilling to walk away and the publishers have known this. This may be starting to change; Dutch libraries have walked away from the table with Elsevier. Below the fold, the details.

VNSU, the association representing the 14 Dutch research universities, negotiates on their behalf with journal publishers. Earlier this month they announced that their current negotiations with Elsevier are at an impasse, on the issues of costs and the Dutch government's Open Access mandate:
Negotiations between the Dutch universities and publishing company Elsevier on subscription fees and Open Access have ground to a halt. In line with the policy pursued by the Ministry of Education, Culture and Science, the universities want academic publications to be freely accessible. To that end, agreements will have to be made with the publishers. The proposal presented by Elsevier last week totally fails to address this inevitable change.

In their detailed explanation for scientists (PDF), VNSU elaborates:

During several round[s] of talks, no offer was made which would have led to a real, and much-needed, transition to open access. Moreover, Elsevier has failed to deliver an offer that would have kept the rising costs of library subscriptions at an acceptable level. ... In the meantime, universities will prepare for the possible consequences of an expiration of journal subscriptions. In case this happens researchers will still be able to publish in Elsevier journals. They will also have access to back issues of these journals. New issues of Elsevier journals as of 1-1-2015 will not be accessible anymore.

I assume that this means that post-cancellation access will be provided by Elsevier directly, rather than by an archiving service. The government and the Dutch research funder have expressed support for VNSU's position.

This stand by the Dutch is commendable; the outcome will be very interesting. In a related development, if my marginal French is not misleading me, a new law in Germany allows authors of publicly funded research to make their accepted manuscripts freely available 1 year after initial publication. Both stand in direct contrast to the French "negotiation" with Elsevier:
France may not have any money left for its universities but it does have money for academic publishers.
While university presidents learn that their funding is to be reduced by EUR 400 million, the Ministry of Research has decided, under great secrecy, to pay EUR 172 million to the world leader in scientific publishing, Elsevier.

LITA: Top Technologies Webinar – Dec. 2, 2014

planet code4lib - Tue, 2014-11-25 15:56

Don’t miss the Top Technologies Every Librarian Needs to Know Webinar with Presenters: Brigitte Bell, Steven Bowers, Terry Cottrell, Elliot Polak and Ken Varnum

Offered: December 2, 2014
1:00 pm – 2:00 pm Central Time

Register Online (page arranged by session date; login required)

We’re all awash in technological innovation. It can be a challenge to know what new tools are likely to have staying power — and what that might mean for libraries. The recently published Top Technologies Every Librarian Needs to Know highlights a selected set of technologies that are just starting to emerge and describes how libraries might adapt them in the next few years.

In this webinar, join the authors of three chapters from the book as they talk about their technologies and what they mean for libraries.

Hands-Free Augmented Reality: Impacting the Library Future
Presenters: Brigitte Bell & Terry Cottrell

Based on the recent surge of interest in head-mounted augmented reality devices such as the 3D gaming console Oculus Rift and Google’s Glass project, it seems reasonable to expect that the implementation of hands-free augmented reality technology will become common practice in libraries within the next 3-5 years.

The Future of Cloud-Based Library Systems
Presenters: Elliot Polak & Steven Bowers

In libraries, cloud computing technology can reduce the costs and human capital associated with maintaining a 24/7 Integrated Library System while facilitating an up-time that is costly to attain in-house. Cloud-based Integrated Library Systems can leverage a shared system environment, allowing libraries to share metadata records and other system resources while maintaining independent local information, reducing redundant workflows and yielding efficiencies for cataloging/metadata and acquisitions departments.

Library Discovery: From Ponds to Streams
Presenter: Ken Varnum

Rather than exploring focused ponds of specialized databases, researchers now swim in oceans of information. What is needed is neither ponds (too small in our interdisciplinary world) nor oceans (too broad and deep for most needs), but streams — dynamic, context-aware subsets of the whole, tailored to the researcher’s short- or long-term interests.

Register Online now to join us for what is sure to be an excellent and informative webinar.

Open Knowledge Foundation: Code for Africa & Open Knowledge Launch Open Government Fellowship Pilot Programme: Apply Today

planet code4lib - Tue, 2014-11-25 14:22

Open Knowledge and Code for Africa launch pilot Open Government Fellowship Programme. Apply to become a fellow today. This blog announcement is available in French here and Portuguese here.

Open Knowledge and Code for Africa are pleased to announce the launch of our pilot Open Government Fellowship programme. The six-month programme seeks to empower the next generation of leaders in the field of open government.


We are looking for candidates that fit the following profile:

  • Currently engaged in the open government and/or related communities. We are looking to support individuals already actively participating in the open government community
  • Understands the role of civil society and citizen based organisations in bringing about positive change through advocacy and campaigning
  • Understands the role and importance of monitoring government commitments on open data as well as on other open government policy related issues
  • Has facilitation skills and enjoys community-building (both online and offline).
  • Is eager to learn from and be connected with an international community of open government experts, advocates and campaigners
  • Currently living and working in Africa. Due to limited resources and our desire to develop a focused and impactful pilot programme, we are limiting applications to those currently living and working in Africa. We hope to expand the programme to the rest of the world starting in 2015.

The primary objective of the Open Government Fellowship programme is to identify, train and support the next generation of open government advocates and community builders. As you will see in the selection criteria, the most heavily weighted item is current engagement in the open government movement at the local, national and/or international level. Selected candidates will be part of a six-month fellowship pilot programme where we expect you to work with us for an average of six days a month, including attending online and offline trainings, organising events, and being an active member of the Open Knowledge and Code for Africa communities.

Fellows will be expected to produce tangible outcomes during their fellowship, but what these outcomes are will be up to the fellows to determine. In the application, we ask fellows to describe their vision for their fellowship or, to put it another way, to lay out what they would like to accomplish. We could imagine fellows working with a specific government department or agency to make a key dataset available, used and useful by the community, or organising a series of events addressing a specific topic or challenge citizens are currently facing. We do not wish to be prescriptive; there are countless possibilities for outcomes for the fellowship, but successful candidates will demonstrate a vision that has clear, tangible outcomes.

To support fellows in achieving these outcomes, all fellows will receive a stipend of $1,000 per month in addition to a project grant of $3,000 to spend over the course of your fellowship. Finally, a travel stipend is available for each fellow for national and/or international travel related to furthering the objective of their fellowship.

There are up to 3 fellowship positions open for the February to July 2015 pilot programme. Due to resourcing, we will only be accepting fellowship applications from individuals living and working in Africa. Furthermore, in order to ensure that we are able to provide fellows with strong local support during the pilot phase, we are targeting applicants from the following countries where Code for Africa and/or Open Knowledge already have existing networks: Angola, Burkina Faso, Cameroon, Ghana, Kenya, Morocco, Mozambique, Mauritius, Namibia, Nigeria, Rwanda, South Africa, Senegal, Tunisia, Tanzania, and Uganda. We are hoping to roll out the programme in other regions in autumn 2015. If you are interested in the fellowship but not currently located in one of the target countries, please get in touch.

Do you have questions? See more about the Fellowship Programme here and have a looks at this Frequently Asked Questions (FAQ) page. If this doesn’t answer your question, email us at Katelyn[dot]Rogers[at]okfn.org

Not sure if you fit the profile? Drop us a line!

Convinced? Apply now to become an Open Government fellow. If you would prefer to submit your application in French or Portuguese, translations of the application form are available in French here and in Portuguese here.

The application will be open until the 15th of December 2014 and the programme will start in February 2015. We are looking forward to hearing from you!

Raffaele Messuti: Serve deepzoom images from a zip archive with openseadragon

planet code4lib - Tue, 2014-11-25 10:00

vips is a fast image processing system. Versions higher than 7.40 can generate static tiles of big images in deepzoom format, saving them directly into a zip archive.
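
As a rough sketch of both halves of that workflow, here is what tile generation and a later tile read might look like using the pyvips binding and Python's standard zipfile module; the file names, dzsave options and the tile path inside the archive are assumptions for illustration, not details taken from the post.

```python
import zipfile
import pyvips

# Generate DeepZoom tiles directly into a zip archive (needs vips >= 7.40).
image = pyvips.Image.new_from_file("big-scan.tif", access="sequential")
image.dzsave("big-scan", container="zip")  # writes big-scan.zip

# A web handler can then read a single tile straight out of the archive,
# instead of unpacking tens of thousands of tiny tile files onto disk,
# and hand it to an OpenSeadragon viewer on the client.
with zipfile.ZipFile("big-scan.zip") as archive:
    tile = archive.read("big-scan_files/10/3_2.jpeg")  # hypothetical tile path
    print(len(tile), "bytes")
```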

PeerLibrary: Educators Rejoice! This Week’s Featured Content from the PeerLibrary Collections

planet code4lib - Tue, 2014-11-25 04:08

PeerLibrary’s groups and collections functionality is especially suited towards educators running classes that involve reading and discussing various academic publications. This week we would like to highlight one such collection, created for a graduate level computer science class taught by Professor John Kubiatowicz at UC Berkeley. The course, Advanced Topics in Computer Systems, requires weekly readings which are handily stored on the PeerLibrary platform for students to read, discuss, and collaborate outside of the typical classroom setting. Articles within the collection come from a variety of sources, such as the publicly available “Key Range Locking Strategies” and the closed access “ARIES: A Transaction Recovery Method”. Even closed access articles, which hide the article from unauthorized users, allow users to view the comments and annotations!

Jonathan Rochkind: “Gates Foundation to require immediate free access for journal articles”

planet code4lib - Mon, 2014-11-24 22:25

http://news.sciencemag.org/funding/2014/11/gates-foundation-require-immediate-free-access-journal-articles

Gates Foundation to require immediate free access for journal articles

By Jocelyn Kaiser 21 November 2014 1:30 pm

Breaking new ground for the open-access movement, the Bill & Melinda Gates Foundation, a major funder of global health research, plans to require that the researchers it funds publish only in immediate open-access journals.

The policy doesn’t kick in until January 2017; until then, grantees can publish in subscription-based journals as long as their paper is freely available within 12 months. But after that, the journal must be open access, meaning papers are free for anyone to read immediately upon publication. Articles must also be published with a license that allows anyone to freely reuse and distribute the material. And the underlying data must be freely available.

 

Is this going to work? Will researchers be able to comply with these requirements without harm to their careers?  Does the Gates Foundation fund enough research that new open access venues will open up to publish this research (and if so how will their operation be funded?), or do sufficient venues already exist? Will Gates Foundation grants include funding for “gold” open access fees?

I am interested to find out. I hope this article is accurate about what they’re doing, and am glad they are doing it if so.

The Gates Foundation’s own announcement appears to be here, and their policy, which doesn’t answer very many questions but does seem to be bold and without wiggle-room, is here.

I note that the policy mentions “including any underlying data sets.”  Do they really mean to be saying that underlying data sets used for all publications “funded, in whole or in part, by the foundation” must be published? I hope so.  Requiring “underlying data sets” to be available at all is in some ways just as big or bigger as requiring them to be available open access.


Filed under: General

FOSS4Lib Upcoming Events: BitCurator Users Forum

planet code4lib - Mon, 2014-11-24 21:55
Date: Friday, January 9, 2015 - 08:00 to 17:00. Supports: BitCurator

Last updated November 24, 2014. Created by Peter Murray on November 24, 2014.

Join BitCurator users from around the globe for a hands-on day focused on current use and future development of the BitCurator digital software environment. Hosted by the BitCurator Consortium (BCC), this event will be grounded in the practical, boots-on-the-ground experiences of digital archivists and curators. Come wrestle with current challenges—engage in disc image format debates, investigate emerging BitCurator integrations and workflows, and discuss the “now what” of handling your digital forensics outputs.

HangingTogether: What languages do public library collections speak?

planet code4lib - Mon, 2014-11-24 21:04

Slate recently published a series of maps illustrating the languages other than English spoken in each of the fifty US states. In nearly every state, the most commonly spoken non-English language was Spanish. But when Spanish is excluded as well as English, a much more diverse – and sometimes surprising – landscape of languages is revealed, including Tagalog in California, Vietnamese in Oklahoma, and Portuguese in Massachusetts.

Public library collections often reflect the attributes and interests of the communities in which they are embedded. So we might expect that public library collections in a given state will include relatively high quantities of materials published in the languages most commonly spoken by residents of the state. We can put this hypothesis to the test by examining data from WorldCat, the world’s largest bibliographic database.

WorldCat contains bibliographic data on more than 300 million titles held by thousands of libraries worldwide. For our purposes, we can filter WorldCat down to the materials held by US public libraries, which can then be divided into fifty “buckets” representing the materials held by public libraries in each state. By examining the contents of each bucket, we can determine the most common language other than English found within the collections of public libraries in each state:

MAP 1: Most common language other than English found in public library collections, by state

As with the Slate findings regarding spoken languages, we find that in nearly every state, the most common non-English language in public library collections is Spanish. There are exceptions: French is the most common non-English language in public library collections in Massachusetts, Maine, Rhode Island, and Vermont, while German prevails in Ohio. The results for Maine and Vermont complement Slate’s finding that French is the most commonly spoken non-English language in those states – probably a consequence of Maine and Vermont’s shared borders with French-speaking Canada. The prominence of German-language materials in Ohio public libraries correlates with the fact that Ohio’s largest ancestry group is German, accounting for more than a quarter of the state’s population.
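
That per-state tally is essentially a group-and-count over the filtered WorldCat records. Here is a toy sketch of the step in Python, with a made-up holdings table standing in for the real extract; the MARC-style language codes and counts are invented for illustration.

```python
from collections import Counter, defaultdict

# Toy records: (state, language code, holdings count). The real input would be
# a WorldCat extract filtered down to US public library holdings.
holdings = [
    ("MA", "eng", 120000), ("MA", "fre", 4200), ("MA", "spa", 3900), ("MA", "por", 2500),
    ("OH", "eng", 150000), ("OH", "ger", 5100), ("OH", "spa", 4800),
    ("TX", "eng", 200000), ("TX", "spa", 25000), ("TX", "vie", 6000), ("TX", "fre", 4000),
]

buckets = defaultdict(Counter)
for state, lang, count in holdings:
    buckets[state][lang] += count

for state, langs in sorted(buckets.items()):
    top_non_english = next(l for l, _ in langs.most_common() if l != "eng")
    print(state, "->", top_non_english)  # e.g. MA -> fre, OH -> ger, TX -> spa
```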

Following Slate’s example, we can look for more diverse language patterns by identifying the most common language other than English and Spanish in each state’s public library collections:

MAP 2: Most common language other than English and Spanish found in public library collections, by state

Excluding both English- and Spanish-language materials reveals a more diverse distribution of languages across the states. But only a bit more diverse: French now predominates, representing the most common language other than English and Spanish in public library collections in 32 of the 50 states. Moreover, we find only limited correlation with Slate’s findings regarding spoken languages. In some states, the most common non-English, non-Spanish spoken language does match the most common non-English, non-Spanish language in public library collections – for example, Polish in Illinois; Chinese in New York, and German in Wisconsin. But only about a quarter of the states (12) match in this way; the majority do not. Why is this so? Perhaps materials published in certain languages have low availability in the US, are costly to acquire, or both. Maybe other priorities drive collecting activity in non-English materials – for example, a need to collect materials in languages that are commonly taught in primary, secondary, and post-secondary education, such as French, Spanish, or German.

Or perhaps a ranking of languages by simple counts of materials is not the right metric. Another way to assess if a state’s public libraries tailor their collections to the languages commonly spoken by state residents is to compare collections across states. If a language is commonly spoken among residents of a particular state, we might expect that public libraries in that state will collect more materials in that language compared to other states, even if the sum total of that collecting activity is not sufficient to rank the language among the state’s most commonly collected languages (for reasons such as those mentioned above). And indeed, for a handful of states, this metric works well: for example, the most commonly spoken language in Florida after English and Spanish is French Creole, which ranks as the 38th most common language collected by public libraries in the state. But Florida ranks first among all states in the total number of French Creole-language materials held by public libraries.

But here we run into another problem: the great disparity in size, population, and ultimately, number of public libraries, across the states. While a state’s public libraries may collect heavily in a particular language relative to other languages, this may not be enough to earn a high national ranking in terms of the raw number of materials collected in that language. A large, populous state, by sheer weight of numbers, may eclipse a small state’s collecting activity in a particular language, even if the large state’s holdings in the language are proportionately less compared to the smaller state. For example, California – the largest state in the US by population – ranks first in total public library holdings of Tagalog-language materials; Tagalog is California’s most commonly spoken language after English and Spanish. But surveying the languages appearing in Map 2 (that is, those that are the most commonly spoken language other than English and Spanish in at least one state), it turns out that California also ranks first in total public library holdings for Arabic, Chinese, Dakota, French, Italian, Korean, Portuguese, Russian, and Vietnamese.

To control for this “large state problem”, we can abandon absolute totals as a benchmark, and instead compare the ranking of a particular language in the collections of a state’s public libraries to the average ranking for that language across all states (more specifically, those states that have public library holdings in that language). We would expect that states with a significant population speaking the language in question would have a state-wide ranking for that language that exceeds the national average. For example, Vietnamese is the most commonly spoken language in Texas other than English and Spanish. Vietnamese ranks fourth (by total number of materials) among all languages appearing in Texas public library collections; the average ranking for Vietnamese across all states that have collected materials in that language is thirteen. As we noted above, California has the most Vietnamese-language materials in its public library collections, but Vietnamese ranks only eighth in that state.
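
The comparison behind Map 3 can be sketched the same way: rank every language within each state by holdings, then compare a language's rank in one state with its average rank across all states that hold it. The numbers below are invented and will not reproduce the article's figures.

```python
from statistics import mean

# Toy per-state holdings buckets, {state: {language code: holdings count}},
# of the kind built in the earlier sketch.
buckets = {
    "TX": {"eng": 200000, "spa": 25000, "fre": 9000, "ger": 8000, "vie": 6000},
    "CA": {"eng": 300000, "spa": 40000, "chi": 15000, "fre": 12000, "vie": 11000},
    "OH": {"eng": 150000, "ger": 5100, "spa": 4800, "fre": 4500, "vie": 300},
}

def language_ranks(langs):
    """1-based rank of each language within one state, ordered by holdings."""
    ordered = sorted(langs, key=langs.get, reverse=True)
    return {lang: i + 1 for i, lang in enumerate(ordered)}

ranks_by_state = {state: language_ranks(langs) for state, langs in buckets.items()}

language, state = "vie", "TX"
state_rank = ranks_by_state[state][language]
national_avg = mean(r[language] for r in ranks_by_state.values() if language in r)
print(f"{language}: rank {state_rank} in {state}, national average rank {national_avg:.1f}")
```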

Map 3 shows the comparison of the state-wide ranking with the national average for the most commonly spoken language other than English and Spanish in each state:

MAP 3: Comparison of state-wide ranking with national average for most commonly spoken language other than English and Spanish

Now we have stronger evidence that public libraries tend to collect heavily in the languages commonly spoken by state residents. In thirty-eight states (colored green), the state-wide ranking in public library collections of the most commonly spoken language other than English and Spanish is better – often substantially better – than the average ranking for that language across all states. For example, the most commonly spoken non-English, non-Spanish language in Alaska – Yupik – is only the 10th most common language found in the collections of Alaska’s public libraries, but this ranking is far better than the national average for Yupik (182nd). In other words, Yupik is considerably more prominent in the materials held by Alaskan public libraries than in the nation at large – in the same way that Yupik is relatively more common as a spoken language in Alaska than elsewhere.

As Map 3 shows, six states (colored orange) exhibit a ranking equal to the national average; in all of these cases the language in question is French or German, languages that tend to be heavily collected everywhere (the average ranking for French is fourth, and for German, fifth). Five states (colored red) exhibit a ranking worse than the national average, though in four of the five the state ranking is only one notch below it.

The strong correlation between the languages commonly spoken in a state and the languages commonly found in that state’s public library collections suggests that public libraries are not homogeneous, but in many ways reflect the characteristics and interests of their local communities. It also highlights the important service public libraries provide in facilitating access to information for community members who may not speak or read English fluently. Finally, public libraries’ collecting activity across a wide range of non-English-language materials suggests the importance of these collections in the context of the broader, system-wide library resource. Some non-English-language materials in public library collections – perhaps the French Creole-language materials in Florida’s public libraries, or the Yupik-language materials in Alaska’s public libraries – could be rare and potentially valuable items that are not readily available in other parts of the country.

Visit your local public library … you may find some unexpected languages on the shelf.

Acknowledgement: Thanks to OCLC Research colleague JD Shipengrover for creating the maps.

Note on data: Data used in this analysis represent public library collections as they are cataloged in WorldCat and are current as of July 2013. Reported results may be affected by WorldCat’s coverage of public libraries in a particular state.

 

About Brian Lavoie

Brian Lavoie is a Research Scientist in OCLC Research. Brian's research interests include collective collections, the system-wide organization of library resources, and digital preservation.


Karen Coyle: Multi-Entity Models.... Baker, Coyle, Petiya

planet code4lib - Mon, 2014-11-24 19:23
Multi-Entity Models of Resource Description in the Semantic Web: A comparison of FRBR, RDA, and BIBFRAME
by Tom Baker, Karen Coyle, Sean Petiya
Published in: Library Hi Tech, v. 32, n. 4, 2014, pp. 562-582. DOI: 10.1108/LHT-08-2014-0081
Open Access Preprint

The above article was just published in Library Hi Tech. However, because the article is a bit dense, as journal articles tend to be, here is a short description of the topic it covers, plus a chance to respond to the article.

We now have a number of multi-level views of bibliographic data. There is the traditional "unit card" view, reflected in MARC, that treats all bibliographic data as a single unit. There is the FRBR four-level model that describes a single "real" item, and three levels of abstraction: manifestation, expression, and work. This is also the view taken by RDA, although employing a different set of properties to define instances of the FRBR classes. Then there is the BIBFRAME model, which has two bibliographic levels, work and instance, with the physical item as an annotation on the instance.

In support of these views we have three RDF-based vocabularies:

FRBRer (using OWL)
RDA (using RDFS)
BIBFRAME (using RDFS)

The vocabularies vary in their degree of specification. FRBRer is the most detailed and strict, using OWL to define cardinality, domains and ranges, and disjointness between classes and between properties; there are, however, no sub-classes or sub-properties. All BIBFRAME properties are defined in terms of domains (classes), and there are some sub-class and sub-property relationships. RDA has a single set of classes derived from the FRBR entities, and each property has a single class as its domain. RDA also has a parallel vocabulary that defines no class relationships; thus, no properties in that vocabulary result in a class entailment. [1]
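
To make "class entailment" concrete, here is a small sketch using the rdflib and owlrl Python packages and a made-up example vocabulary (not the actual FRBRer, RDA, or BIBFRAME property URIs). A property that declares an rdfs:domain pulls any resource it describes into that class; a property with no declared domain entails nothing.

    from rdflib import Graph, Literal, Namespace
    from rdflib.namespace import RDF, RDFS
    import owlrl

    EX = Namespace("http://example.org/vocab/")
    g = Graph()

    # A constrained property: its declared domain is ex:Work.
    g.add((EX.titleOfWork, RDFS.domain, EX.Work))
    g.add((EX.thing1, EX.titleOfWork, Literal("Moby Dick")))

    # An unconstrained property: no domain declared.
    g.add((EX.thing2, EX.title, Literal("Moby Dick")))

    # Apply the RDFS entailment rules.
    owlrl.DeductiveClosure(owlrl.RDFS_Semantics).expand(g)

    print((EX.thing1, RDF.type, EX.Work) in g)  # True: entailed from the domain
    print((EX.thing2, RDF.type, EX.Work) in g)  # False: nothing to entail

Nothing in the data itself says thing1 is a Work; the type statement comes entirely from the vocabulary's domain declaration, which is exactly the "classness" that the unconstrained parallel properties omit.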

As I talked about in the previous blog post on classes, the meaning of classes in RDF is often misunderstood, and that is just the beginning of the confusion that surrounds these new technologies. Recently, Bernard Vatant, who is a creator of the Linked Open Vocabularies site that does a statistical analysis of the existing linked open data vocabularies and how they relate to each other, said this on the LOV Google+ group:
"...it seems that many vocabularies in LOV are either built or used (or both) as constraint and validation vocabularies in closed worlds. Which means often in radical contradiction with their declared semantics."What Vatant is saying here is that many vocabularies that he observes use RDF in the "wrong way." One of the common "wrong ways" is to interpret the axioms that you can define in RDFS or OWL the same way you would interpret them in, say, XSD, or in a relational database design. In fact, the action of the OWL rules (originally called "constraints," which seems to have contributed to the confusion, now called "axioms") can be entirely counter-intuitive to anyone whose view of data is not formed by something called "description logic (DL)."

A simple demonstration of this, which we use in the article, is the OWL axiom for "maximum cardinality." In a non-DL programming world, you often state that a certain element in your data is limited in the number of times it can be used; for example, a MARC record can have only one 100 (main author) field. The maximum cardinality of that field is therefore "1". In your non-DL environment, a data creation application will not let you create more than one 100 field, and if an application receiving data encounters a record with more than one 100 field, it will signal an error.

The semantic web, in its DL mode, draws an entirely different conclusion. The semantic web has two key principles: the open world assumption and the non-unique name assumption. Open world means that whatever the state of the data on the web today, it may be incomplete; there can be unknowns. Therefore, you may say that every book MUST have a title, but if a look at your data reveals a book without a title, then your book still has a title; it is just an unknown title. That's pretty startling, but what about that 100 field? You've said that there can be only one, so what happens if there are 2 or 3 or more of them for a book? That's no problem, says OWL: the rule is that there is only one, but the non-unique name rule says that any "thing" can have more than one name. So when an OWL program [2] encounters multiple author 100 fields, it concludes that they are all different names for the same one thing, as defined by the combination of the non-unique name assumption and the maximum cardinality rule: "There can only be one, so these three must really be different names for that one." It's a bit like Alice in Wonderland, but there's science behind it.
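
Here is a minimal sketch of that behavior with the rdflib and owlrl Python packages (owlrl implements the OWL 2 RL rule set) and a made-up vocabulary standing in for the MARC 100 field; how far a particular reasoner draws this inference depends on its level of OWL support.

    from rdflib import BNode, Graph, Literal, Namespace
    from rdflib.namespace import OWL, RDF, RDFS, XSD
    import owlrl

    EX = Namespace("http://example.org/vocab/")
    g = Graph()

    # "A Book has at most one mainAuthor": the closed-world reading of MARC 100.
    restriction = BNode()
    g.add((restriction, RDF.type, OWL.Restriction))
    g.add((restriction, OWL.onProperty, EX.mainAuthor))
    g.add((restriction, OWL.maxCardinality,
           Literal(1, datatype=XSD.nonNegativeInteger)))
    g.add((EX.Book, RDFS.subClassOf, restriction))

    # A record that appears to violate the rule: two main authors.
    g.add((EX.book1, RDF.type, EX.Book))
    g.add((EX.book1, EX.mainAuthor, EX.authorA))
    g.add((EX.book1, EX.mainAuthor, EX.authorB))

    owlrl.DeductiveClosure(owlrl.OWLRL_Semantics).expand(g)

    # No error is raised; instead the reasoner concludes the two names
    # denote the same individual.
    print((EX.authorA, OWL.sameAs, EX.authorB) in g)  # expected: True

A closed world validator would reject this record outright; the open world reasoner instead "repairs" it by deciding that the two authors must be one.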

What you have in your database today is a closed world, where you define what is right and wrong; where you can enforce the rule that required elements absolutely HAVE TO be there; where the forbidden is not allowed to happen. The semantic web standards are designed for the open world of the web, where no one has that kind of control. Think of it this way: what if you put a document onto the open web for anyone to read, but wanted to prevent anyone from linking to it? You can't. The links that others create are beyond your control. The semantic web was developed around the idea of a web (aka a giant graph) of data. You can put your data up there or not, but once it's there it is subject to the open functionality of the web. And RDFS and OWL, the current standards for defining semantic web data, are designed specifically for that rather chaotic information ecosystem, where, as the third main principle of the semantic web states, "anyone can say anything about anything."

I have a lot of thoughts about this conflict between the open world of the semantic web and the need for closed world controls over data; in particular, whether it really makes sense to use the same technology for both, since there is such a strong incompatibility in the underlying logic of these two premises. As Vatant implies, many people creating RDF data are doing so with their minds firmly set in closed world rules, so that applying the axioms of OWL and RDFS to this data on the open web will not yield the closed world results they expect.

This is what Baker, Petiya and I address in our paper, as we create examples from FRBRer, RDA in RDF, and BIBFRAME. Some of the results there will probably surprise you. If you doubt our conclusions, visit the site http://lod-lam.slis.kent.edu/wemi-rdf/ that gives more information about the tests, the data and the test results.

[1] "Entailment" means that the property does not carry with it any "classness" that would thus indicate that the resource is an instance of that class.

[2] Programs that interpret the OWL axioms are called "reasoners." There are a number of different reasoners that you can call from your software, such as Pellet and HermiT, as well as others built into software packages like TopBraid.

LITA: Top Tech Trends: Call For Panelists

planet code4lib - Mon, 2014-11-24 18:10

What technology are you watching on the horizon? Have you seen brilliant ideas that need exposing? Do you really like sharing with your LITA colleagues?

The LITA Top Tech Trends Committee is trying a new process this year and issuing a Call for Panelists. Answer the short questionnaire by 12/10 to be considered. Fresh faces and diverse panelists are especially encouraged to respond. Past presentations can be viewed at http://www.ala.org/lita/ttt.

Here’s the link:
https://docs.google.com/forms/d/1JH6qJItEAtQS_ChCcFKpS9xqPsFEUz52wQxwieBMC9w/viewform

If you have additional questions, contact Emily Morton-Owens, Chair of the Top Tech Trends Committee: emily.morton.owens@gmail.com
