Planet Cataloging

June 17, 2013

Bibliographic Wilderness

Umlaut 3.1.0 released, with new Bootstrap-based visual design

I like to be confident that open source code I wrote is pretty stable and robust before recommending that others use it.

So I usually try to run any new code, or new versions of existing code, in production myself for a couple weeks before actually releasing it as a stable release.

I’ve been running Umlaut 3.1.0 in production for a couple weeks now. Some minor problems found by reviewing the logs for uncaught excpetions, and fixed. It’s ready for a release.

Umlaut is an open source aggregator of “last mile”services, working with your link resolver and other services to provide consolidated and efficient discovery/delivery service provision.

Umlaut 3.1.0 has now been released. Please see the release notes, especially if upgrading from a previous version of umlaut.

The major change is a complete overhaul of the visual design, based on bootstrap, and small-screen friendly. Thanks again to Scot Dalton from NYU for the initiative to make the Bootstrap-based redesign finally happen.

umlaut_bootstrap


Filed under: General

by jrochkind at June 17, 2013 07:05 PM

June 13, 2013

Bibliographic Wilderness

on the internet, and power

Bruce Schneier writes on how our internet lives are frequently dominated by a few huge internet companies with immense power over those internet doings:

There are a lot of good reasons why we’re all flocking to these cloud services and vendor-controlled platforms. The benefits are enormous, from cost to convenience to reliability to security itself. But it is inherently a feudal relationship. We cede control of our data and computing platforms to these companies and trust that they will treat us well and protect us from harm. And if we pledge complete allegiance to them — if we let them control our email and calendar and address book and photos and everything — we get even more benefits. We become their vassals; or, on a bad day, their serfs….

…So how do we survive? Increasingly, we have little alternative but to trust someone, so we need todecide who we trust – and who we don’t — and then act accordingly. This isn’t easy; our feudal lords go out of their way not to be transparent…

In the longer term, we all need to work to reduce the power imbalance…   We need to balance this relationship, and government intervention is the only way we’re going to get it.

I’ve also been thinking a lot about trying to create more cooperatively controlled internet infrastructure as a way to balance this power and bring (economic) democracy to the internet.

And, with regard to libraries, in many of our fantasies the institution of libraries, collectively, would be a force in internet life, a civic, public sector, decentralized but massive in aggregate counter-balance to the ‘feudal’ internet companies.  If libraries can find, keep, and expand a sustainable role as internet actors.


Filed under: General

by jrochkind at June 13, 2013 08:37 PM

Catalogue & Index Blog

Getting Started with RDA pop-up workshop fully booked

The "Getting started with RDA" pop-up workshop at the University of Warwick on 3rd July is now fully booked.  Apologies to anyone who was hoping to attend this event.  If you would like to add your name to the waiting list please contact the CIG committee.

by Stuart William Hunt at June 13, 2013 07:39 AM

June 12, 2013

Bibliographic Wilderness

Scientific publishing has some problems beyond business models

From an open letter in the Guardian:

 Early in their training, students learn that the quest for truth needs to be balanced against the more immediate pressure to “publish or perish”….

…This publishing culture is toxic to science. Recent studies have shown how intense career pressures encourage life scientists to engage in a range of questionable practices to generate publications….

…At the same time, journals incentivise bad practice by favouring the publication of results that are considered to be positive, novel, neat and eye-catching. In many life sciences, negative results, complicated results, or attempts to replicate previous studies never make it into the scientific record. Instead they occupy a vast unpublished file drawer….

As academic librarians, our role is to be experts — not in any specific field — but in the phenomenon of academic publishing in general.   In our educational role with students, we ought to be helping students understand and think about these issues — to problematize and complexify the world of research publication.  Despite our patrons desire to have blacks and whites that let them complete their assignments with as little thinking as possible (yeah, I said it) —  it’s our professional duty to not only help them complete their assignments as conveniently as possible but also understand problems and current issues in academic publishing in general.

And to make the case to administrators and faculty that this our rightful role.  Not all faculty will welcome critique of the scholarly publishing enterprise that is essentially their livelihood either of course (go read that letter in the Guardian we began with, again).

This reminds me again of Karen Coyle’s excellent points about the phenomenon of “predatory publishers” – we over-simplify if we suggest that publications can easily be split into problem-free ‘good’ and untrustworthy ‘predatory’ problems.

We do disservice to our patrons to imply that as long as they steer clear of identified ‘bad’ publishers from some librarian-endorsed list, then of course anything that’s “peer reviewed” becomes absolutely trustworthy gospel through the magical transubstantiation of ‘peer review’.

And we do disservice to ourselves and our professional capacities to avoid critical engagement with our domain of expertise — academic publishing.  We are — or ought to be — academic professionals, not just clerks and secretaries for the university community or salespeople for scholarly publishers.    Could it help restore professional credibility and respect to librarians if we participated at the front of research into research, of the history and critical analysis of the enterprise of scholarly publishing?


Filed under: General

by jrochkind at June 12, 2013 03:19 PM

Take control of delivery and access with Umlaut

In a recently published editorial in ITAL, Services and User Context in the Era of Webscale DiscoveryMark Dehmlow writes:

A major issue that continues to confound me is the lack of fully integrated request and delivery services that many discovery systems lack. Of course, all of them implement full text linking to every online article that they can create a link to, but as the sphere of scholarly data stretches beyond just articles, library print collections and delivery services have continued to be neglected primarily because implementing those services in an intuitively integrated way, beyond the “link to your old OPAC” methodology, remains a complex task. My main concern with this deficit is that there is a significant amount of scholarly material only available in print and to focus primarily on
electronic access limits the ability of our users to perform comprehensive research and reduces access to significant resources and services that libraries provide.

The open source Umlaut software (for which I am principal developer) has been aiming to fill this gap for over 7 years now, aiming to provide an aggregated and integrated path to delivery and access cross-cutting library departments, systems and services, accross the entire library business.

To be sure, Umlaut is not a magic bullet.  It’s more a platform to design the best solution you can in your actually existing infrastructure.  To make the most of Umlaut requires local developer time and creativity to figure out how you can use it to tie together your various systems and services as seamlessly as possible.   And a typical lack of good integration API in much of our existing (proprietary) infrastructure is an added challenge, generally increasing cost/time of developing a good solution.  But Umlaut is designed to be a platform supporting local solutions to integrating delivery, access, and specific item services — giving you the common skeleton on which you can hang your custom local functionality.

I agree with Dehmlow (and others I know I’ve read essays from but can’t find now) that the ‘last mile’ of access and delivery ought to be a priority for libraries — among other reasons, because access and delivery of the mountains of content we still have that is not both online and freely available, is something that we uniquely provide to our patrons, with much less ‘competition’ than for search and discovery services.  If our services aren’t good, our patrons don’t have other options (such as Google) to get (eg) printed monographs for their research (without just buying them).

And, at this stage in the development of our technological infrastructures, this is not something that a proprietary vendor-provided open-the-box-and-turn-it-on solution is going to be able to do well. Integrated access/delivery necesarily involves cross-cutting multiple pieces of local enterprise software (catalog, ILL, local identity/SSO, and that’s just the start) and policies (can you request locally held books to be delivered to your office? Does it depend on who you are and where the book is?).  It requires custom local policy and integraiton logic. It’s not going to be feasible/economical for a vendor to provide one-size-fits software that actually works well in this arena.  So I’m not as

At least, until your entire library enterprise infrastructure comes from one vendor and consists of an actually integrated single-business cloud platform.  This does seem to be where the industry is heading and what, for instance,  OCLC, Ex Libris, and Serials Solutions are trying to provide.  I know some of these vendors are trying to provide integrated ‘last mile’ services taking advantage of the consolidated integrated cloud infrastructure they provide — although it’s seldom highlighted as an advantage in their marketting, perhaps because most library customers aren’t yet seeing what an advantage it is, what a stumbling point this is for our patrons — where we should be uniquely distinguishing ourselves as able to provide seamless delivery/access, we’re instead just again showing our patrons our ability to provide them with a disjointed, inefficient, frustrating, confusing, experience.

In the meantime, there’s Umlaut, to help you try to stich together a pleasant and fast delivery/access experience.   It hasn’t received quite as much attention in the academic library world as I would hope — I think that’s in part because administrative decision makers have not realized the importance and benefits of improving our ‘last mile’ services, and certainly standing up an Umlaut at your institution does take some local development resources.  However, in addition to my place of work, NYU and Vanderbilt have been using Umlaut for a while.  Recently, I’ve heard of potential interest from several other large research university libraries.  I am hoping that at some point there will be sufficient critical mass of library developers using Umlaut that we can use the platform to take the ‘last mile’ to even greater levels of convenience and integration for our users than I’ve had the resources to do with Umlaut so far.


Filed under: General

by jrochkind at June 12, 2013 02:55 PM

First thus

Re: analog vs digital collections and cataloging

Posting to RadCat On 11/06/2013 16:38, Anderson, William wrote: <snip>It would seem the first logical step would be “what is the job”, the tools will emerge from that. It occurs to me that we do need something like FRBR user tasks, if not “the” FRBR user tasks. Does the word “tasks” start one step to far along the process than what is needed. As James stated below, returning to

by noreply@blogger.com (James Weinheimer) at June 12, 2013 08:10 AM

June 11, 2013

OCLC Cataloging and Metadata News

Moving beyond Bibliographic Record Notification

Learn about migrating to OCLC WorldShare Metadata

June 11, 2013 03:00 PM

First thus

Google Books and Google Play

I just discovered something new (for me at least) about Google Books. I looked at an older posting of mine on my blog and found this link to the Royal Commission report on the management of the British Museum (where Panizzi's catalog is discussed), and the link was: http://books.google.com/ebooks?id=ajFDAAAAcAAJ which used to go into the normal Google Books interface but

by noreply@blogger.com (James Weinheimer) at June 11, 2013 07:55 AM

Re: analog vs digital collections and cataloging

Posting to RadCat On 10/06/2013 16:13, Anderson, William wrote: <snip>As a thought experiment what would such a distributed exercise look like, and what would the economics look like behind it. We do have the model of the public commons, combination social meeting space, business technology center, event venue, and (?) roving information consultants it sounds like. Likewise we have

by noreply@blogger.com (James Weinheimer) at June 11, 2013 07:05 AM

Re: [ACAT] NSA leak source and Metadata (Was: NSA leak source)

Posting to Autocat On 10/06/2013 16:54, McDonald, Stephen wrote: <snip>By using metadata the same way that library users do. You run a search on the metadata set to FIND communications that meet certain criteria (such as frequent communications with someone suspected of supporting terrorism). You examine the results of the search to IDENTIFY suspicious connections. You SELECT one or

by noreply@blogger.com (James Weinheimer) at June 11, 2013 07:01 AM

Terry's Worklog

MarcEdit 5.9 Update Posted

I just posted a new update.  The update includes a major rewrite of the merge algorithm, as well as a few bug fixes and small enhancements.  Here’s the list:

  • Update: Merge Function – this is a rewrite.  The function was rewritten to provide a bit more flexibility, better performance, and a framework for exposing the values used to calculate match points so users can modify weighting if they have specific values that they want to give more importance. 
  • Bug Fix: Find all jump list would report not data found when LDR data was queried.
  • Bug Fix: Jump to record – wouldn’t jump if the record was the last item on a page.
  • Enhancement: Z39.50 – removed the 3 database query limit.  Limit has been bumped up to 250, though at some point, I need to take a look at redoing the interface to make this type of query casting more friendly.
  • Bug Fix: Change Indicator Function – when using two wildcards, the indicators search would report no items found.  This has been corrected.

As always, you can find the new updates at:

–tr

by reeset at June 11, 2013 06:42 AM

Resource Description & Access (RDA)

Authorized Access Point Representing an Expression

6.27.3 Authorized Access Point Representing an Expression (see this LC-PCC PS for Translations and Langugae editions)
Construct an access point representing a particular expression of a work or a part or parts of a work by combining (in this order):
a) the authorized access point representing the work (see 6.27.1) or the part or parts of a work (see 6.27.2)
b) one or more terms from the following list:
   i) the content type (see 6.9)
   ii) the date of the expression (see 6.10)
   iii) the language of the expression (see 6.11)
   and/or
   iv) another distinguishing characteristic of the expression (see 6.12).

EXAMPLE
Goncourt, Edmond de, 1822–1896. Frères Zemganno. English
Resource described: The Zemganno brothers / by Edmond de Goncourt. An English translation of a French novel
Pushkin, Aleksandr Sergeevich, 1799–1837. Evgeniĭ Onegin. English (Beck)
Resource described: Eugene Onegin / Alexander Pushkin ; translator, Tom Beck
Kolloidnyĭ zhurnal. English
Resource described: Colloid journal of the Russian Academy of Sciences. An English translation of a Russian serial
Piave, Francesco Maria, 1810–1876. Ernani. Spanish
Resource described: Ernani : drama lírico en cuatro actos / de F. Piave ; música de G. Verdi ; versión castellana de M. Capdepón. A Spanish translation of Piave’s libretto
Brunhoff, Jean de, 1899–1937. Babar en famille. English. Spoken word
Resource described: Babar and his children. An audio recording of an English translation of the children’s story
Wilde, Oscar, 1854–1900. Works. 2000
Resource described: The complete works of Oscar Wilde / general editors, Russell Jackson and Ian Small. — Oxford ; New York : Oxford University Press, 2000– . Other expressions of Wilde’s complete works have been published in other years
Shakespeare, William, 1564–1616. Works. 2003. Yale University Press
Resource described: The annotated Shakespeare. — New Haven : Yale University Press, ©2003–. An expression of Shakespeare’s complete works; another expression with title Nelson Thornes Shakespeare also began in 2003
Langland, William, 1330?–1400? Piers Plowman (C-text)
Resource described: Piers Plowman / by William Langland ; an edition of the C-text by Derek Pearsall. Langland’s work Piers Plowman exists in different versions designated as A-text, B-text, C-text, etc.
Nutcracker (Choreographic work : Baryshnikov)
Resource described: The nutcracker. A recording of a performance of the American Ballet Theatre and Mikhail Baryshnikov production of the ballet The nutcracker; choreography by Baryshnikov

by salman haider (noreply@blogger.com) at June 11, 2013 02:25 AM

June 10, 2013

CommonPlace.Net

Resilience, connections and a clean slate

The inside-out library at ELAG 2013

This year marked my fifth ELAG conference since 2008 (I skipped 2009), which is not much if you take into account that ELAG2013 was the 37th one. I really enjoyed the 2013 conference, not in the least because of the wonderful people of the local organising committee at the Ghent University Library, who made ELAG2013 a very pleasant event.This year’s theme was “the inside-out library”, a concept coined by Lorcan Dempsey, which in brief emphasises the need for libraries to shift focus 180 degrees.

DSC09680

Sylvia Van Peteghem opening speech

Before you read any further I strongly suggest you read Rurik Greenall’s post on ELAG 2013 first. He covered most of the programme in his usual thorough and analytical way.

In my personal overall conference experience major emphasis was on research support in libraries. This was partly due to my attendance of the pre-conference Joint OpenAIRE/LIBER WorkshopDealing with Data – what’s the role for the library?’ on May 28. It was good to have sessions focusing on different perspectives: data management, data publication, the researchers’ needs, library support and training. I was honoured to be invited to participate in the closing round table panel discussion together with two library directors Wilma van Wezenbeek (TU Delft Library) and Wolfram Horstmann (Bodleian Library), under the excellent supervision of Kevin Ashley (DDC). An important central concept in the workshop was the research life cycle, which consists of many different tasks of a very diverse nature. Academic and research libraries should focus on those tasks for which they are or can easily become qualified.

Looking from another angle we can distinguish two main perspectives in integrating research: the research ecosystem itself, which can be seen as the main topic of the OpenAIRE/LIBER workshop, and the research content, the actual focus of researchers and research projects. I will try to address both perspectives here.

On the first day of the actual conference Herbert Van de Sompel gave the keynote speech with the title “A clean slate”. Rurik Greenall aptly describes the scope and meaning of Herbert’s argument. Herbert has been involved in a number of important and relevant projects in the domain of scholarly communication. My impression this time was: now he’s bringing it all together around the fairly new concept of the “research object”, integrating a number of projects and protocols, like ORE, Memento, OpenAnnotation, Provenance, ResourceSync. It’s all about connections between all components related to research on the web in all dimensions.

This linking of input, output, procedures and actors of research projects in various temporal and contextual dimensions in a machine readable way is extremely important in order to be able to process all relevant information by means of computer systems and present it to the human consumer. In this respect I think it is essential that data citations in scholarly articles should not only be made available in the article text, but also as machine readable metadata that can be indexed by external aggregators.
Moreover, it would be even better if it was possible to provide links to research projects that would serve as central hubs for linking to all associated entities, not only datasets. This is the role that the research object can fulfill. During the OpenAIRE/LIBER workshop I tried to address this issue a number of times, because I am a bit surprised that  both researchers and publishers appear to be satisfied with having text only clickable dataset citations. That is even the case the other way around with links to articles in dataset repositories like Dryad. I think there is a role here for information professionals and metadata experts in libraries. This is exactly the point that Peter van Boheemen made in his talk about producing better metadata for research output. Similarly Jing Wang stressed the importance of investigating the role of metadata specialists and data librarians for interoperability and authority control in her presentation on the open source linked data based research discovery tool Vivo.

Again there are two perspectives here. Even if we have machine readable metadata on research projects and datasets, most systems are not adequately equipped with functionality to process or present this information. It is not so easy to update complex systems with new functionality. Planned update cycles, including extensive testing, are necessary in order to adhere to the system’s design and architecture and to avoid breaking things. This equally applies to commercial, open source and home grown systems. Joachim Neubert’s presentation of the use of the open source CMS Drupal for linked data enhanced publishing for special collections illustrated this. Some very specialist custom extensions to the essentially quite flexible system were needed to make this a success. (On a different note, it was nice to see that Joachim used a simple triple diagram from my first library linked data blog post to illustrate the use of different types of predicates between similar subjects and objects.)
Anyway, a similar point can be made about systems and identifiers for people (authors, researchers, etc.). I participated in the workshop on ISNI, ORCID and VIAF : Examining the fundamentals and application of contributor identifiers led by Anila Angjeli and Thom Hickey, one of six ELAG workshops this year. Thom and Anila presented a very complete and detailed overview of the similarities and differences of these three identifier schemes. One of the discussion topics was the difference in adoption of these schemes by the community on the one hand and as machine readable metadata and their application in library systems on the other.

Here comes “resilience” into play, a concept introduced by Beate Rusch in her talk on the changing roles of the German regional library consortia and service centres in the world of cloud computing and SaaS. Rurik Greenall captures the essence of her talk when he says “… homogenous, generic solutions will not work in practice because they are at odds with how things are done …” and that “messy, imperfect systems… are smart and long lived”. Since Beate’s presentation the term “resilience” popped up in a number of discussions with colleagues, during and after the conference, mainly in the sense that most systems, communities, infrastructures are NOT resilient. Resilience is a concept mainly used in psychology and physics, meaning the ability of someone or something to return to its original state after being subjected to a severe disturbance. Beate’s idea with resilience is that we can adapt better to changing circumstances and needs in the world around us if we are less perfect and rigid than we usually are. In this sense I think resilience can also mean that a structure could permanently change instead of returning to its original state.
In the library world resilience can be applied to librarians, libraries, library infrastructure and library systems alike. In my view “resilience” might apply to the alternative architecture I have described in a recent blog post, where I argue that we should stop thinking systems and start thinking data. In order to be resilient we need an open, connected infrastructure, that is of the web (not on the web). The SCAPE infrastructure for processing large datasets for long term preservation, presented by Sven Schlarb, might fit this description.

A  number of presentations focused on infrastructure and architecture. The new version of the Swedish union catalogue LIBRIS could be described as a resilient system. Martin Malmsten, Markus Sköld and Niklas Lindström showed their new linked open data based integrated library framework which was built from the ground up, from ”a clean slate” so to speak. I can only echo Rurik’s verdict “ With this, Libris really are showing the world how things are done”. Contrary to the Library of Congress BibFrame development which started very promising, but now seems to evolve into an inward looking rigid New Marc. This was illustrated by Martin Malmsten when he revealed to us that Marc is undead, and by Becky Yoose, who wrote a very pertinent parable telling the tale of the resurrection of Marc.
Rurik Greenall described the direction taken at his own institution NTNU Library: getting rid of old legacy library and webpage formats and moving towards being part of the web, providing information for the web, being data driven. It’s a slow and uphill struggle, but better than the alternative. A clean slate again!
Dave Pattern presented a different approach in connecting data from a number of existing systems and databases by means of APIs, and combining these into a new and well received reading list service at the University of Huddersfield.

Back to research. In our presentation, or rather performance, Jane Stevenson and I tried to present the conflicting perspectives of collection managers and researchers in a theatrical way, showing parallel developments in the music industry. Afterwards we tried to analyse the different perspectives, argued that researchers need connected information of all types and from all sources and concluded that information professionals should try and learn to take the researcher’s perspective in order to avoid becoming irrelevant in that area.
The relationship between libraries and researchers was also the subject of the talk “Partners in research. Outside the library, inside the infrastructure“, by Sally Chambers and Saskia Scheltjens. Here the focus was on providing comprehensive infrastructures for research support, especially in the digital humanities. Central question: large top-down institutionalised structures, or bottom-up connected networks? Bottom line is: the researcher’s needs have to be met in the best possible way.
A very interesting example of an actual digital humanities research and teaching project in collaboration between researchers and the library is the Annotated Books Online project that was presented by Utrecht University staff. The collection of rare books is made available online in order to crowdsource the interpretation of handwritten annotations present in these books.

Besides research support there were presentations on other “inside out library” topics: publishing, teaching, data analysis and GLAM.
Anders Söderbäck presented the Stockholm University Press, a new publishing house for open access digital and print on demand books. I was pleasantly surprised that Anders included two quotes of my aforementioned blog post in his talk: “...in the near future we will see the end of the academic library as we know it” and “According to some people university libraries are very suitable and qualified to become scholarly publishers … I am not sure that this is actually the case. Publishing as it currently exists requires a number of specific skills that have nothing to do with librarian expertise“. But of course Anders’ most important achievement was winning the Library Automation Bingo by including all required terms in one slide in a coherent and meaningful way.

DSC09707

DSC09708

Merrilee Proffitt presented an overview of MOOCs and libraries, Sarah Brown described the way that learning materials at the Open University in the UK are successfully connected and integrated in the linked data based STELLAR project. Looking at these developments the question arises if there are already efforts to come to a Teaching Object model, similar to the Research Object?
Andrew Nagy described the importance of analysing huge amounts of usage data in order to improve the usability and end user front end of the Summon discovery tool. Dan Chudnov presented the Social Media Manager prototype, used for collecting data from twitter in order to be used in social science research.
Valentine Charles described the activities carried out by Europeana to contribute large amounts of digitised library heritage resources to Wikimedia Commons by means of the GLAMwiki toolset in order to improve visibility of these resources the Open Access way. The GLAMwiki toolset currently appears to offer a number of challenges for the interoperability and integration of metadata standards between the library and the Wikimedia world. Another plea for resilience.

Then there were the workshops. The combination of these parallel hands-on and engaging group activities and the plenary sessions makes ELAG a unique experience. Although I only participated in one, obviously, I have heard good reports from all other workshops. I would like to give a special mention to Ade and Jane Stevenson’s “Very Gentle Linked Data” workshop, where they managed to teach even non-tech people not only the basic principles of linked data, but also how to create their own triple store and query it with SPARQL.

Summarising: looking at the ELAG2013 presentations, are we ready for the inside out library? Sometimes we can start with a clean slate, but that is not always possible. Resilience seems to be a requirement if we want to cope with the dramatic changes we are facing. But you can’t simply decide to be resilient, either something is resilient or it isn’t. A clean slate might be the only option. In any case it seems obvious that connections are key. The information profession needs to invest in new connections on every level, creating new forms of knowledge, in order to stay relevant.

Share

June 10, 2013 04:30 PM

First thus

NSA leak source and Metadata (Was: NSA leak source)

Posting to Autocat Connected with this leak is a more general discussion of "metadata" and some of the frightening implications it has. While the government says that it does not get the "content" of the information, they do get the "metadata". (http://usnews.nbcnews.com/_news/2013/06/07/18824941-obama-nobody-is-listening-to-your-telephone-calls?lite) We can see this

by noreply@blogger.com (James Weinheimer) at June 10, 2013 12:15 PM

Re: analog vs digital collections and cataloging

Posting to RadCat On 10/06/2013 00:06, Julie Moore wrote: <snip> But getting back to our core values of librarianship, what does a library without books, without a reference desk, and without technical services make? I can tell you that our most popular service point is Star Bucks! And perhaps that is the future of academic libraries ... to basically become a student center. </snip> It has

by noreply@blogger.com (James Weinheimer) at June 10, 2013 12:10 PM

June 09, 2013

First thus

Re: analog vs digital collections and cataloging

Posting to RadCat On 09/06/2013 04:18, Julie Moore wrote: <snip>The advent of the Internet has certainly changed things in cataloging, as we now watch these tens of thousands of MARC vendor records flow in and out of our catalogs via batch loads. They say that our patrons prefer to find ALL content using keyword searching (bringing up tens of thousands of hits) on their computers (or

by noreply@blogger.com (James Weinheimer) at June 09, 2013 08:14 PM

Various librarian-like stuff

carolslib

Do you follow George Takei on Facebook or Twitter? Who knew Sulu had such a humor? Or that George had such a background? He was one of many in the shameful Japanese Internment camps in the USA during World War II. In fact, the play Allegiance is based on the internment camps (not about George […]

by carolslib at June 09, 2013 03:24 PM

June 08, 2013

Resource Description & Access (RDA)

RDA 2.4.1 Serials' editor

Exception
Serials. Record a statement of responsibility identifying an editor of a serial only if the name of the editor is considered to be an important means of identifying the serial (e.g., if a particular person edited the serial for all or most of its existence; if the person's name is likely to be better known than the title of the serial).

by salman haider (noreply@blogger.com) at June 08, 2013 03:44 PM

Identifyin​g Original and Translatio​n in same resource

QUESTION:

This is in regards to 2011352604, which contains both the original text in Sanskrit and the Hindi translation. In AACR2, we would have had a 240 of Kalyāṇakāraka. $l Hindi & Sanskrit. In RDA, we would not have a 240. Instead, in theory, we would give 2 700s (as in the Mail Carrier example from the class materials):

700 02 ‡a Ugrādityācārya, ‡d 8th/9th century. ‡t Kalyāṇakāraka.
700 02 ‡a Ugrādityācārya, ‡d 8th/9th century. ‡t Kalyāṇakāraka. ‡l Hindi.

However, in this case, the 1st 700 would really result in the same N/T string as the 100/245 combo. So is it still required?

ANSWER:

Yes, two 700 name/title analytics.   In some cases the preferred title may be the same as the 245 title proper, but in man other cases it won't be.  We just thought it would always be easier to say "make the analytics" than to say "sometimes you do, and sometimes you don’t".  So it may be redundant in some cases.

(Reproduction of suggestions of experts on RDA)

by salman haider (noreply@blogger.com) at June 08, 2013 03:40 PM

Revisions to CSM due to RDA

The Classification and Shelflisting Manual (CSM) contains the policies on assigning LC classification numbers and LC book numbers (i.e., shelflisting).  Originally written in the 1980s and last updated in 2010, the CSM reflects the AACR2 environment.  

The U.S. national libraries’ decision to implement Resource Description and Access (RDA) led the Policy and Standards Division (PSD) to review the CSM and determine which instruction sheets needed to be revised to reflect RDA instructions.  Generally, the text was updated to ensure consistency with past classification and shelflisting practice, and new examples were chosen.

Specialists in PSD also took this opportunity to reevaluate some long-standing practices both to simplify them and to make them more sustainable in today’s linked, global, environment.  

The priority was to first revise the instruction sheets that are most heavily affected by the implementation of RDA. Those sheets are now provided as PDFs on the ABA website at http://www.loc.gov/catdir/cpso/csm_instruction_sheets.html.  PSD will also post additional instruction sheets there as the revisions are completed.  

PSD is planning staff briefings on the policy changes, the times and dates of which will be announced in the coming days. 

The brief descriptions of significant policy changes provided below are intended for guidance only.  The full instruction sheets should be consulted for instructions and examples. 

F 175               Editions      The instructions for classifying simultaneously published editions were clarified.  

F 632               Literary Authors      Individual literary authors born before 1925 and anonymous literary works published before 1925 must now be printed in the schedule (i.e., numbers must be proposed for them). The cutoff dates had been 1870 and 1899, respectively.      A section was added to clarify the classification of criticism and commentaries about an individual author’s works.

F 633               Literary Authors: Subarrangement of Works      Compilations are no longer classified either as collections or as separate works based on whether the author is still living.  Instead, the assignment of an RDA conventional collective title is now the sole determinant.      The caption Selections, which is used frequently in the literary author tables, will be revised to Selections. Extracts. Passages in order to clarify the distinction between Selected works and Selections.  

F 634               Literary Collections      The treatment of a literary collection versus a collection in which one of the works is predominant was clarified.

G 140              Dates      The examples of date formats were updated to reflect RDA instructions.      Policy on the use of work letters was revised.  New editions of works entered under corporate body should be assigned work letters beginning with b if necessary, to distinguish between editions published in the same year.  This change makes the policy for corporate bodies consistent with the policy for works entered under a personal name or title. [The instruction to use the workmark a for serials was not changed.] 

G 150              Translations      Several languages were added to the Translation Table.      Polyglot was redefined to mean that there are two or more translations present within a resource (previously, three translations were necessary).      The Cuttering practice for translations in which the language is not listed in the Translation Table was revised. Instead of repurposing a Cutter extension in the Table, the language should be assigned a unique Cutter extension.

(Source: PSD-LC)

by salman haider (noreply@blogger.com) at June 08, 2013 03:33 PM

LCSH revisions due to RDA Phase 2 revisions

The RDA Phase 2 changes to the name authority file were completed on March 27, 2013, so the Library of Congress’ Policy and Standards Division will now begin to update LCSH authority records.  PSD expects to complete the project by the end of April 2013.

Personal, corporate, and conference names, as well as titles and geographic headings (MARC 21 fields 100, 110, 111, 130, and 151), that are established in the name authority file and are also printed in LCSH are eligible for revision.  Phrase headings and subdivisions that are based on names or titles (e.g., Food in the Koran; Future life—Koranic teaching) will also be revised as necessary.

This is a large project that must be undertaken in a systematic manner in order to find all of the affected records.  LC catalogers and SACO participants are therefore requested to refrain from making proposals to update headings until after PSD announces the completion of the project.  

At this time, PSD is not planning a wholesale revision to those LC subject headings that include abbreviations that are not permitted in RDA, which chiefly appear in a subfield $y (e.g., Egypt—History—Early Dynastic Period, ca. 3100-ca. 2686 B.C.). 

(Source: PSD, LC)

by salman haider (noreply@blogger.com) at June 08, 2013 03:29 PM

RDA Policies for Library of Congress

LC policies for a few topics recently discussed by the ABA Management Team.  These policies either confirm that the interim policies LC staff have been following are still in place, or note where policies will be different in the post-March period.

The topics covered are:

Policy 1:  RDA Acceptable Authority Records after March 31, 2013
Policy 2:  Monographs Imported from OCLC (including copy cataloging)
Policy 3:  CIP Verification
Policy 4:  Working with Existing Serial Records
Policy 5: Added Volumes for Multipart Monographs
Policy 6:  Manual Hybridization of AACR2 Records with RDA Elements

All staff involved with creating and updating LC/NACO authority records should pay particular attention to Policy 1 on the treatment of authority records. Policies 2-6 are primarily applicable to ABA and other processing units choosing to follow them.  

These policies are also posted at http://www.loc.gov/aba/rda/pdf/rda-policies-for-lc.pdf 

Phase 2 Changes to the LC/NACO Authority FileAlso note that the Phase 2 changes to the LC/NACO authority file were completed this week-- 371,942 records were updated as part of this process.  Should you encounter one of these revised headings in your normal work, please use the new authorized form in your bibliographic record.  A similar automated change is being planned for updating headings in bibliographic records, but until this process is complete, headings will be out of synchronization between the authority record and bibliographic records.  Until the automated changes to bibliographic records are complete, it is not necessary to perform bibliographic file maintenance on existing bibliographic records, nor report the discrepancies to PSD.

Questions on these policies may be sent to LChelp4RDA@loc.gov

by salman haider (noreply@blogger.com) at June 08, 2013 03:19 PM

Acronyms as variants in RDA NARs

QUESTION:

Would the same rules for qualifying a corporate body apply to both the heading and the variants? For example, would this be appropriate under RDA:
110 2_ Progressive Artists Group (Jaipur, India)
410 2_ PAG (Jaipur, India)
We would never qualified "PAG" under AACR2 when it's used as a variant form. When it's used as a heading, of course, then we would qualify. What about in RDA?

ANSWER:

We had a very specific LCRI practice that basically forbade the adding of qualifiers to many kinds of references, we got rid of those restrictions on variants because they represented an exception that wasn't really important and only caused confusion. 11.13.2.1 tells you that you can add the same additions to variants as to authorized access points.

For personal names see RDA Rule 9.19.2.1

(Reproduction of Question and Answer from experts from LC)

by salman haider (noreply@blogger.com) at June 08, 2013 03:11 PM

Revised LC-PCC Policy for Fuller form of name (9.19.1.4, option)

From: Program for Cooperative Cataloging [mailto:PCCLIST@LISTSERV.LOC.GOV] On Behalf Of Reser, Dave
Sent: Thursday, February 21, 2013 5:23 PM
To: PCCLIST@LISTSERV.LOC.GOV

Subject: [PCCLIST] Revised LC-PCC Policy for Fuller form of name (9.19.1.4, option)

Dear PCC members--

The policy related to the optional use of the RDA element "Fuller form of name" in authorized access points for persons (RDA 9.19.1.4, option) has been the subject of debate by more than one PCC Task Group, the PCC Policy Committee, and listservs such as PCCLIST.  LC and PCC have agreed to a new shared policy for this instruction (a change from the previous LC policy).  Because of the high level of interest in this topic, and the postponement of the February release of the RDA Toolkit where it would have been published (http://www.rdatoolkit.org/blog/517 ), we have taken the exceptional step of mounting the revised policy so that it may be applied immediately by catalogers.  Until it is published in the Toolkit, the policy will reside in the Section "Preliminary Publication of LC-PCC PS" at http://www.loc.gov/aba/rda/lcps_access.html (the revised policy is also included below).

Training documentation will be updated to reflect the change in policy.
Thanks, Becky Culbertson (PCC SCS chair) and Dave Reser (LC PSD) 

LC‐PCC PS for 9.19.1.4, Option (Fuller form of name) 


New Authority Records

LC practice/PCC practice for Optional addition: Apply the option to provide a fuller form of name if a part of the forename or surname used in the preferred name is represented by an initial or an abbreviation, if the cataloger considers it important for identification. Add unused forenames or surnames only if needed to distinguish one access point from another (see RDA 9.19.1.4). 

Existing Authority Records

LC practice/PCC practice for Optional addition: Unless otherwise changing an existing heading (e.g., conflict), do not change an existing AACR2 or RDA heading merely to add or remove a fuller form of name.


by salman haider (noreply@blogger.com) at June 08, 2013 02:46 PM

Cataloger's Desktop Now Contains Six RDA-Related Resources

Library of Congress

News from CDS

Cataloger's Desktop Now Contains Six RDA-Related Resources

Another RDA-related resource has been added to Cataloger's Desktop.  The new resource, RDA training resources, is maintained by the CILIP-BL Committee on RDA and provides links to RDA training from Cambridge University Library, CILIP Cataloguing & Indexing Group, the Australian Committee on Cataloguing, rdacake (RDA CAnadian Knowledge Exchange), Die Deutsche Nationalbibliothek, National Library of New Zealand, and several U.S. contributors.

 The following is the current list of RDA-Related resources in Cataloger's Desktop: 

*         RDA: Information and Resources in Preparation for RDA (LC)
*         RDA: LC Documentation for the RDA Test
*         RDA: Resource Description and Access (subscription resource that requires a separate subscription to RDA Toolkit)
*         RDA-L (JSC)
*         RDA Training Resources (CILIP-BL)
*         RDA Vocabularies (Open Metadata Registry)

by salman haider (noreply@blogger.com) at June 08, 2013 02:19 PM

June 07, 2013

all things cataloged

saskiayave

The conference “Academic Librarian 3: The Yin-Yang of Future Consortial Collaboration and Competition” was held in Hong Kong at the end of last month. Presentations are now available, and I would like to draw your attention to one presentation about cataloging: “From union catalogue to fusion catalogue: how collaborative cataloguing might be initiated and implemented in the Hong Kong context” (PDF). Due to electronic resources and the accompanying vendor records, the union catalog, with its relatively uniform application of rules and standards, gets transformed into a “fusion catalog” with different cataloging rules and various levels of detail. This observation definitely resonates with what I’m dealing with at work right now, namely the integration of thousands of e-book records for an evidence-based selection model set up by one of the big university libraries we serve. The data comes from OCLC, in MARC (and created with a different set of cataloging rules), is subsequently converted into the German / Austrian format MAB and into the Aleph Sequential Format in order to be loaded into our catalog. They are not the “prettiest” records but this is an efficient method of offering the users a large amount of content in a fast way. One more project that brought the Austrian union catalog closer to a “fusion catalog” is the big digitization undertaking by the Austrian National Library, “Austrian Books Online”, where not only books are scanned but also catalog cards which are then OCRed, automatically transformed into bibliographic records and batch-loaded into the catalog database.

So does this new “fusion catalog” with a blended mix of standards, formats, rules and detail affect the user at all? Or is it all hidden under the discovery layer anyway? Do we still really need and can we maintain the high level of consistency of the union catalog? The conference presentation gives some aspects of the lessons learned during the transition from union to fusion catalog, that is sometimes imperceptible to everyone but catalogers:

“Past:

  • Following uniform cataloguing practices
  • Preferring a high level of consistency in bibliographic records

Present:

  • Bring in vendor records applying different cataloguing rules and various level of completeness
  • Accepting that ‘a minimal record [is more] beneficial to library users than no record at all’

Variations are inevitable

  • The ideal: Conform fully to one single cataloguing standard and to local conventions
  • In reality: different cataloguing data sets are blended together
  • Direct and immediate access to the needed library materials is more important to users than standard cataloguing records

[...]

  • When variations are accepted and catalogers are open to accepting differences in cataloguing practices”

With RDA on the horizon and with the perspective of having legacy data and new data sitting side by side, as well as data created following different RDA policy decisions for alternatives/options and cataloger’s judgments, if consortial and/or global shared cataloging is to continue we will finally have to say goodbye to our rather closed world-view and come to terms with a non-uniform, blended mixture of bibliographic information.


by Saskia at June 07, 2013 02:16 PM

Various librarian-like stuff

carolslib

Oh Laurel, you always make me think! With Reading Habits of Professionals, you did it again.  I love to see people reading – reading anything at all. Even if I don’t like that author or genre or series, the person is reading. You can’t judge a book by its cover. You can’t judge a reader […]

by carolslib at June 07, 2013 01:58 AM

June 06, 2013

Universal Decimal Classification

New UDC Consortium Facebook page

In February 2013 the old UDC Consortium website was redesigned. The website now has an online shop for Consortium publications. The old UDCC facebook profile has also recently moved to a new  facebook page which is going to be used more frequently for announcements of UDC products and events. We would like to use this opportunity to invite UDC users to follow these channels of communications.

by Aida Slavic (noreply@blogger.com) at June 06, 2013 07:33 PM

June 05, 2013

Bibliographic Wilderness

how to make apache fake a 500 http response

For experimentation or testing (manual or automated, if automated usually captured by vcr), I sometimes need a URL guaranteed to always return an HTTP 500 error response.

Here’s some configuration you can drop in an apache conf to generate a simple default 500:

Redirect 500 /error500

Accessing http://yourserver/error500, or /error500/more/path, or /error500/more/path?with=query, all will return a 500 response with apache’s default 500 body.

The command is ‘Redirect’ becuase this is normally used to generate a redirect with “Location” header, so you can use it for mocking any 3xx with Location too (third argument, if present,  value of Location header), but it also works fine for mocking up other status codes like 500 or anything else, so long as you don’t care too much about what the body looks like.


Filed under: General

by jrochkind at June 05, 2013 03:40 PM

025.431: The Dewey blog

Hoodoo is not Vodou, and Vice-Versa

In October 2012 the LCSH "Voodooism" gave way to "Vodou," the spelling most commonly used by Haitians for the traditional religion of their country.   The DDC has likewise adopted this spelling.  At the same time, we are making clearer the split between vodou, the Haitian traditional religion, and hoodoo, an occult practice found primarily in the southern United States.  Vodou belongs in 299.675 Vodou; hoodoo belongs in 133.4 Demonology and witchcraft.

Example 1. Sticks, stones, roots & bones: hoodoo, mojo & conjuring with herbs, to which the Library of Congress subject headings, "African American magic" and "Hoodoo (Cult)" have been assigned, is about hoodoo and should be classed in 133.4.

Example 2. Haitian vodou: an introduction to Haiti's indigenous spiritual traditions, to which the Library of Congress subject headings, "Vodou—Haiti," "Haiti--Religious life and customs," and “Haiti--Religion” have been assigned, is about Vodou and should be classed in 299.675097294 Vodou in Haiti (built from 299.675 Vodou, plus T1—09, plus T2—7294 Haiti, following the instructions at T1—093-099 Specific continents, countries, localities; extraterrestrial worlds).

Unfortunately, our "clear split" isn't always clear in the literature, as many people have used the term "voodoo" when they are referring to hoodoo.  This is why we have added an example to the Manual note at 130 vs. 200, part of which now reads, "Use 130 for parapsychological and occult phenomena if they are not presented as religious, or if there is doubt as to whether they have been so presented, e.g., works on hoodoo in 133.4 even if the term "voodoo" has been applied (not 299.675 Vodou)."

An example of this type of literature is Voodoo sorcery grimoire.  Although it uses "voodoo" in the title, the work is about hoodoo and should be classed in 133.4.  (It doesn’t help the classifier that both the old "Voodooism" and "Hoodoo (Cult)" were assigned as LCSHs to the work.) 

How can the classifier make the correct choice?  Often there will be clues in the bibliographic record.  On the one hand, the presence of "Haiti" and "religion" should move the classifier toward 299.675, as should "Bondye" and "loa," deities in Vodou.  On the other hand, presence of the following terms should move the classifier toward 133.4:  "conjure," "grimoire," "magic," "mojo," "rootwork," "sorcery," "spells," and  "witchcraft."

by Rebecca at June 05, 2013 12:50 PM

June 04, 2013

Bibliographic Wilderness

More affordable cloud hosting options

A few years ago when AWS was getting a lot of attention in the library world (and everywhere else!), it immediately seemed to me, based on some back of the envelope calculations, to be likely unaffordable to libraries. And not a very good value proposition compared to our standard self-hosting — especially for academic libraries which can benefit from their host universities IT infrastructure, but in general, it wasn’t cheap.

Here’s a blog post arguing that in general, EC2 indeed isn’t a great value proposition – if you mostly need 24/7 instances. Where EC2 shines, of course, is it’s ability to “elastically” (the “E” in EC2) spin up and down instances on the fly, and pay hourly, to quickly adjust your provisioning for to the moment demand.

That’s for architectures that are horizontally scaled out a lot, and need to scale up a lot — not most of our library things, but perhaps increasingly more and more (if we succesfully get more competent and succesful!), sure. Although the author notes and provides some calculations showing that even this can be a dicey value proposition.

In that blog post, he mentions other providers with better prices, but not by name.

In the Hacker News thread, someone mentioned Digital Ocean. They seems to offer a service roughly analagous to EC2, but with far better prices — including hourly charging and instantaneous provisioning, so in theory supporting on-demand load balancing for highly horizontally scaled services, just like EC2.  It might still be hard for a university library to beat in-house hosting, due to having our existing university IT infrastructures where we probably don’t need to pay for or pay seriously reduced pricing for bandwidth, electricity, server room facilities, maybe even operations staff, etc. But if you do have a context where cloud hosting makes sense, it’s worth remembering that Amazon is not the only reliable/competent player in town, and is definitely not the cheapest — although it may be the most feature-complete.


Filed under: General

by jrochkind at June 04, 2013 02:20 PM