English language — subjectless constructions*

This is (probably) a final blog post referring to the recent survey by the UK Archives Discovery Network (UKAD) Working Group. Here we look in particular at subject indexing.

We received 82 responses to the question asking whether descriptions are indexed by subject. Most (42) do so, and follow recognised rules (UKAT, Unesco, LCSH, etc.). A significant proportion (29) index using in-house rules and some do not index by subject (18). Comments on this question indicated that in-house rules often supplement recognised standards, sometimes providing specialised terms where standards are too general (although I wonder whether these respondents have looked at Library of Congress headings, which are sometimes really quite satisfyingly specific, from the behaviour of the great blue heron to the history of music criticism in 20th century Bavaria).

Reasons given for subject indexing include:
  • it is good practice
  • it is essential for resource discovery
  • users find it easier than full-text searching
  • it gives people an indication of the subject strengths of collections
  • it imposes consistency
  • it is essential for browsing (for users who prefer to navigate in this way)
  • it brings together references to specific events
  • it brings out subjects not made explicit in keyword searching
  • it enables people to find out about things and about concepts
  • it may provide a means to find out about a collection where it is not yet fully described
  • it maximises the utility of the catalogues
  • it helps users identify the most relevant sources
  • it can indicate useful material that may not otherwise be found
  • it enables themes to be drawn out that may be missed by free-text searching
  • it can aid teachers
  • it helps with answering enquiries
  • it facilitate access across the library and archive
  • it meets the needs of academic researchers
The lack of staff resources was a significant reason given where subject searching was not undertaken. Several respondents did not consider it to be necessary. Reasons given for this were:
  • the scope of the archive is tightly defined so subject indexing is less important
  • the benefits are not clear
  • the lack of a thesaurus that is specific enough to meet needs
  • a management decision that it is ‘faddy’
  • the collections are too extensive
  • the cataloguing backlog is the priority
Name indexing is considered more important than subject indexing only by a small margin, and some respondents did emphasise that they index by name but not by subject. Comments here included the observation that subject indexing is more problematic because it is more subjective, that subjects may more easily be pulled out via automated means and that it depends upon the particular archive (collection). As with name and place indexing, subject indexing happens at all levels of description, and not predominantly at collection-level. Comments suggest that subjects are only added at lower-levels if appropriate (and not appropriate to collection-level).
For subjects, the survey asked how many terms are on average applied to each record. According to the options we gave, the vast majority use between one and six. However, some respondents commented that it varies widely, and one said that they might use a few thousand for a directory, which seems a little generous (possibly there is a misunderstanding here?)
Sources used for subjects included the usual thesauri, with UKAT coming out strongest, followed by Unesco and Library of Congress. A few respondents also referred to the Getty Art and Architecture Thesaurus. However, as with other indexes, in-house lists and a combination approach also proved common. It was pointed out in one comment that in-house lists should not be seen as lesser sources; one respondent has sold their thesaurus to other local archives. There were two comments about UKAT not being maintained, and hopes that the UKAD Network might take this on. And, indeed, when asked about the choice of sources used for subject indexing, UKAT again came up as a good thesaurus in need of maintenance.
Reasons given for the diverse choice of sources used included:
  • being led by what is within the software used for cataloguing
  • the need to work cross-domain
  • the need to be interoperable
  • the need to apply very specific subject terms
  • the need to follow what the library does
  • the importance of an international perspective
  • the lack of forethought on how users might use indexes
  • the lack of a specialist thesaurus in the subject area the repository represents (e.g. religious orders)
  • following the recommendations of the Archives Hub and A2A
* the title of this blog post is a Library of Congress approved subject heading

Place names: we would be lost without them

According to the recent Indexing and Authority Records Survey (which I have been blogging about recently), archivists have a number of reasons why they think it is important to undertake place indexing:

  • to facilitate access
  • it is essential to resource discovery
  • users frequently request information about places
  • it is very important for local historians
  • it is good practice
  • to tackle inconsistencies in spelling and place name changes
  • to distinguish between places that have the same name
  • as a source of statistics (e.g. how many collections relate to individual countries)
  • it is an important part of the University’s diversity plan – many students are from other countries – shows that the collections are international
  • the records are arranged by place
  • it is a way to bring together disparate material in diverse collections
  • it helps identify and track boundary changes over time
  • it is used by national network sites (e.g. the Archives Hub)
The main reason not to index by place was given as a lack of staff resources, but some did also feel that it is not necessary. Other reasons were:
  • the search engine can pull out the place name
  • would need to index at item level for place entries to be useful and this is not practical to do
  • cataloguing and name indexing are the priorities
  • collections cover a small geographical area
  • collections are more thematic and name indexing works better than place indexing
  • not appropriate for the material (e.g. cartoons)
  • it has never been done
  • names are standardised to facilitate keyword searching
For those that do index by place, just as with names, the spread between collection-level, series-level, file and item-level indexing was pretty even, and the percentage of collections indexed by place varied enormously. The sources used for place names were varied, although most do seem to use the recognised gazetteers and guides. Others referred to the Library of Congress, local people and the documents themselves.
Many do use the NCA Rules, but there were some comments about the drawbacks of these -they do not recognise the three Yorkshire Ridings, they were created by a previous generation of archivists and are outdated.
We did ask whether any repositories use a co-ordinates based system, and only 3 responses were in the affirmative, though a couple stated that they were going to look into this.
Finally, when asked about reasons for the choice of rules or sources for place names, there were some varied responses:
  • being part of a set-up with other contributors
  • familiarity
  • ease
  • internationally accepted [standard], widely known and used
  • indexing was done before standards were introduced
  • it appears that no real thought has been given to this
  • standards were not precise enough when the decision was made
Place name indexing: is it necessary? One respondent said: ‘To put it bluntly we would be lost without it.’
Image: Flickr Creative Commons JMC Photos

“I’m Spartacus!’ (or giving a name authority)


This is the second blog post about the recent UKAD survey on indexing and name authorities (as stated previously a report on the survey will be made available shortly).


It seems to me that there is some confusion over what authority records actually are. When we came up with our survey it was clear that defining these terms is not always that straightforward and we often make assumptions that are not necessarily shared . We created a glossary for the survey, and defined a name authority record as:

“An entry for a person or corporate body that includes additional elements about the entity, providing contextual information as well as a name index entry.”

However, it is clear that some respondents were thinking of name index entries rather than more complete authority records. According to our survey, which received 93 responses, 34 maintain authority records that follow recognised rules or sources (although comments indicate that the number of these records may be very limited), 14 follow local practice and 29 do not maintain authority records. Bear in mind that responses were not per institution, so the figures can only tell us so much. But what they do indicate is: (i) there is some confusion about what authority records are (ii) some repositories maintain authority records that follow their own in-house practice rather than recognised standards (iii) it is important for archivists that the software cataloguing systems they use support the creation of authority records.

Many repositories use the original records to create authority records, which is one reason why archivists are in the best position to provide this kind of detailed and useful information to researchers. The original records can give a real insight into individuals, particularly lesser-known individuals. Many archivists base their name authority records on ISAAR(CPF), which gives a level of consistency, but many do not, maybe reflecting the fact that ISAAR is a recent standard (first edition 1996), and cataloguing is not a recent phenomenon.

If the authorised form of the actual name is following recognised rules, this provides for effective resource discovery. But in reality we know that there are often many versions of an individual out there. Here are the entries on the Archives Hub for David Lloyd George:

  • George David Lloyd
  • George David Lloyd 1863-1945 1st Earl Lloyd George Of Dwyfor Statesman
  • George David Lloyd 1863-1945 1st Earl Lloyd George Of Dwyfor Statesman And Prime Minister
  • George David Lloyd 1863-1945 Emph Altrender Epithet Prime Minister
  • George David Lloyd 1863-1945 First Earl Lloyd-george Of Dwyfor Prime Minister
  • Lloyd George David
  • Lloyd George David 1863-1945
  • Lloyd George David 1863-1945 1st Earl Lloyd George Of Dwyfor Statesman
  • Lloyd George David 1863-1945 1st Earl Lloyd-george Of Dwyfor Statesman
This illustrates quite nicely the problems of including an epithet, and even more clearly the problems of NCA Rules insisting on using the last element of a surname, even if it is a compound or hyphenated surname. I will never understand that one…sigh.


I love one of the responses to the question of which sources are used for authority records: ‘books, the internet, people’. In a way this reflects the diversity of sources used, which include encylopaedias, directories, books, journals and registers as well as donor knowledge. This shows how important the expertise of archivists is in using various sources to bring together valuable information about individuals, families and corporate bodies. Authority records maximise the benefits of the information archivists gather together for their work, bringing it to researches and giving them new ways into collections.

Archivists have to work with the software that they have, and sometimes this imposes certain limitations. One respondent mentioned the need to avoid using the ampersand, for example. Many repositories use CALM, and this is compliant with ISAAR(CPF), which should provide a great boost to archivists wanting to create authority records.

I do think that archivists should really be starting to think more carefully about the benefits of name authority records, and we need to have a more co-ordinated and collective approach to this. As one respondent put it, ‘We don’t create these at present, and I wonder whether we ever ought to? Surely this is most sensible as a global resource that we can contribute to and share.’ For my part, I would be very keen for the Archives Hub to facilitate this, and I hope that this is something we can look to in the future.

Image: Flickr Creative Commons Steeljam photostream


What’s in a Name?

I have just been taking a look through the results of a recent survey by the UK Archives Discovery Network (UKAD) Working Group. The Working Group are getting together this week and will be looking at making the results public.
The main thing that struck me was the variety of responses. If we thought that this survey might clarify the situation, I’m beginning to wonder if all that it clarifies is that the situation is not clear!
I’m just going to concentrate on name indexing here, and leave place and subject for another blog post.
It seems that only a small proportion of archivists (as reflected in this survey) do not think that indexing is important. Of the 80 responses, 49 indexed to recognised rules and 23 indexed in line with local practice; 13 did not index and 23 went for ‘other’, which tended to mean they were in the process of creating an index, moving to an index following recognised standards or had legacy data with some indexes.
The survey revealed many reasons to create a name index as a means to access archives:
  • for enhanced resource discovery
  • many users want to search by name (respondents indicated it is a very popular search option)
  • it brings together collections that reference the same people
  • it is a way researchers look for connections
  • it aids interdisciplinary research
  • to identify people involved in particular works and their roles
  • it helps researchers to narrow down larger numbers of hits to just relevant collections
  • it promotes interoperability
  • it addresses problems with variants of the name, name changes, or different people with the same name (aids reliability)
  • it is at the heart of family history research
  • it is useful for answering enquiries
  • it is useful for selecting material, e.g. for exhibitions
When asked why name indexing is not carried out, there were a number of reasons:
  • free text retrieval makes name indexing redundant
  • lack of funding
  • lack of training
  • lack of staff resource
  • the current system does not support indexing
  • it has never been done
  • uncertainty about how to index effectively
  • uncertainty about benefits
Out of 100 responses, 46 felt that name indexing is very important, 33 felt it is reasonably important and 11 felt it is a low priority. The main reason given for name indexing being a low priority was the pressing need to deal with cataloguing backlogs and actually get some kind of description out there. It also seems that archivists do not always feel that they have the evidence to suggest that indexing is of benefit to researchers (or enough benefit to warrant the time involved).
The level at which collections are indexed was often given as ‘whatever is appropriate’ and clearly varied widely. I had expected it to be much higher for collection-level descriptions, but this was not the case.
We asked which sources are used for names, and again the answers were varied. Many people clearly do use the original records, with the National Register of Archives and Dictionary of National Biography coming in close behind. There was mention of Wikipedia, and even Google. In terms of rules, a majority do use the NCA Rules, and more use in-house rules than use AACR2. Several respondents said they use ISAAR(CPF), which is curious, as this standard is for name authority records and states that the main name entry should follow recognised rules (e.g. NCA Rules). I wonder if people were thinking of name authority records rather than basic index entries.
More on the survey to follow. And the UKAD Network will be publishing the results via the listserv, archives-discovery-network@jiscmail.ac.uk Make sure you sign up to this if you are interested in these kind of activities: https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=ARCHIVES-DISCOVERY-NETWORK

Christmas at the Co-op

This month we are celebrating Christmas at the Co-op, showing images of how Co-operative Societies prepared for the festive season. As well as photographs, there are examples of publications the Co-operative movement produced in the run up to Christmas, including war-time rationing recipes from the 1940s.

Photo: Royal Arsenal Co-operative Society staff and artists preparing the Christmas Bazaar, 1960s. Copyright © National Co-operative Archive.

International archival standards: living in perfect harmony?

The International Council on Archives Committee on Best Practices and Standards met recently to look at the four ICA descriptive standards: ISAD(G), ISAAR(CPF), ISDF and ISDIAH. It was agreed at this meeting to delay a full review that might lead to more substantial changes and to concentrate on looking at harmonization.
On the Hub we use ISAD(G), which has become very widely recognised and used. ISAAR(CPF) is something that would be important if we started to think about implementing EAC-CPF, enabling our contributors to create authority records for creators of archives. We think that this is the sort of development that should have cross-sectoral agreement, and we are actively involved in the UK Archives Discovery Network (UKAD), which provides a means for us to discuss these sorts of issues across the archives community in the UK.
As far as the International Description for Descriptive Function (ISDF) is concencerned, I feel that a great deal more work is needed to help archivists understand how this can be practically implemented. Our new EAD Editor does allow contributors to add functions to their descriptions, but this is just using the EAD tag for functions. To me, the whole issue of functions and activities is problematic because I am looking at it from the perspective of aggregation. It is all very well for one institution to define their own functions and activities, but how does this translate into the wider environment? How do we successfully enable researchers to access archives by searching functions and activities across diverse institutions?
I have not really given any thought at all to the International Standard Description for Institutions with Archival Holdings (ISDIAH) other than to basically familiarise myself with the standard. For us, the unique code that identifies the institution and the institution’s name is all that we require within our descritions. We link to the Archon details for the institution, and maybe it is in the Archon directory of UK archives, that ISDIAH should be implemented? I am not sure that it would be appropriate to hold detailed information about individual institutions on the Hub.
I will be interested to see what the outcomes of the Committee’s work are. I wonder whether we need a greater understanding of the standards themselves before we try to understand how they work together? Maybe adopting more consistent terminology and providing a conceptual framework will help archivists to appreciate what the standards are trying to achieve and encourage more use, but I am doubtful. I think that a few training days: ‘Understanding the ICA Descriptive Standards’ wouldn’t go amiss for many archivists, who may have only recently adopted ISAD(G), let alone thought about the implications of the other standards.
In the appendices to the minutes, there are some interesting points of discussion. Even some of the assumptions seem to be based on a greater understanding of the standards than most archivists have. For example, ‘if you use ISAD(G) in conjuction with ISAAR, the Admin/Biog history element of ISAD(G) becomes useless because the description of the record creator is managed by ISAAR’. Well, yes, but I’m not sure that this is so clear cut in practice. It makes sense, of course, but how do we relate that to all the descriptions we now have? Also, ‘ISAAR can be used to structure the information contained in the Admin/Biog history element of ISAD(G)’ – that makes sense, but I know of no practical examples that show archivists are doing this.
I wonder if we really need to help archivists to understand the standards – what they are, what they do, how they work, how they can benefit resource discovery – before we throw a conceptual framework at them. At the same time, I increasingly feel that ISAD(G) is not relevant to the modern environment and therefore I think there is a pressing need to review ISAD(G) before looking at how it relates to other standards.

Hub contributors’ reflections on the current and future state of the Hub



The Archives Hub is what the contributors make it, and with over 170 institutions now contributing, we want to continue to ensure that we listen to them and develop in accordance with their needs. This week we brought together a number of Archives Hub contributors for a workshop session. The idea was to think about where the Hub is now and where it could go in the future.
We started off by giving a short overview of the new Hub strategy, and updating contributors on the latest service developments. We then spent the rest of the morning asking them to look at three questions: What are the benefits of being part of the Hub? What are the challenges and barriers to contributing? What sort of future developments would you like to see?
Probably the strongest benefit was exposure – as a national service with an international user-base the Hub helps to expose archival content, and we also engage in a great deal of promotional work across the country and abroad. Other benefits that were emphasised included the ability to search for archives without knowing which repository they are held at, and the pan-disciplinary approach that a service like the Hub facilitates. Many contributors also felt that the Hub provides them with credibility, a useful source of expertise and support, and sometimes ‘a sympathetic ear’, which can be invaluable for lone archivists struggling to make their archives available to researchers. The network effect was also raised – the value of having a focus for collaboration and exchange of idea.
A major barrier to contributing is the backlog of data, which archivists are all familiar with, and the time required to deal with this, especially with the lack of funding opportunities for cataloguing and retro-conversion. The challenges of data exchange were cited, and the need to make this a great deal easier. For some, getting the effective backing of senior managers is an issue. For those institutions who host their own descriptions (Spokes), the problems surrounding the software, particularly in the earlier days of the distributed system, were highlighted, and also the requirement for technical support. One of the main barriers here may be the relationship with the institution’s own IT department. It was also felt that the use of Encoded Archival Description (EAD) may be off-putting to those who feel a little intimidated by the tags and attributes.
People would like to see easy export routines to contribute to the Hub from other sytems, particularly from CALM, a more user-friendly interface for the search results, and maybe more flexibility with display, as well as the ability to display images and seamless integration of other types of files. ‘More like Google’ was one suggestion, and certainly exposure to Google was considered to be vital. It would be useful for researchers to be able to search a Spoke (institution) and then run the same search on the central Hub automatically, which would create closer links between Spokes and Hub. Routes through to other services would add to our profile and more interoperability with digital repositories would be well-received. Similarly, the ability to search across archival networks, and maybe other systems, would benefit users and enable more people to find archival material of relevance. The importance of influencing the right people and lobbying were also listed as something the Hub could do on behalf of contributors.
After a very good lunch at Christie’s Bistro we returned to look at three particular developments that we all want to see, and each group took one issues and thought about what the drivers are that move it forward and what the retraining forces are that stop it from happening. We thought about usability, which is strongly driven by the need to be inclusive and to de-mystify archival descriptions for those not familiar with archives and in particular archival hierarchies. It is also driven by the need to (at least in some sense) compete with Google, the need to be up-to-date, and to think about exposing the data to mobile devices. However, the unrealistic expectations that people have and, fundamentally, the need to be clear about who our users are and understanding their needs are hugely important. The quality and consistency of the data and markup also come into play here, and the recognition that this sort of thing requires a great deal of expert software development.
The need for data export, the second issue that we looked at, is driven by the huge backlogs of data and the big impact that this should have on the Hub in terms of quantity of descriptions. It should be a selling point for vendors of systems, with the pressure of expectation from stakeholders for good export routines. It should save time, prove to be good value for money and be easily accommodated into the work flow of an archive office. However, complications arise with the variety of systems out there and the number of standards, and variance in application of standards. There may be issues about the quality of the data and people may be resistant to changing their work habits.
Our final issue, the increased access to digital content, is driven by increased expectations for accessing content, making the interface more visually attractive (with embedded images), the drive towards digitisation and possibly the funding opportunities that exist around this area. But there is the expense and time to consider, issues surrounding copyright, the issue of where the digital content is stored and issues around preservation and future-proofing.
The day ended with a useful discussion on measuring impact. We got some ideas from contributors that we will be looking at and sharing with you through our blog. But the challenges of understanding the whole research life-cycle and the way that primary sources fit into this are certainly a major barrier to measuring the impact that the Hub may have in the context of research outputs.

Web 2.0 for teaching: wishy-washy or nitty-gritty?

A useful report, summarising Web 2.0 and some of the perspectives in literature about Web 2.0 and teaching, was recently produced by Susan A. Brown of the School of Education at the University of Manchester: The Potential of Web 2.0 in Teaching: a study of academics’ perceptions and use. The findings were based on a questionnaire (74 respondents across 4 Faculties) and interviews (8 participants) with teaching staff from the University of Manchester. It is available on request, so let us know if you would like a copy.
Some of the points that came out of the report:
  • It is the tutors’ own beliefs about teaching that are the main influence on their perceptions of Web 2.0
  • There is little discussion about Web 2.0 amongst colleagues and the use of it is generally a personal decision
  • Top-down goals and initiatives do not play a major part in use of Web 2.0
  • It may be that a bottom-up experimental approach is the most appropriate, especially given the relative ease with which Web 2.0 tools can be deployed, although there were interviewees who argued for a more considered and maybe more strategic approach, which suggests something that is more top-down
  • There is little evidence that students’ awareness of Web 2.0 is a factor, or that students are actively arguing in favour of its use:
“This absence of a ‘student voice’ in tutors’ comments on Web 2.0 is interesting given the perceptions of ‘digital natives’ – the epithet often ascribed to 21st Century students – as drivers for the greater inclusion of digital technologies. It may shore up the view that epithets such as ‘digital natives’ and ‘Millennials’ to describe younger students over-simplify a complex picture where digital/Web technology users do not necessarily see the relevance of Web 2.0 in education.”
  • The use of and familiarity with Web 2.0 tools (personal use or use for research) was not a particularly influential factor in whether the respondents judged them to have potential for teaching.
  • In terms of the general use of Web 2.0 tools, mobile social networking (e.g Twitter) and bookmarking were the tools used the least amongst respondents. Wikis, blogs and podcasting had higher use.
  • In terms of using these tools for teaching, the data was quite complex, and rather more qualitative than quantitative, so it is worth looking at the report for the full analysis. There were interviewees who felt that Web 2.0 is not appropriate for teaching, where the role of a teacher is to lay down the initial building blocks of knowledge, implying that discussion can only follow understanding, not be used to achieve understanding. There was also a notion that Web 2.0 facilitates more surface, social interactions, rather than real cognitive engagement.
“A number of…respondents expressed the view that Web 2.0 is largely socially orientated, facilitating surface ‘wishy-washy’ discussion that cannot play a role in tacklinkg the ‘nitty-gritty’ of ‘hard’ subject matter”.
Three interviewees saw a clear case for the use of Web 2.0 and they referred to honing research skills, taking a more inquiry-based approach and taking a more informal approach and tapping into a broader range of expertise.
In conclusion “The study indicates that there are no current top-down and bottom-up influences operating that are likely to spread Web 2.0 use beyond individuals/pockets of users at the UoM [Universtiy of Manchester]”. The study recommends working with a small group of academics to get a clearer understanding of the issues they face in teaching and how Web 2.0 might offer opportunities, as well as providing an opportunity for more detailed discussion about teaching practices and thinking about how to tailor Web2.0 for this context.