Optimistic outcome for optimising the Hub


Paddy, Steve and I (Jane) have spent the last 4 months working on an interesting JISC project to optimise Archives Hub pages for search engines, as part of the Strategic Content Alliance

Initiative.

Search Engine Optimisation (SEO) is a process that aims to increase the visibility of a Website in important search engines like Google. SEO works by modifying the content, the layout, and the architecture of web pages, in addition to using community building techniques to enhance the popularity of a website.

As part of this project, an SEO expert is tracking and recording our current web traffic. We are implementing recommended changes and looking for changes to the website traffic after the changes are made.

Recommendations we have implemented
1. A Search Engine Sitemap

This is something that was developed by Google and is used by other search engines. An XML sitemap is a recommended way of organising a Website and identifying the URLs for the purpose of indexing the site by search engine bots, allowing them to find content and data faster and more efficiently. It is a means for us to tell the search engine what the important pages are, and we can also put a date into the sitemap as an indication of how often the page is updated. The sitemap should help the pages get indexed faster.

The sitemap was relatively easy to create, although it probably needs a bit more work from us in terms of grading pages for priority.

2. Metadata

We have been working on the page metadata. In particular we have minimised duplicate title and description tags, ensured all pages have title tags and thought a bit more about the content of the title and description tags – does the title properly represent the page? Is the description an effective summary of the content with important keywords? It is important to think about this from the perspective of the robots – what are the words that will be most useful for them, in terms of search engine searches?

For example, where we had a metadata title ‘Archives Hub: For Archivists’, we had a heading for the same page ‘Contributing to the Archives Hub’. Ideally these should be the same and we should decide which terms are most important – should ‘archivists’ be in the main heading? Should ‘contributing’ be in the title tag? We have also started to reverse our page titles so that the subject of the page is entered first of all, so not ‘Archives Hub: Contributors’ but ‘Contributors to the Archives Hub’.

3. Headings

As stated above, we are getting the metadata title and page title to correspond, and we are also thinking about the importance of the page headers for search engines. In the past we have had monthly features with titles like ‘Wabsters and Shewsters’. Whilst this might work as an intreguing title for a user, it will not help a user searching for Scottish textile history.

4. URLs

It is worth ensuring that at least one of the important keywords is in the URL for a page. So, a page on railway history should have a URL like http://www.archiveshub.ac.uk/railways.shtml and the title ‘Railway history: 200 years of the steam locomotive’.

5. Work on those keywords

We have worked on including keywords throughout the text, and especially in the first few lines. The inclusion of suggested websites and suggested reading provides a legitimate excuse to repeat keywords, both in their titles and in the annotations.

Other recommendations

There were other recommendations that we intend to implement over time, but did not have the resources to implement immediately – and some of them will more rationally fit into a redesign of our webiste (which is happening over the next 6 months).

1. Minimise use of tables

2. Change directory names to something more meaningful, e.g. ‘institution’ instead of ‘inst’, or ‘archivist’ instead of ‘arch’

3. Encourage external sites to link to the Hub site. This is an ongoing activity, but it should be easier with our new Website, and with our new approach to monthly features. We will also be able to link to Hub descriptions from sites like the National Register of Archives because we will have persistent URLs for all descriptions.

Web ranking reports

We have been working with Alan K’necht, an SEO expert, and Thierry Arsenault from the The Canadian Heritage Information Network (CHIN). Alan has provided us with weekly Web ranking reports. These reports are based upon some agreed search terms that we are using. We created three pages for three subject areas where the Archives Hub has strong collection representation: fairs and circus history, history of textiles and british railway history. For all of these subjcts we already had a monthly feature that we had created, so we could use the pages that already existed and just work on them to make them more optimised for search engines.

Conclusions so far

So, has it worked? If I take ‘fairground history’ as an example. On April 13th, this was at 30 in the Google rankings and at 14 in Google UK rankings. By May 11th it was at 11 in Google and 7 in Google UK. By June 6th it had moved to 6 in the rankings, and a quick search on Google UK now (17th June) puts it at number 3.

Railway history is maybe a more challenging topic, as we are competing with a huge amount of information. ‘Railway history UK’ was not ranked at the start of the project, but by 15th June it was at 15 in the rankings for Google, and at 11 for Google UK. A search on Google of just pages from the UK currently brings the page up to number 6 in the list.

Of course, the challenge with Google is to get the URL in the first page of results, and it is always a moveable feast, so if the page ranks highly one week, it may not do so the next. However, the work that we have done has clearly made an improvement to our rankings, and if we apply the lessons learnt to our other feature pages, we should be able to attract more people to the Archives Hub Website.

The principle of the JISC study was that ‘implementing a few simple and inexpensive Search Engine Optimisation (SEO) techniques can increase an organisation

Features: Top 10

Woodcraft Folk: trumpet and gongHere are the current ten most popular features:

  1. Dig for Victory! (Nov 04) Archival Awareness with help from the Women’s Land Army
  2. The Great War (Sep 04)
  3. Forensics: a partial print of the history of forensic science (Jun 09)
  4. The Big Draw (Oct 02) all kinds of drawings can be found in archives
  5. Interpreting Shakespeare (Apr 03)
  6. Miners’ Strike 1984-1985 (Mar 04)
  7. Railway history: 200 years of the steam engine (Jun 04)
  8. Fairs and Circus History (Oct 08)
  9. Trees (Dec 03)
  10. Stanley Kubrick (Jul 08)

Illustration: Woodcraft Folk photo copyright © National Co-operative Archive. From Around the Campfire (Aug 07).

International Archives Day 9th June 2009

Did you know that today is International Archives Day?

This is the 2nd International Archives Day ever held and 9th June was chosen because the International Council on Archives (ICA) was founded on 9th June 1948. Last year was the First International Archives Day, coinciding with the 60th Anniversary of ICA.

For more information about this and the history of ICA, go to the Unesco Archives website.


Over the last year the Archives Hub has had over 120,000 visits from over 184 countries. The map above gives an indication of international use.

One of our contributors, Glasgow University Archive Services, is celebrating International Archives Day by launching an online resource highlighting the international scope and reputation of Glasgow University and its archive collections.

The exhibition, searchable by region, will demonstrate the involvement of Scottish businesses on the development of the world economy and the influence that University of Glasgow and staff and students have had on the development of education around the world and on the history of many countries.

To go to the resource please see the following link: http://www.gla.ac.uk/services/archives/collections/internationalarchiveday/

If you are interested in international archives you could try the following websites and blogs:

Websites:
ArchiveGrid: A subscription site where you can find historical documents, personal papers, and family histories held in archives around the world.

European Archive: A freely available digital library of archives, with an emphasis on audio-visual materials.

MICHAEL UK: MICHAEL aims to provide simple and quick access to the digital collections of museums, libraries and archives from different European countries.

Unesco Archives Portal: a gateway to international archive collection websites

OCLC WorldCat (Manuscript materials): nearly 1.5 million catalogue records describing archival and manuscript collections and individual manuscripts in public, college and university, and special libraries located throughout North America and around the world.

Blogs:
Archiefforum.be: An online community which aims to support students and young archivists in their studies and profession by peer help and advise. (Flemish language)

ArchivesBlogs: a US blog which is a syndicated collection of blogs by and for archivists.

@rchivista: Spanish language blog written by Paco Fern

Forensics

FingerprintThis June’s Collections of the Month examines a partial print of the history of forensic science, shining a light on the descriptions for the papers of physicians, pharmacists, chemists and toxicologists involved in criminal investigations, and records relating to forgery and violent crimes.

Illustration copyright © 2009 The Archives Hub.

Archival Management Software

Archival Management Software: A Report for the Council on Library and Information Resources. Lisa Spiro, January 2009.

The Archives Hub is not in the business of archival management systems, but this report provides a useful perspective on what systems have to offer, and also the current state of cataloguing, albeit essentially in the US. Recommended reading. Here is a summary, highlighting some points of interest.

The report starts off on well-trodden ground about the number of hidden archives. As a partial remedy, it encourages providing access to materials through minimal steps (basic descriptions which may not be ‘perfect’), rather than providing detailed catalogues of a small percentage of holdings. At the same time it states that collection-level descriptions must be done well, otherwise they may not effectively represent the collection to users. The report refers to taking a stripped-down approach to cataloguing – quite a change from the norm for many archivists. This is an issue we have been thinking about at the Hub, and we have taken the decision to reduce the number of mandatory fields that we require of our contributors. A difficult decision, but we felt that we needed to fit in with the ethos that a minimal description is better than no description, and we should be conscious of the difficulties archives often have in providing comprehensive descriptions with only minimal resources.

As an interesting adjunct to the debate about the control archivists have over descriptions (the requirements for expertise), the report cites a project where students are paired with unprocessed collections in their area of interest and trained to catalogue them, resulting in access for users and research topics for the students. Presumably this work is overseen by archivists, but it is still a departure from the idea that cataloguing requires an ‘expert’.

The important point is to provide electronic access as ‘increasingly, materials that are electronically inaccessible are simply not used’ (quoting Jones, Hidden Collections, Scholarly Barriers, 2003). I was heartened to read that the Library of Congress Working Group on

Steve Cohen

This month we are highlighting Steve Cohens collection which he deposited at the Ahmed Iqbal Ullah Race Relations Resource Centre in 2001.

Steve Cohen was a lawyer and anti deportation campaigner who dedicated his life to anti-racism and anti-Semitism, particularly the welfare of immigrants and refugees, and those seeking the right to remain in the UK.

There are links to related materials, mainly in the areas of race relations and immigration.

For more information on the Steve Cohen collection, please contact the Ahmed Iqbal Ullah Race Relations Resource Centre on 0161 275 2920. We’d like to thank Julie Devonald of the Ahmed Iqbal Ullah Race Relations Resource Centre, who wrote the text for this month’s description.

Image of a demo rally poster provided by and copyright the Ahmed Iqbal Ullah Race Relations Resource Centre.

Tuberculosis

Illuminated tram: 'X-RAY NOW. EASY, CONFIDENTIAL, NO UNDRESSINNG'

This month we’ve been focussing on the epidemic disease tuberculosis or T.B., once known as ‘consumption’. We highlight the records of hospitals and sanatoriums, the papers of medical professionals, campaigners and organisations, and victims of the disease; illustrated with a range of photographs from Lothian Health Services Archive, The Women’s Library, the People’s History Museum, and the Royal College of Physicians and Surgeons of Glasgow.

Photo: Illuminated tram-car advertising, 1957, part of Glasgow Corporation’s tuberculosis campaign. Image Wellcome Library, London and copyright © Wellcome Trust, reproduced under Creative Commons licence.

Democratising context?


As I reported in a previous blog, the Archives 2.0 conference threw up and tossed about a whole host of issues. Geoffrey Yeo from University College London talked about archival description and how this might need to change, and he made me start to think about what we mean when we talk about the context of an archive collection.

As an archive student I was taught about the vital importance of the archival context. This is seen as providing important evidence for users of the archive, enabling them to place the materials within the context of their creation. The context gives the material meaning. We generally catalogue from the collection level down to the item level, and we tend to impose this route on our users – our websites often compel them to go to the collection and drill down to find specific items.

The Archives Hub generally takes this approach: an initial search results in a hit list of collection level descriptions. Advanced searches by default include both collection and lower level descriptions. We are very aware of the advantages of taking users to individual item descriptions, especially now that we are planning to add images and links to content. It would be great to add images to individual descriptions. One of the challenges is to present the user with collection and item-level descriptions in such as way that they understand the principle of the archival description – from the general down to the specific.

At the Archives 2.0 Conference, Jon Newman talked about the MLA London Revisiting Archive Collections project. Having struggled to find anything useful about the project on the MLA London Website, I’ll just refer to a previous Hub Blog post to describe it: “Focus groups of diverse groups of people, generally unfamiliar with archives, were set up in three different London institutions. They were asked to look at and provide feedback on specially selected archives that were chosen because they might resonate with the groups, having relevance to their lives and experiences. For example, a Tanzanian women’s group was commenting on photographs and manuscripts relating to Tanzania and a group of cleaners and security staff, many of west African origin, were looking at Somalian and Nigerian material.”

Jon gave some examples of how participants gave different contexts to images by providing additional information about them. For example, a participant commented on a photograph of two women from a Nigerian tribe. She was originally from a neighbouring tribe and remembered details about clothing and how the tribes had a tradition of gently mocking eachother.

This project essentially broke away from the archival context to create other contexts for the archives. It showed how they can have different meanings to different people, depending upon their perspective, and gave the archives new contexts that other researchers could benefit from.

My feeling is that the archival profession is moving from a situation in which we very much saw archival context as THE context to a position where we are starting to appreciate and encourage other contexts. I wonder whether we will start to accept that all contexts are, or can be seen as, equally important, or are some more important than others?

I am sure that we don’t want to neglect the archival context because once gone, it is almost impossible to recover, and valuable evidence that can aid interpretation is lost. But maybe we should be less inclined to make the archival context primary and actually think in terms of flexible access to archives through descriptions that give equal weight to the individual item, the various contexts within which that item might be seen, and the evidential value of the item as part of a whole collection?

Whilst thinking about this whole issue, I couldn’t help but reflect that there is an increasing tendency to display isolated ‘treasures’ on the Web, and actually neglect context altogether. Many websites, it seems to me, get funding to create an attractive interface to display images, but give little attention to metadata, connections, contexts and sustainability.

So, are we moving towards a myriad of contexts, or are we in danger of losing context altogether?

Image of salt: From Flickr courtesy of kevindooley’s photostream

Archives 2.0 Conference report

Archives 2.0: Shifting Dialogues between Users and Archivists was the culmination of a programme of events held by the Centre for Research on Socio-Cultural Change, based at the University of Manchester. The Archives Hub were very happy to be co-organisers and I certainly got a good deal out of the four seminars that I attended and this two-day conference that drew together archivists, academics and other information professionals.

The first session was called ‘Whither Archives 2.0’ (named in honour of ye olde archivists I feel!). Well, I’m not entirely sure that we could answer the ambitious question of where archives 2.0 may be going, other than in the general sense that social networks and user engagement in a broad sense is only going to gather momentum.

I think that presentations on difficult subjects often have a tendency to provide a list of challenges and issues, without necessarily providing much else. There was a danger that we would all talk about the problems and challenges, which are of course important to think about, but in fact there was a good mixture of setting out the landscape, considering the broader philosophical implications and thinking about the issues as well as presenting practical projects that have really borne fruit.

In my talk (slides available on Slideshare) I referred to Kate Theimer’s Archives 2.0 manifesto that she published on her ArchivesNext Blog a while back. There were no radical dissenters from this idea of a more open, participatory and collaborative approach in principle, but I certainly felt that there were differing levels of acceptance. There were certainly assertions that professionalism and the rigour of standards are still appropriate and necessary, and so maybe the balance is difficult to achieve. There were also some references to control – the need for the professional to have a certain level of control over the archive and over the metadata – a fascinating area of debate. Interestingly, we didn’t spend much time defining what we meant by ‘Archives 2.0’ (I think that I was the only one who did this to any extent). In principle I think this is a good thing, because it’s too easy to get bogged down with definitions, but maybe there were differences between those who would define it in the broader sense of an open and collaborative mindset and those who were more focused on the current popular tools that are on offer – Flickr, Twitter, YouTube, etc.

Michael Kennedy, presenting on Documents on Irish Foreign Policy was particularly resolute that for diplomatic archives such as those he has responsible for, integrity is uppermost. He was cautious of adopting an Archives 2.0 approach that might allow users to interfere with the text. He seemed to feel that this meant that he was to some extent rejecting an Archives 2.0 approach, but we don’t want to end up taking a draconian approach to what Web 2.0/Archives 2.0 means for archives and archival finding aids – we don’t have to let users add to the text just in order to tick the right box.

One thing that struck me about some of the projects that were presented was that they seemed very self-contained and very much to operate within their own defined space. It reminds me of the ‘walled garden’ analogy that Ewan McIntosh talked about at the JISC Conference this week. We are still tending to build our own environment in our own space and asking people to come to it – to come to a destination that we prescribe for them. Ewan talked about VLEs and how students are forced to go to them for course materials, but usually dash in and out and then go back to more comfortable and happening environments. To me, Archives 2.0 is partly about thinking out of the box – maybe thinking beyond the confines of a project website and considering dissemination more broadly. Its hard though, because I think it brings us back to that thorny issue of control, or lack of it. It means considering dropping traditional practices and ways of doing things that we are comfortable and familiar with. It means venturing into other spaces and in these other spaces we aren’t necessarily in control. But this can bring great rewards. I think that this is amply demonstrated by ‘Revisiting Archive Collections’ – an MLA project that Jon Newman spoke about and that I have referred to in a previous Hub blog. I will come back to this in another blog post, because I thought it threw up some interesting notions of context which will make this post just too long!

Derek Law, from the University of Strathclyde, talked about re-framing the purpose of the library. He wasn’t necessarily stating anything we haven’t already heard, but he did effectively drum home the message that libraries (and archives??) are simply not meeting the current challenges that the online world is throwing up. It reminded me of a recent Horizon programme on the BBC about how people react to disasters. Whilst the threat to libraries may not be quite of that magnitude, Derek did paint a picture of librarians staying stubbornly rooted to the spot in the face of rapid changes going on around them that are going to change the very nature of librarianship and what a library is…if libraries exist at all in 10-20 years time. Whilst Derek was very convincing, I can’t help reflecting that there is another more optimistic side to this. In the UK we apparently publish more books than in any other country (sorry, can’t find the source for this, but I’m sure I heard it on good authority!). So, whilst the environment is changing and libraries do have to adapt, the ‘paper free’ world that has been predicted is not looking very likely to happen in our lifetimes.

Brian Kelly from UKOLN talked to us about the risks associated with implementing Web 2.0 type features (talk on Slideshare), and emphasised that there are risks in everything and sometimes it’s worth taking a certain level of risk in order to gain a certain level of benefit. We need those who are prepared to be early implementers and early adopters, but if we take a measured approach we can avoid the all to familiar trough of despair that often follows excessive levels of expectation. Brian referred to a framework that could be used to consider and manage risk. This does seem like a sensible approach, although I guess that we started the Archives Hub blog, created Netvibes and iGoogle widgets and started Twittering without really analysing the purpose, benefits, risks and costs in any great detail. Maybe we should’ve done this, but then I like to think that we have an admirable sense of adventure, a sense of the missed opportunities that too much naval-gazing can bring about and also a general appreciation that if something takes relatively little time to do or to set up then it might be worth taking the plunge and seeing how it goes. I echo Brian’s reference to the wonderful comic strip by Michael Edison – well worth watching.

Archives 2.0 — If We Build It, Will They Come?

Tomorrow, Jane Stevenson and I will be presenting at “Archives 2.0: Shifting Dialogues between Archivists and Users” (along with Hub alumnus, Amanda Hill). The title of my talk is ‘Archives 2.0: If We Build it, Will They Come?” Yes (sigh) it’s a reference to the film, Field of Dreams where Kevin Costner builds a baseball field as a pure act of faith that They Will Come (including his dead dad) once he’s done.

It’s one of those titles that you think was a good idea at the time when you submit a proposal, and then comes back to haunt you when it comes to pulling everything together. That said, there was method in my madness, and one the main issues I plan to talk about tomorrow is that of user participation. In other words, we might well be able to build the framework for Archives 2.0, but this does not mean that users will participate.

At this present and very early moment in the history of Archives 2.0, are we in danger of being technologically deterministic? In other words, are we so beguiled by what is possible in this heady time of 2.0, where the Machine is Us/Using Us, that we place more emphasis on the technology than on the precise contexts in which we deploy that technology? While we might believe in the ‘Wisdom of Crowds,’ that wisdom is not necessarily translating into archives (or even necessarily library) 2.0. Why?

Here’s an overview of what I plan to discuss:

  • Archives 2.0 as ‘Postmodernity meets ‘Traditional’ Archival Science
  • The problem of technological determinism (i.e. the story we tell ourselves where technology drives change)
  • The problem with the ‘Wisdom of Crowds’ approach to Archives 2.0
  • The promise of a ‘Community of Practice’ approach to Archives 2.0

I’ll have my own slides on slideshare before the end of the week, along with Jane’s. But for now, I invite your comment.