Posts Tagged ‘JISC’

It’s the end of Jerome as we know it (but I feel fine)

Posted on November 28th, 2011 by Paul Stainthorp

The University of Lincoln’s Jerome project finished in August with the successful release of more than 240,000 openly-licensed bibliographic records, available over developer APIs, and a joint hack day with Cambridge University Library‘s COMET project.

Now, encouraged by positive JISC feedback, both institutions—Cambridge and Lincoln jointly—have applied for follow-up project funding under the project title CLOCK. If our bid is successful, the new project will run between December 2011–July 2012, employing a web developer based at the University of Lincoln, and distilling the work of both institutions into the development of new innovative library metadata discovery services for the scholarly community.

You can read the project proposal for CLOCK at http://lncn.eu/ijt4 – the introductory section is below.

The University of Lincoln and Cambridge University Library both delivered successful projects (Jerome and COMET) for the JISC Infrastructure for Resource Discovery Programme in 2011. This is a proposal for the continuation of and elaboration upon the work of both projects, via a programme of development work shared between the two institutions.

Throughout both projects (COMET-Jerome), parallel approaches in technology and data structure were noted and commented upon. A ‘mash day’ workshop event held in Cambridge in August aimed to explore these differences as well as areas of potential synergy. Here project members identified several points of interest to take forward.

Both projects produced outputs of interest to researchers, students, librarians, developers, and designers of bibliographic discovery environments. The CLOCK project will harness the success of these two complementary initiatives and investigate new approaches to data creation and discovery in the library domain. In particular, it will investigate, propose, and develop new, web-based bibliographic tools/APIs which will make it easier for developers, academic libraries and library end-users (esp. researchers) to find Open Bibliographic Data and incorporate that data into systems and workflows.

This project is an opportunity to [1] exploit through real-world applications the significant amount of data released openly by Cambridge University Library; [2] apply the Jerome database architecture, iterative development methodology, and API framework to a bibliographic dataset an order of magnitude greater than the University of Lincoln’s; and [3] to build and enable a new set of tools and demonstrator services which will enable the future development of public Open Bib Data web applications of practical utility to libraries and end-users.

The project will be supported by library consultant Owen Stephens, who will help to put the work into a national context, relating CLOCK to the wider movement toward Open Bib Data and the work of the JISC Discovery initiative. It will take place in an environment (Lincoln/Cambridge) where a culture of developer inquiry and experimentation is encouraged and nurtured. It is also endorsed by senior library management at both universities.

Both universities are involved in complementary development work which will  both inform and be informed by CLOCK: at Cambridge, Ed Chamberlain is guiding the development of the JISC Open Bibliography 2 project; in Lincoln, Paul Stainthorp is lead researcher on the #jiscmrd Orbital project, which is investigating the management of research data, with some areas of overlap.

CLOCK will operate as part of the wider JISC Digital Infrastructure: Information and library infrastructure: Resource discovery, and support the recent concerted effort to move toward openly licensed library discovery in UK Higher Education and beyond.

OAPEN-UK focus group at the British Library

Posted on November 21st, 2011 by Paul Stainthorp

British Library staff & contractors' entrance

Today I was at the British Library (allowed in via the staff entrance, no less) for a librarians’–repository managers’ focus group of the JISC/AHRC-funded OAPEN-UK project, which will run to 2015 and which aims to gather “evidence to help stakeholders make informed decisions on the future of open access scholarly monograph publishing in the humanities and social sciences”.

N.B. There doesn’t seem to be a nice, standard abbreviation for ‘open access scholarly monograph publishing’, so to avoid endlessly repeating the phrase I’ll refer to them as ‘OA e-books’ from now on. Today’s focus group was made up of academic library people (from cataloguing, e-resource management, and subject liaison roles) along with HEI repository managers.

OAPEN-UK is an extension of the original Open Access Publishing in European Networks (OAPEN) project which looked at the role of OA scholarly monograph publishing and its potential effect on researcher attitudes, behaviours, business/publishing models – mainly in the Netherlands. Five publishers (a mixture of ‘pure’ commercial and university publishing houses) are on board the OAPEN-UK steering group; between them they have contributed 60 book titles which will form a pilot data study: divided into 30 matching ‘pairs’ of titles (each pair sharing common characteristics), one book in each pair will form the control group (licensed for sale as usual), the other in each pair will be:

“…made available on the OAPEN Library in open access under a creative commons licence. In addition, the titles may be placed / discoverable via the publisher’s own website, institutional repositories, authors own website and will be 100% available in Google Book Search. MARC records will be made available to libraries”

Quantitative and qualitative data—sales, usage, citations, reuse, plagiarism—will be gathered on both groups of 30 (control/experimental), and combined with information from focus groups (including this one!) and user surveys to inform recommendations for future directions in OA e-book provision: aimed at publishers, universities, libraries, and researcher-authors and researcher-readers.

OAPEN-UK header image

The bulk of today’s focus group was taken up with an exercise to identify some of the issues of interest to libraries and repository managers in an OA e-book-’enabled’ world. The 12 attendees divided into four groups of three and brainstormed using post-it notes (pink: ‘big issues’; blue: opportunities, yellow: questions) on charts divided into four areas for consideration: technical, financial, attitudinal, and administrative. We were then each asked to ‘vote’ on the issues we felt were most important/worthy of discussion, using little red stickers.

I took photos of the four charts:

OAPEN-UK focus group 4 OAPEN-UK focus group 3

OAPEN-UK focus group 2 OAPEN-UK focus group 1

Here’s a list of just a few of the interesting discussions that came out of the exercise:

  • What will be the attitude of subject specialists – if selection isn’t tied up with a financial burden to the university library, will they feel they have lost control of the selection process? Libraries will expect good, accurate, and correctable metadata and selection tools… or will we see a national, shared OA e-books ‘firehose’ feed with little or no selection at the institutional level?
  • How will the vendors of e-book aggregation services and platforms react? And what will be the effect of their reaction on libraries who subscribe to their services? Will we see a model where publishers/aggregators charge for ‘added value’ to a basic OA offering?
  • Does ‘Open Access’ equate to ‘access in perpetuity’? Whose responsibility will it be to ensure continued access? Will we need a LOCKSS/UK Research Reserve-type approach to looking after OA e-books? What should be the role of the JISC/legal deposit libraries/other national bodies in this (to set standards and accredit/certificate universities, perhaps)?
  • Who pays in a future OA e-book ecosystem? We’re not on familiar gold/green journal OA territory. What about author royalties – how will they be collected? Will they suffer, and how? Are libraries being pushed into a new ‘big deal’, this time for e-books (and can OA help)?

Flying Scotsman in sunlight at KGXAt this point, unfortunately—and typically—I had to dash for my train. But I’ll be following the OAPEN-UK project with great interest; it’s one I hope to come back to in future.

Some links:

Electronic Resources Librarian: priorities 2011/2012

Posted on November 17th, 2011 by Paul Stainthorp

I’ve had a useful meeting with my new boss to agree my priorities for the next 12 months of development work in the Library. Here are my top 4, in order of importance.

  1. Discovery selection & implementation;
  2. JISC Orbital project (0.3FTE) – based mainly in CERD until March 2013;
  3. Possible JISC-funded Jerome follow-on work;
  4. Development of the Lincoln Repository – working closely with the Library Institutional Repository Officer (BJ), the Research & Enterprise Office + the subject librarians on the following areas:
    • Metadata workflow and service development
    • Advocacy/training
    • Building a “Research Showcase”
    • CRIS-like development, bibliometrics, and supporting the REF
    • Developing staff profiles on the University’s website
    • E-theses
    • Helpdesk integration (…possibly)

The following are projects—part of the current Library I.T. strategy—that I’ll contribute to but probably won’t lead, and/or work that’s going on in the background that I need to stay abreast of:

  1. Reading list development (project);
  2. Authentication (project);
  3. Participation in various JISC working groups as well as UKCoRR and LISN;
  4. Working with the Acquisitions team on new team rôles/areas of work;
  5. Monitoring and guiding e-resource management (ERM), authentication, and responding to user problems (this area of work will be looked after day-to-day by the Library (E-resources) Assistant (EV), supported by other staff, as part of the cover for my JISC project work);
  6. Supporting the subject librarian for technology in a review of the Library’s presence on the University Portal;
  7. Supporting the subject librarians in promoting and supporting the use of RefWorks 2.0;
  8. Supporting the HELS in administering copyright/digitisation services and the use of Blackboard.
  9. Initiating a new CALM user group.
  10. Co-ordinating LIG (the Library Innovation Group).
  11. Participating in the work of LNCD.

G’won then: what have I forgotten about?

Rough notes from a JISC emerging bibliographic tools workshop, 5th October 2011

Posted on October 12th, 2011 by Paul Stainthorp

I was at Goodenough College in London last Wednesday, 5th October 2011, for a workshop organised under the JISC Discovery programme (discovery.ac.uk), to discuss approaches to publishing, managing, and using Open Bibliographic data (OBD) on the web. Here are some of the notes that I made on the day. I’ve left them rather rough because I don’t have time to bully them into proper paragraphs.

The workshop started with a general overview and discussion of the current picture of OBD.

  • We’re dealing with a growing number of technologies for open library discovery: Linked Data, BibJSON, OPDS (based on Atom), Lincoln’s NoSQL/API-centric approach, even SuperMARC(!?).
  • Few if any people have a good handle on all of these approaches, but we ought to be at least conversant with them.
  • We’re a room full of experimenters! But how can we communicate Discovery/OBD to others? How can JISC funding be used to support the work? We need to surface not only tools and data but also skills.
  • Possibility of looking to e.g. DevCSI/Netskills to help with addressing the skills gap. Are CompSci graduates being encouraged to exercise their skills in open/community development?

We then split into two groups to brainstorm “what’s interesting in bibliographic data at the moment?”: the two groups managed to fill around 8 flipchart sheets :-)

Photo of a flipchart covered in writing

A few quotes and themes I picked up on:

  • What will be the value of OA repositories in hindsight? Will it be open data (some are skeptical) or rather will it be their effect on the publishing industry?
  • A really useful application would be a fits-all API to identify possible identifiers within a record/page – ”I think this is an identifier, please tell me what sort it is” – which then leads into a web service to aggregate information about the thing itself (rights information, etc.) – jokingly called “Rate my Regex”! – some interest in this as a project.
  • Paul Walk: “Please an we have a day off from Linked Data!?
  • Idea of the role of “data doctor/data wrangler” gaining some currency in institutions.
  • There are plenty of code libs for dealing with bibliographic data: pymarc, MARC4JMARC::Record (perl). solrmarc.
  • Owen Stephens: “MARCXML is the worst of MARC combined with the worst of XML. It’s rubbish.
  • A colleague of Peter Murray-Rust (sorry, I didn’t catch your name!). Citable data is not copyrightable. Java library containing ~20,000,000 open article records???
  • Mark MacGillivray[?]: “To most people, this [taps laptop] is just a plastic box full of magic.

After lunch we split again, this time into three groups, each to consider a different aspect of managing Open Bibliographic Data; each to consider opportunities, costs, pitfalls, etc. relating to the technologies themselves as well as to the skills needed in exploiting those technologies:

  1. Transforming data
  2. Munging data (both groups 1. and 2. agreed that the two steps are really the same thing – just “more transformation” – also that ‘munging’ is an awful word…)
  3. Exploitation of data

I was part of the ‘Munging data’ group.

Challenges

  • Problems in the move from a unitary system to distributed data services – loss of control (quality of 3rd-party data can be a problem for the librarian mindset!), worries over sustainability of mashup-style approaches (c.f. dbpedia, BBC RDF, the now-defunct Talis Silkworm project). However, openness itself provides some guarantee against things becoming defunct (i.e. Open Source Software)_.
  • Need to think about the capacity (and the uneven geographic distribution) of local skills
  • “Any data is better than no data”. Use of third-party open data is not really a challenge for management any more (only cataloguers care!)? But still important are notions of provenance, attribution, putting power back in the hands of the end user.
  • We need to think at the citation level – is there a big difference between personal and institutional data?
  • Character encoding!

Gaps

  • Skills. Not enough developers. Unevenly distributed geographically. (Can we construct a course/curriculum for open community development skills?).
  • #ukdiscovery is somewhat distant from the mundane concerns of libraries. Ed Chamberlain is speaking to a group of cataloguers in Oxford about OBD – that’s the sort of thing we want!
  • Thinking about the role of CILIP and ‘professionalism’ – keeping [technical] skills up to date. Portfolios/competency framework approaches. Can we get a push from the top of the library profession?
  • Technology gaps, on the other hand, have mostly gone away. There are enough interesting and easy things to keep us busy without having to worry too much about the things that still don’t work. JISC can help to convince (smaller?) institutions that open development should be trusted.

Opportunities

  • Still attempting to overcome legacy licensing issues. Instead of concentrating on dealing with old data, why don’t we just take a “line in the sand” approach and make sure we’re being 100% open from now on. Do the OBD principles need to be extended?
  • Make use of feedback loops. Learn something about your data by feeding how it’s been used back into the system. Use this usage to inform your transformations.

</end>

Over the moon: Lincoln awarded £240k JISC funding for Orbital MRD project

Posted on September 9th, 2011 by Paul Stainthorp

Satellite City

In July, I blogged about our latest bid for JISC funding – this time for an project called “Orbital” to develop a university research data management infrastructure, piloted with the School of Engineering.

I’m delighted to announce that we were successful. The University of Lincoln has been awarded £241,500* funding for Orbital, under JISC’s Managing Research Data call. The project runs for 18 months, starting on 03 October 2011.

From the project proposal:

“The Orbital project will develop, test and implement a state-of-the-art research data management system, which meets both internal and external partner organisation’s requirements in terms of robustness and security. We will apply a proven approach to the management of institutional data, through the proposed use of MongoDB (a very fast, flexible, schema-less database technology), to create flexible services for capturing, storing, preserving and sharing research data in real time across internal research groups and with external research partners via secure, public APIs. A personalised web interface for specific researcher profiles and a public discovery interface will also be developed.”

Joss Winn from CERD will be the Orbital project manager; I’ll act as “lead researcher”, working—alongside other staff from the Library, Research Office, ICT services and the School of Engineering—to conduct a literature review and examine existing guidance and practice, lead the user requirements analysis, and contribute to the implementation & evaluation of the project. We’ll also be appointing not one but two new developers to work on Orbital.

There’ll be a steering group consisting of senior staff from the VCO, School of Engineering, College of Science, Library, Research Office, and ICT. We’re also bringing in external consultancy from Siemens Industrial Turbomachinery Ltd, the Innovative Design & Manufacturing Research Centre of the University of Bath, and the UK Digital Curation Centre (DCC).

This is a hugely significant project for Lincoln (and the first funding awarded to a CERD/Library/ICT project since we established LNCD). What we’re doing here – it works. To my colleagues, and especially Joss: well done. Congratulations.

We’ll be setting up a project blog for Orbital, at http://orbital.blogs.lincoln.ac.uk/ – watch that space.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
*Yup. Nearly a quarter of a million quid. No messing, eh?

Orbital: managing engineering research data (JISC bid)

Posted on July 28th, 2011 by Paul Stainthorp

Just a note to mention that I’m named (along with a few other library colleagues) on our latest project bid to JISC under the Managing Research Data Programme (02) 2011-13, for an 18 month project, called “Orbital“, on establishing a suite of systems for managing the University’s research data, and working with the School of Engineering – an extension to the Repository project and Jerome.

Here’s the bid document.

Joss Winn has blogged about it as usual. If we’re successful, Joss will act as project manager.  I’ll be “lead researcher”. Bev Jones (Institutional Repository Officer), Chris Leach (Systems Librarian), and Ian Snowley (University Librarian) are also named in the bid.

“Our proposed project is called Orbital because we’re intending to build services for managing research data that ‘orbit’ around Nucleus, the data store we built during the Total Recal and Jerome projects.

“Of course, we’ve set ourselves some new challenges with this project and much work needs to be done in all phases of the project, but having the experience of building web services around large institutional data sets, gives me the confidence that we can tackle what is a really important issue for us – for any university: managing a growing body of research data.”

What I been up to

Posted on July 7th, 2011 by Paul Stainthorp

Apologies: this is one of those generic catch-all blog posts. I attended four separate events last week: here’s a short report from each one.

~~~

Kimberlin1. CILIP UC&R Members’ Day: Making an Impact

De Montfort University, Leicester. 28 June, 2011

This workshop for CILIP members was looking at various ways in which libraries can have (and can measure) their ‘impact’. I spoke first about Lincoln’s involvement in the University of Huddersfield’s Library Impact Data Project (LIDP), and how that project is trying (successfully, it seems) to measure the relationship between students’ library use and their degree ‘success’.

Then DMU subject librarian Jason Eyre talked about his PITSTOP project, which built a mediated forum for online discussion between Social Work students on placement, their lecturers, and their practice educators (in the NHS and local authorities). Jason explained that while the online discussion forum itself was not very well used, the impact of the project was that is acted as a catalyst for building a better relationship between students, academics, practice educators, and the library.

After a very well-run World Café session, where we moved around between different tables, each themed with a different aspect of ‘impact’ in libraries – and then lunch, information management consultant David Streatfield presented on the difficulties of measuring and evaluating the impact that academic libraries can have. He outlined some of the different approaches that have been taken in the past, and how those approaches can be less than successful in an environment of government pressure to control public service provision.

Lastly, Maria Cotera, former president of the CILIP Career Development Group, told us several anecdotes about the ways she has seen library workers make an impact themselves, through their involvement in staff development, social, and extra-professional activities. In an exercise, all the delegates came up with an example of a shared pressure or circumstance in our home institutions that could be turned into an opportunity for staff development.

Thanks to Marie Nicholson and the UC&R East Midlands committee for inviting me to speak! Twitter hashtag: #UCREMimpact.

~~~

Great Central Icehouse2. EMALINK event on collection development

University of Lincoln. 29 June, 2011

This was another East Midlands event, and the first EMALINK event held in Lincoln since we joined that network. It was organised, jointly, by the University of Lincoln, our neighbours Bishop Grosseteste University College, and Nottingham Trent University (NTU). The theme was the lifecycle of collection management: from selection and acquisition, through analysis and review of collections, and finally disposal.

NTU kicked off with a look at their work to incorporate Talis Aspire into the DNA of their library: they’re building a set of resource selection and allocation processes that are strongly driven by the resource lists built by academics using Aspire. Lincoln responded with two short presentations about collection analysis: our project to compare the strengths and weaknesses (in size, breadth, and age) of the various subject collections in our physical bookstock with the relative sizes of the student body in different subject areas; and our work to determine value for money in ‘Big Deal’ database subscriptions. Finally, Susan Rodda from Bishop Grosseteste talked about the options for disposing of unwanted physical library stock, and how BG have managed, for several years, to weed their collection without sending any paper to landfill.

~~~

Goodenough library (detail)3. JISC Managing Research Data Programme (#jiscmrd) community briefing event

Goodenough College, London. 1 July, 2011

On Friday, I attended this briefing event for the current JISC research data funding call for proposals, on Joss Winn‘s behalf. The JISC programme manager ran through the requirements and expectations for the various strands of this current call. Kevin Ashley of the Digital Curation Centre also presented: about how the DCC can support and work with institutions who are running research data management projects. See hashtag: #jiscmrd for information about the programme.

~~~

OU Library4. JISC Innovations in Activity Data workshop

The Open University, Milton Keynes. 4 July, 2011

After a long, Sunday-afternoon train journey to Milton Keynes, I paid my first ever visit to the OU’s Walton Hall campus for another activity data-related event, this time organised and hosted by the team behind the JISC-funded RISE (“Recommendations Improve the Search Experience”) project.

The day began with three presentations from projects funded under the current JISC activity data strand:

  1. Joy Palmer of MIMAS and the SALT project (“Surfacing the Academic Long Tail”: MIMAS working with the John Rylands University Library of the University of Manchester);
  2. RISE themselves (Richard Nurse of the OU) talking about how they are using EZProxy log data to power a recommendation service (“…users who looked at this, also looked at these…“);
  3. Via video link, live from Huddersfield: Dave Pattern talking about LIDP.

Then, another World Café-type exercise (two in one week!). We moved about the room, scribbling on the tablecloths, making notes about: [a] what activity data universities have at their disposal; [b] what use we might put it to; and [c] what barriers are in our way.

In the afternoon: two more presentations. The OU’s Tony Hirst (a.k.a. @psychemedia), rattling and rambling through various techniques for visualising activity data. This is really valuable stuff… what I’m less clear about is: where’s the first rung of the dataviz ladder? How does a muggle start thinking about data visualisation? Tony says that many of the techniques he writes about are things he “didn’t know how to do a couple of hours before…“, but that doesn’t necessarily mean that the rest of us will find them as easy to pick up! Tony’s coming to Lincoln soon, so I’m going to try and talk to him about data visualisation a bit more then.

Last of all, David Kay (of SERO and the JISC activity data Synthesis Project: kind of an umbrella for all of these separate activity data initiatives) summed things up nicely: including an excellent slide listing the kinds of skills library workers are going to have to develop in order to do justice to activity data: including data visualisation, again! I’ll post that slide here, if and when I can find it.

There was a little bit of activity on Twitter for this workshop: look for the hashtag #iad11.

~~~

Anonymised library activity data for the academic years 2007/08, 2008/09 and 2009/10: collected for the JISC Library Impact Data Project

Posted on June 13th, 2011 by Paul Stainthorp

These data consist of entries for 4,268 anonymised students who graduated from the University of Lincoln with a named award at the end of the academic year 2009/10, along with a selection of their library activity over three years (2007/08, 2008/09, 2009/10): library item circulation, visits to the main GCW University Library, and e-resources usage represented by authentication against AthensDA.

View this item on the University Repository: http://eprints.lincoln.ac.uk/4540/

In the background at Discovery event

Posted on May 26th, 2011 by Paul Stainthorp

A few of the Jerome project team are at the JISC/RLUK event in London: ‘Discovery – building a UK metadata ecology‘. Our slides are running on a screen in the foyer; I’ll be hanging around to talk about them.