Posts Tagged ‘Horizon’

Library Impact Data Project: good news, everybody!

Posted on June 18th, 2011 by Paul Stainthorp

I think this is worth re-posting from the LIDP blog:

LIDP graphicWe are very pleased to report that we have now received all of the data from our partner organisations and have processed all but two already!

Early results are looking positive and our next step is to report back with a brief analysis to each institution. We are planning to give them our data and a general set of data so that they can compare and contrast. There have been some issues with the data, some of which has been described in previous blogs, however, we are confident we have enough to prove the hypothesis one way or another!

In our final project meeting in July we hope to make a decision on what form the data will take when released under an Open Data Commons Licence. If all the partners agree, we will release the data individually; otherwise we will release the general set for other to analyse further.

I submitted Lincoln’s data on 13 June. It consists of fully anonymised entries for 4,268 students who graduated from the University of Lincoln with a named award, at all levels of study, at the end of the academic year 2009/10 – along with a selection of their library activity over three* years (2007/08, 2008/09, 2009/10).

The library activity data represents:

  1. The number of library items (book loans etc.) issued to each student in each of the three years; taken from the circ_tran (“circulation transactions”, presumably) table within our SirsiDynix Horizon Library Management System (LMS). We also needed a copy of Horizon’s borrower table to associate each transaction with an identifiable student.
  2. The number of times each student visited our main GCW University Library, using their student ID card to pass through the Library’s access control gates in each of the three* years; taken directly from our ‘Sentry’ access control/turnstile system. These data apply only to the main GCW University Library: there is no access control at the University of Lincoln’s other four campus libraries, so many students have ’0′ for these data. Thanks are due to my colleague Dave Masterson from the Hull Campus Library, who came in early one day, well before any students arrived, in order to break in to the Sentry system and extract this data!
  3. The number of times each student was authenticated against an electronic resource via AthensDA; taken from our Portal server access logs. Although by no means all of our e-resources go via Athens, we’re relying on it as a sort of proxy for e-resource usage more generally. Thanks to Tim Simmonds of the Online Services Team (ICT) for recovering these logs from the UL data archive.

I had also hoped to provide numbers of PC/network logins for the same students for the same three years (as Huddersfield themselves have done), but this proved impossible. We do have network login data from 2007-, but while we can associate logins with PCs in the Library for our current PCs, we can’t say with any confidence whether a login to the network in 2007-2010 occurred within the Library or elsewhere: PCs have just been moved around too much in the last four years.

Student data itself—including the ‘primary key’ of the student account ID—was kindly supplied by our Registry department from the University’s QLS student records management system.

Once we’d gathered all these various datasets together, I prevailed upon Alex Bilbie to collate them into one huge .csv file: this he did by knocking up a quick SQL database on his laptop (he’s that kind of developer), rather than the laborious Excel-heavy approach using nested COUNTIF statements which would have been my solution. (I did have a go at this method—it clearly worked well for at least one of the other LIDP partners—but it my PC nearly melted under the strain.)

The final .csv data has gone to Huddersfield for analysis and a copy is lodged in our Repository for safe keeping. Once the agreement has been made to release the LIDP data under an open licence, I’ll make the Repository copy publicly accessible.

*N.B. In the end, there was no visitor data for the year 2007/08: the access control / visitor data for that year was missing for almost all students. This may correspond to a re-issuing of library access cards for all users around that time, or the data may be missing for some other reason.

Library catalogue: Site Search analytics

Posted on March 17th, 2011 by Paul Stainthorp

A while ago (and, as with all things Horizon, with the help of Dave Pattern at Huddersfield), we enabled Google Analytics on our library OPAC (sometimes referred to as HiP, the “Horizon Information Portal“). This takes the form of a piece of Google JavaScript which lives in a ‘footer’ document common to all HiP pages.

Chris Leach gave a presentation about using Google Analytics with HiP at the last SirsiDynix Horizon User Group.

Now Nick Jackson has shown me how to enable Google’s Site Search features on our Analytics profile for the library catalogue. Site Search will allow us to ‘tease out’ the search activity within the library catalogue itself, by analysing the URL structure of HiP queries, recognising and extracting the search terms, then tracking the paths users take from those search queries to destination pages (i.e., individual bibliographic record pages on HiP).

For instance: a typical HiP search query ends up looking something like this:

http://www.library.lincoln.ac.uk/ipac20/ipac.jsp?session=1G00362045UH4.101795&menu=search&aspect=subtab13&npp=10&ipp=20&spp=20&profile=ln&ri=&index=.GW&term=journalism&x=0&y=0&aspect=subtab13

By telling Google Site Search to look in the query parameter “term” for the search keyword(s)—in this case journalism—and to ignore the “session” parameter, Google Analytics can start to group similar queries together and provide us with data about what our users are searching the catalogue for.

Screenshot of the setup page for Google Site Search

It’s been running for less than 24 hours, but already we’re starting to see build up a record of the keywords people are typing into the catalogue:

Screenshot of Google Site Search top search terms on HiP

What could we [and what should we] do with this data? Are there any Google Site Search experts out there who could give me a few tips? If anyone from within the Library at the University of Lincoln would be interested in helping to analyse the search term data, please let me or Chris know.

One thing we’ve already discussed is the idea of using the HiP search term activity as test data to ‘teach’ the Jerome machine intelligence engine about the kind of things Lincoln library users are interested in… this will help us in determining how the Jerome API’s personalisation features might be used to present and relevance-rank results.

Total ReCal magazine article

Posted on November 10th, 2010 by Paul Stainthorp

This article appeared in the last University of Lincoln staff magazine (September 2010, issue 5, p.5). I’m representing the Library on the group managing this JISC project.

Total ReCal will help students to keep track

A new Joint Information Systems Committee (JISC) funded research project has been launched to merge calendars around the University.

Total ReCal, led by the staff from the Centre for Educational Research and Development, the Library, and ICT, will improve the student experience by collating data from systems across the support departments.

Joss Winn, Technology Officer, said: “All student calendars will comprise data from timetabling, Blackboard and the Libraryʼs Horizon system. One of the big motivations behind the project is that students have no easy way of finding out hand-in deadlines for assignments, being informed if the deadlines change, and seeing the deadlines marked on a calendar alongside academic timetables. This service will provide the solution.”

The combined data can also be used in a host of applications such as Google Calendars and Facebook. The software will be offered to the JISC community for use by other institutions when the project has been completed.

For further information visit http://totalrecal.blogs.lincoln.ac.uk

Anonymised library book circulation data for the academic year 2008/2009: collected for the JISC MOSAIC project

Posted on August 17th, 2010 by Paul Stainthorp

mosaic.2008.level1.1265378452.0000001.xml

The University of Lincoln collected one academic year’s worth of its own library book circulation data (“user activity data”) for the JISC-funded MOSAIC project, which set out to investigate the technical feasibility, service value and issues around exploiting user activity data. Data was collected for the period 1 September 2008 – 31 August 2009. Lincoln’s data was processed according to a data schema common to all participants in the MOSAIC project; any data that might be used to identify an individual library user was removed or anonymised.

View this item on the University Repository: http://eprints.lincoln.ac.uk/2164/