Posts Tagged ‘Lincoln Repository’

Throw down the SWORD

Posted on May 7th, 2013 by Paul Stainthorp

With the Orbital project at its end, and plans for a University research information / research data service afoot, I’m reviewing the excellent work carried out by our (now-departed) developers Harry Newton and Nick Jackson – work which linked up CKAN, the Orbital ’bridge’ application, and the Lincoln Repository (EPrints) using SWORD – described in earlier blog posts here and here.

“One important piece of work that we’re undertaking at the moment in Orbital is the facility to deposit the existence of a dataset, from CKAN and the University’s new Awards Management System (AMS), into our (EPrints) Repository via SWORD – at the same time requesting a DOI for the dataset via theDataCite API. The software at the centre of this operation is what we refer to as Orbital Bridge.”

This deposit workflow is now broadly working as it should – I think only a few tweaks would be necessary now to turn this into a working tool for the University of Lincoln.

Most urgent is the need for the University to sign up with the DataCite DOI service, which would secure a DOI for each dataset record deposited from CKAN and hence formally published by the University. This subscription should form part of the new research information service.

The underlying code could be used for other SWORD-enabled deposit from sources of metadata (e.g. the Library’s discovery system, Find it at Lincoln), to the Lincoln Repository as the University’s bibliographic ‘system of record’.

Warning: this is an extremely screenshot-heavy blog post! Click on any one of the screenshots below to view a larger image.

Here’s a step-by-step walkthrough of the entire process of adding a dataset to CKAN, and depositing it as a record in the Lincoln Repository.

  1. Go to the Researcher Dashboard at: https://orbital.lincoln.ac.uk/ and click on “Sign In”.
    Screenshot from the Researcher Dashboard
  2. Enter your staff accountID and password to sign in to the Researcher Dashboard.
    Screenshot from the Researcher Dashboard
  3. Once you have been signed in and returned to the Researcher Dashboard, click on your name (in the top right-hand corner) and then click on “My Projects”.
    Screenshot from the Researcher Dashboard
  4. You will see an overview of your research projects – both funded projects (derived from the AMS), and unfunded projects you have added locally. Click on the name of the project you want to add data to.
    Screenshot from the Researcher Dashboard
  5. You will be taken to a page for that research project. On the right-hand side of this page, under the heading “Options”, click on “Create Research Data Environment”.
    Screenshot from the Researcher DashboardImage7
  6. You will be taken to the University’s CKAN research data platform, where a page/group will have been created which corresponds to your project in the Researcher Dashboard. Sign in to CKAN using your staff accountID (there is currently no single sign-on between the Researcher Dashboard and CKAN) and password and you should be returned to the same page. However you will probably be sent instead to the CKAN home page, in which case you will have to look again for your project under the “Groups” menu.
    Screenshot from CKAN
  7. Toward the top of the project screen in CKAN, click on “Add Dataset” > “New Dataset…”.
    Screenshot from CKAN
  8. Fill in the form with information about the overall dataset, including the following fields:
    • Title
    • URL
    • License (N.B. US spelling!)
    • Description
      Screenshot from CKAN
  9. Then click on “Add Dataset”
    Screenshot from CKAN
  10. If you now click on “Further information” tab on the left-hand menu, you can add the following additional information about the dataset (this is not obvious from the initial dataset form):
    • Author
    • Author email
    • Maintainer
    • Maintainer email
    • Version
    • Summary [of changes]
      Screenshot from CKAN
  11. To attach individual data document(s)—which CKAN refers to as “resources”—to the dataset, scroll down the page and click on “Upload a file” (there are other options) > “Choose file” > “Upload”.
    Screenshot from CKAN
  12. Then fill in the form with the following basic information about the “resource”:
    • Name
    • Description
    • Format
    • Resource Type
    • Datastore enabled (ticked by default)
    • Mimetype
    • Mimetype (Inner)
    • “Extra Fields” (user-defined, or used by Orbital)
      Screenshot from CKAN
  13. To deposit a record for this dataset in the Lincoln Repository, go back to the Orbital Researcher Dashboard at: https://orbital.lincoln.ac.uk/ and navigate to your project. Toward the bottom left of the page you should now see a table containing the dataset(s) you have created in CKAN for this project. Choose which dataset you want to deposit, and hit the “Publish to Lincoln Repository” button.
    Screenshot from the Researcher Dashboard
  14. The Researcher Dashboard will then display a deposit form containing the following fields (some of which should be being autopopulated from CKAN fields but which do not appear to be):
    • Title
    • Description
    • Type of Data
    • Keywords
    • Subjects
    • Divisions
    • Metadata visibility [Show|Hide]
    • People
      Screenshot from the Researcher Dashboard
      “Publishing will publicly announce the existence of your dataset on the Lincoln Repository, as well as start the process of long-term preservation of your data.“Usually you should only publish a dataset either at the end of a research project, or if the data is being cited in a paper. Publishing a dataset will place some restrictions on the changes you can make to the dataset in the future, such as removing your ability to delete the data. It will also generate a DOI, which allows your dataset to be uniquely identified and located using a simple identifier.“Please check the information in this form and make any necessary changes, as this is the information which will be entered into the published record of the dataset.“If you have any questions about this process please contact a member of the research services team for advice or assistance.”
  15. When you hit the “Publish Dataset” button, the dataset record from CKAN will be used to create a record in the Lincoln Repository. The record will be submitted for review by the Repository team, who will then make it live. N.B. for the time being, you will see an error “Validation errors: [doi] is a required string“ – this happens because the University does not currently have access to the live DataCite DOI service, which would secure a DOI for each dataset record deposited from CKAN. This should form part of the new research information service.
    Screenshot from the Researcher Dashboard
  16. Here’s an example of a record in the Lincoln Repository, created from a CKAN dataset and made live by the Repository team.
    Screenshot from the Lincoln Repository

Problems with the deposit process as it currently stands:

  1. Permissions are not correctly cascaded from a project the Researcher Dashboard to a group in CKAN.
  2. There is currently no single sign-on between the Researcher Dashboard and CKAN.
  3. When CKAN challenges a user to log in to a group, they should be redirected back to the group page after logging in – instead they get sent back to the CKAN home page, in which case they will have to look again for their project under the “Groups” menu.
  4. A minor one – in CKAN ”License” (noun) appears in US spelling (should be “Licence”).
  5. In order to add all the information needed to deposit a dataset from CKAN, user has to click  ”Further information” tab on the left-hand menu (this is not obvious from the initial dataset form).
  6. Some of the field labels in CKAN are a bit opaque or use technical terms (“Mimetype”) which could do with explanation.
  7. When depositing to EPrints, some of the deposit fields should be being autopopulated from CKAN fields – this does not appear to be happening. The fields affected are:
    • “Description” (could be derived from CKAN dataset/resource Description fields)
    • “Type of Data” (could be derived from CKAN resource Format field)
  8. Repository records created from CKAN have the data “Creator” attached, but not the “Maintainer”.
  9. Repository records created from CKAN don’t have a link back to the CKAN dataset (should go in the EPrints “Official URL” field) – this will be required to provide access to the data.
  10. After deposit, users see the error message “Validation errors: [doi] is a required string” – the University does not currently have access to the live DataCite DOI service, which would secure a DOI for each dataset record deposited from CKAN.

Back and blue: the Lincoln Repository has been upgraded

Posted on March 14th, 2013 by Paul Stainthorp

Thanks to three days of intensive work by ICT Services (Tim Simmonds & David Whitehead), the Lincoln Repository is now back online at: http://eprints.lincoln.ac.uk/

What’s different (better!) about the new version?

  • Now running the newest version of EPrints software (v.3.3);
  • Styled using the University of Lincoln corporate blue Common Web Design;
  • Includes the REF2014 plugin (http://www.eprints.org/ref2014/);
  • Includes access to the EPrints “Bazaar” plugin store for extending the Repository (http://bazaar.eprints.org/);
  • Has a greatly improved system for managing staff users (developed by David Whitehead), which should make the Repository run much faster;
  • Will provide faster updates to the Staff Directory (http://staff.lincoln.ac.uk/);
  • Provides a better platform for future development and integration with other University services.

Inevitably there will be a few oddities and teething problems with the new Repository. If you spot anything that doesn’t seem to be behaving, please report it using the Feedback form on the library website (http://lncn.eu/nty) and we’ll investigate.

For posterity’s sake: a screenshot of the old, green, EPrints 3.1 Repository (including the message we put on it warning of the upgrade):

Screenshot of the old Lincoln Repository

And the new, CWD-blue, EPrints 3.3 version:

Screenshot of the new Lincoln Repository

Disruption to the Lincoln Repository: 11th-13th March

Posted on March 7th, 2013 by Paul Stainthorp

As part of work to upgrade and improve the Lincoln Repository, the service will be “locked” between 11-13 March. The Repository will still be publicly accessible, but you will not be able to log in or add any new content.

The Lincoln Repository exists for the permanent deposit of research materials produced by our community of staff and students. Repository content can be browsed or searched through http://eprints.lincoln.ac.uk/ or through searching Google. Wherever possible, repository content is freely available for download and re-use.

As part of a project to upgrade and improve the Repository, including work to support the University’s REF (Research Excellence Framework) submission, the Repository software will be upgraded between 11-13 March 2013.

During the upgrade, material posted to the Repository will still be publicly accessible, but the Repository will be “locked”, meaning you will not be able to log in or add any new content.

The upgraded Repository will become accessible again on 13 March.

If you have any questions or concerns about this upgrade process, please email: eprints@lincoln.ac.uk

Changes to the University structure on the Lincoln Repository

Posted on February 1st, 2013 by Paul Stainthorp

Screenshot from the Lincoln RepositoryThe following changes have been made to the browsable University structure in the Lincoln Repository (eprints.lincoln.ac.uk):

  1. The former Faculty of Agriculture, Food & Animal Sciences has been removed.
  2. National Centre for Food Manufacturing (Holbeach Campus) has been moved into College of Sciences > Faculty of Science.
  3. Also in the Faculty of Science, a node has been created for the new Lincoln School of Pharmacy.
  4. In the Faculty of Health & Social Sciences, two departments have been updated to reflect slight changes in name:
    1. School of Social & Political Sciences
    2. School of Sport & Exercise Science

This structure is used to organise content in the Repository, it’s also used to organise the Quarterly Research Output Reports, which are generated from Repository data.

In a future iteration of the Repository we intend to take the University structure directly from Lincoln’s data API layer, Nucleus.

University of Lincoln Library Plan (bullet points)

Posted on December 14th, 2012 by Paul Stainthorp

The University of Lincoln has an internal Library Plan document, which sets out areas of strategic development for our service over the next 3-5 years.

Here are some bare notes I made in a meeting about the Library Plan this morning, relating to e-resources tasks and priorities over the next year. They’re here so I don’t lose them. They won’t make much sense without the Plan itself (which you probably won’t have read), but if you’re interested in what any of these mean, please get in touch.

These will be my priorities in 2013.

  1. Integration of LibGuides with Blackboard VLE.
  2. Helpdesk / enquiry management.
  3. Review use of reference management software.
  4. Resurrect the Calm (archives & special collections) project.
  5. REF2014 and beyond – plan for strategic development of the Lincoln Repository, including RDM and e-theses.
  6. Close the loop on authentication: OpenAthens LA, walk-in access, LOCKSS.
  7. Stats! Look at the University’s data-warehouse and dashboard approach, Nucleus/data.lincoln.ac.uk, and external services/software (JUSP, RAPTOR, etc.)
  8. Plan with the subject librarians to hone the use of Find it at Lincoln.
  9. Plan with the subject librarians/acquisitions to ‘hard launch’ reading lists; look at collection management.
  10. “Business Information Service”?

Change to request procedure on the Lincoln Repository

Posted on December 12th, 2012 by Paul Stainthorp

We have made a slight change to the way the Lincoln Repository deals with requests for copies of restricted-access items from readers.

It’s no longer possible to change the status of an item (making it publicly available) at the same time as confirming the request. If you think a restricted-access item on the Repository should be made publicly available (i.e. made Open Access), please email the Repository team.

Screenshot from the Lincoln Repository

Orbital deposit of dataset records to the Lincoln Repository: workflow

Posted on December 6th, 2012 by Paul Stainthorp

Further to yesterday’s blog post about linking our CKAN datastore with our EPrints Repository (to allow researchers to deposit permanent, public, citable records of their datasets), here’s a fleshed-out diagram of the proposed dataset deposit workflow process.

At the moment, this assumes a one-time “fire and forget” deposit. At some point, we’re going to have to tackle versioning.

The original diagram is available on Lucidchart. See the table in my previous blog post for details of which data fields are involved in the process (i.e. passed between CKAN, Orbital Bridge, the DataCite API, and EPrints).

This is a proposal and still has to be road-tested. Comments welcome.

Diagram of the dataset deposit process

Stages in the proposed deposit process:

  1. User enters project metadata in AMS
  2. AMS creates project container in CKAN
  3. User creates dataset record in CKAN
  4. Nucleus adds user metadata to CKAN
  5. User deposits data in CKAN
  6. User presses “DEPOSIT DATASET” button in CKAN
  7. Orbital Bridge requests DOI
  8. DataCite API returns DOI
  9. Orbital Bridge adds DOI to dataset record in CKAN
  10. User reviews and approves dataset metadata (making changes if necessary)
  11. Orbital Bridge writes changes back to dataset record in CKAN
  12. Orbital Bridge creates a new EPrints record via SWORD
  13. EPrints confirms existence of new record
  14. Orbital Bridge writes EPrints record URL back to CKAN dataset record

Orbital: AMS–CKAN–EPrints–DataCite

Posted on December 5th, 2012 by Paul Stainthorp

One important piece of work that we’re undertaking at the moment in Orbital is the facility to deposit the existence of a dataset, from CKAN and the University’s new Awards Management System (AMS), into our (EPrints) Repository via SWORD – at the same time requesting a DOI for the dataset via the DataCite API. The software at the centre of this operation is what we refer to as Orbital Bridge. Here’s a diagram of how the various systems will need to link together. Diagram of data flow between systems The table below shows how fields may be mapped between systems. DataCite properties are taken from the DataCite MetaData Schema (v2.2). This is very much a work in progress! In particular, the red question marks (?) in the “CKAN field” column indicate fields that may not yet exist in the source system (CKAN). It’s in no particular order yet.

The following DataCite properties are optional, and we don’t intend to use them at the moment.

  • 3.1 – TitleType
  • 9 – Language
  • 12 – RelatedIdentifier
  • 12.1 – relatedIdentifierType
  • 12.2 – relationType
  • 13 – Size
  • 15 – Version

 

Search limiters in Find it at Lincoln

Posted on October 11th, 2012 by Paul Stainthorp

Find it at Lincoln uses limiters to allow you to refine searches to particular types of item. We’ll make changes to our limiters in response to user feedback and as we learn more about how to get the best out of the system, but here’s how our limiters are set up at the minute.

You can apply some limiters before you search. On the basic search screen you can view the available pre-search limiters by clicking on “Search Options”. Limiters are also displayed on the advanced search screen.
Screenshot from Find it at Lincoln

You can also narrow your results down post-search by adding limiters from the left-hand menu (a kind of faceted searching). Three of the most important limiters always appear at the top of the menu under “Refine your results”.
Screenshot from Find it at Lincoln

1. Full text online

Applying the “Full text online” limiter should restrict a search to:

2. Library Catalogue only

This limiter restricts the search to records from the Library Catalogue – print holdings and ebooks, including PDA records.

If you tick both “Full text online” and “Library Catalogue only“, you only get ebooks (Boolean AND).
LucidChart Boolean diagram

3. Available through the Library

The third limiter, “Available through the Library“, is the sum of the previous two limiters (i.e. Boolean OR). At the moment, this limiter is applied by default to all searches. If you un-tick this limiter (switching all three limiters off), you will see results from the entirety of Find it at Lincoln, including books and articles to which we do not have access. There are pros and cons to having “Available through the Library” switched on by default, and we are still debating this at the University of Lincoln.
LucidChart Boolean diagram

In addition to these three limiters, Find it at Lincoln has one expander applied by default: “Also search within the full text of the articles” goes beyond the article metadata to look for your search terms within indexed full text. This increases the number of results, but can make searches slower, and potentially includes less-relevant results. You can see this option by clicking on the small blue arrow which appears next to the number of search results (and remove it by clicking on the small orange cross).
Screenshot from Find it at Lincoln

Other options for limiting/refining searches include:

Pre-search limiters (in “Search Options” and on the advanced search screen):

  • Apply related terms (expander) – uses a thesaurus to include results containing terms related to your search keywords.
  • Lincoln Repository only –  self-explanatory.
  • Peer reviewed – limits to peer-reviewed articles only.
  • Location – applies only to records from the Library Catalogue: location and collection of print items (example).
  • Others… Image Quick View Types, Language, Author, Date Published, Journal Name.
Post-search limiters in the left-hand faceted search menu: these are result-senstitive (i.e. you will see different options depending on what’s in your current search result set). I’ll expand on some of these in a future blog post.
  • Source Types
  • Subject
  • Publisher
  • Publication
  • Language
  • Geography
  • Location – see above.
  • Content Provider – the various databases and collections which make up Find it at Lincoln

Finally, of course, limiters can be combined for more and more specific searches. We are going to set up a separate, demo version of Find it at Lincoln so that we can test our changes to limiters/expanders/other search settings before we unleash them on the live service.

Quarterly Research Output Report 2012 Q1

Posted on September 18th, 2012 by Paul Stainthorp

The latest Quarterly Research Output Report for the University of Lincoln has been produced, covering the period January–March 2012 inclusive.

These reports paper summarize research outputs published in each quarter by academic staff at the University of Lincoln. The lists include substantive research outputs first appearing “in published form” (or equivalent for non-textual outputs) during this period. The lists have been generated automatically from data stored in the Lincoln Repository (http://eprints.lincoln.ac.uk/). Tables summarize the volume of outputs recorded by School.

The quarterly reports are themselves available to download from the Repository, at: http://eprints.lincoln.ac.uk/5873/