This document summarizes an RSS feed project at Durham University Library to automatically export new book metadata from their Millennium system into an RSS feed. The project aimed to minimize staff maintenance effort while providing a standards-compliant, automated way for readers to access new titles. Perl scripts processed the exported flat file data, loaded it into a MySQL database, and generated valid RSS XML. The finished product provided HTML and feed reader views of new titles that were refreshed weekly with minimal effort. Lessons learned included using Unicode, validating RSS, and the potential for more automation.
Automating Durham University Library's New Books List into RSS Feeds
1. RSS feeds using
Millennium data
Andrew Preater, University of London
Research Library Services
Presented at EIUG 2010, 15th June 2010
www.london.ac.uk
2. A short break in County Durham
I work for University of
London Research Library
Services, at Senate House
But I will talk about my
previous development work
at Durham University Library
3. Introduction
The problem is the new books list
We use these to list new items for
readers as a current awareness tool
Various ways to do this...
6. Problems
High maintenance
Not split by subject; not easily 'mashable'
Usage next to nothing by 2007-08
10 hits!
7. RSS feed improvements
Puts our metadata where the
reader is
Much less work for library
staff
Standards-based XML data,
can be reused elsewhere or
mashed up
RSS feed icon from www.feedicons.com
8. Project as proof of concept
Low-risk pilot for automated export and
processing of Millennium data
Demonstrates the utility of this approach for future projects
Quickest and easiest example using this
approach
9. Desired outcomes
Automated as much as possible
Minimal effort by non-systems staff to
maintain
No special software – no budget!
Stable and reliable, 'just works'
10. Software used
Other than Millennium...
1. Linux server with Perl installed
2. MySQL database
3. Web server running PHP
11. Basic idea
A featured list was created each week
based on changing book item status to 'd'
So a „new books‟ review file was being
made...
New step added: export the contents
of the review file and reuse it
12. Export these fields
BIB MARC 245 $a
BIB MARC 245
BIB AUTHOR
BIB IMPRINT
BIB SUBJECT
BIB RECORD #
ITEM FUND CODE
ITEM SHELFMARK
ITEM LOCATION
13. Example single item
"Dead white men and other important people :"~"Dead white men and other important people : sociology's big ideas / Ralph Fevre and Angus Bancroft."~"Fevre, Ralph, 1955-"~"Basingstoke : Palgrave Macmillan, 2010."~"Social sciences -- Philosophy.";"Sociology."~"b25978974"~"bgsoc"~"300.1 FEV"~"main4"
14. Processing this list
Perl script run every 15 minutes by cron:
1. Checks if there is a new file
2. Processes the data
3. Loads it into a MySQL database
4. Cleans up
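The steps above can be sketched in a few lines. The original scripts were Perl; this is a Python sketch under stated assumptions: the file path comes from the caller, the `.done` rename is a hypothetical clean-up convention, and loading the returned rows into MySQL is assumed to happen elsewhere.

```python
import csv
import os

def check_and_process(path):
    """Run from cron: if a new export file has appeared, parse it,
    return the rows (ready for loading into MySQL), and rename the
    file so the next run does not reprocess it."""
    if not os.path.exists(path):
        return []                                   # nothing new this run
    with open(path, newline="") as fh:
        # The Millennium export is tilde-delimited with quoted fields
        rows = list(csv.reader(fh, delimiter="~", quotechar='"'))
    os.rename(path, path + ".done")                 # step 4: clean up
    return rows
```

Running this every 15 minutes is cheap: most runs find no file and return immediately.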
15. Step 2: tidying up the data
1. Replace & with &amp;
2. Insert RFC 822-compliant date
3. Strip quotation marks around fields
4. Strip trailing non-alphanumeric character in 245 $a
5. Lowercase fund codes
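These clean-up rules are simple string transformations. A Python sketch (the original was Perl; the helper names are hypothetical, and `formatdate` produces a four-digit-year RFC 2822 date rather than the two-digit form shown in the example):

```python
import re
from email.utils import formatdate  # builds RFC 822-style dates

def tidy(field):
    """Rules 1 and 3: strip surrounding quotes, escape ampersands for XML."""
    return field.strip('"').replace("&", "&amp;")

def tidy_title(title_a):
    """Rule 4: also drop a trailing non-alphanumeric character from 245 $a."""
    return re.sub(r"[^A-Za-z0-9]$", "", tidy(title_a)).rstrip()

def tidy_fund(code):
    """Rule 5: lowercase fund codes."""
    return tidy(code).lower()

# Rule 2: stamp each record with an RFC 822-compliant date
date_stamp = formatdate(localtime=True)
```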
16. Step 2: example single item
|Dead white men and other important people|Dead white men and other important people : sociology's big ideas / Ralph Fevre and Angus Bancroft.|Fevre, Ralph, 1955-|Basingstoke : Palgrave Macmillan, 2010.|Social sciences -- Philosophy.";"Sociology.|b25978974|bgsoc|300.1 FEV|main4|Mon, 07 Jun 10 12:31:01 BST|Mon, 07 Jun 10 12:31:01 BST|
17. Step 2: example single item
245 $a:    Dead white men and other important people
245:       Dead white men and other important people : sociology's big ideas / Ralph Fevre and Angus Bancroft.
Author:    Fevre, Ralph, 1955-
Imprint:   Basingstoke : Palgrave Macmillan, 2010.
Subject:   Social sciences -- Philosophy.";"Sociology.
Record #:  b25978974
Fund code: bgsoc
Shelfmark: 300.1 FEV
Location:  main4
Date:      Mon, 07 Jun 10 12:31:01 BST
18. Database
Two tables are used:
items is refreshed weekly: contains our books information
fundmap maps Millennium fund codes to subjects. Export is automated but doesn't need to run weekly
19. fundmap example
deptcode  fundcode  deptname                    site
ECON      bceco     Economics & Finance         DURHAM
HIST      bchis     History                     DURHAM
MEIS      bbcme     Govt & Intl Affairs/IMEIS   DURHAM
MEIS      bxabc     Govt & Intl Affairs/IMEIS   DURHAM
CTV       ctvl1     Trevelyan College Library   DURHAM
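The lookup this table supports amounts to mapping a fund code to a department. A Python sketch using an in-memory dict in place of the MySQL table (the real system joins items against fundmap in SQL; the "Unclassified" fallback is a hypothetical default, and the rows shown are taken from the fundmap example):

```python
# fundmap rows as a dict: fundcode -> (deptcode, deptname)
FUNDMAP = {
    "bceco": ("ECON", "Economics & Finance"),
    "bchis": ("HIST", "History"),
    "bbcme": ("MEIS", "Govt & Intl Affairs/IMEIS"),
    "ctvl1": ("CTV", "Trevelyan College Library"),
}

def subject_for(fundcode):
    """Resolve an item's fund code to its subject department, if known."""
    dept = FUNDMAP.get(fundcode.lower())
    return dept[1] if dept else "Unclassified"
```

Because several fund codes can share a deptcode (as with MEIS above), many funds can be clumped into one subject feed.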
20. Web front end
PHP script hosted on IT Service Web
server will serve the feeds
http://www.dur.ac.uk/reading.list/newitems.php?dept=HIST
Parameter is 'all' or a subject code
21. What it does
1. Selects items from the database
2. Writes beautiful, valid RSS
3. Serves it up to the browser
A bit more detail...
22. Generating RSS feed XML
Write <title>, <description>, <link>, <image> once
For each database line, write one full RSS news <item>
Item <title> is 245 $a and links to catalogue bib record
Item <description> contains data, can include encoded HTML
Item <description> author, shelfmark and subjects hyperlinked to catalogue search.
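A sketch of the per-entry <item> generation in Python (the real front end is PHP; OPAC_BASE is a hypothetical catalogue URL prefix, and the HTML in the description is entity-encoded so feed readers can render it):

```python
from xml.sax.saxutils import escape

# Hypothetical OPAC URL prefix; substitute your catalogue's bib-record URL
OPAC_BASE = "http://library.dur.ac.uk/record="

def rss_item(short_title, description_html, bib_number):
    """Build one RSS 2.0 <item>: <title> from 245 $a, <link> to the
    bib record, and an entity-encoded HTML <description>."""
    link = OPAC_BASE + bib_number
    return ("<item>"
            f"<title>{escape(short_title)}</title>"
            f"<link>{escape(link)}</link>"
            f"<description>{escape(description_html)}</description>"
            "</item>")
```

The channel elements (<title>, <description>, <link>, <image>) are written once at the top of the feed; each database row then becomes one such <item>.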
23. Finished product - I
Shown in
Akregator feed
reader
Running
happily since
August 2007
24. Finished product - II
HTML version of
RSS feeds on
Library Web site
Also: in-house PC
screensavers,
plasma displays...
25. Summary
Millennium review file
Exported flat file
Process and load into database
Display with Web front end
26. Lessons learned
Easiest to use Unicode everywhere
Write valid RSS 2.0 or Atom, use
http://feedvalidator.org for hints
Few complaints; change uncovered a tiny
hard core of featured lists fans
That said...
27. "Couldn't you automate this?"
You can automate
much of it with Expect
or AutoIt
Recommend Marc
Dahl's presentation on
Expect for Innopac:
http://bit.ly/dahl-expect
28. Following on from this...
Automated export and processing used for:
Exporting Course Reserves to Blackboard
Display of e-resources data in CMS
Sending fines data to Oracle Financials
29. Thank you!
Any questions?
Contact me
Email: andrew.preater@london.ac.uk
Twitter: @preater
Editor's Notes
I’m going to talk about work I did at Durham University. I no longer work there, having moved to University of London. I’d like to thank Jon Purcell, University Librarian at Durham, for permission to talk to you about this system today.
The problem is the new books list. These are also known as acquisitions and accessions lists, which are more horrible, library-jargony terms for the same sort of thing. We’re talking about new books and stuff. New items, maybe? Suggestions welcome. What we’re going to do is create lists that make readers aware of new items available at the library.
Note: I do not recommend doing this as your only way of advertising new stock.
Durham’s previous solution was a featured list.
Problems of this… The featured list was taking up substantial staff time in manual tweaking each week. The list wasn’t split by subject and there was no practical way of achieving this without taking up loads of review files. The list presented couldn’t be easily reused or displayed elsewhere. By academic year 2007-08 usage had dropped to just ~6 unique visitors a week. As the only advertising of new books, that’s not good enough.
The idea of moving new books to an RSS feed is one of those “obvious” Web 2.0 improvements libraries come up with. RSS feeds allow readers to view the lists wherever they want in their choice of client. “Save the time of the reader”. We can move much of the processing to automated scripts. “Save the time of the staff”. Better, we can reuse the RSS feeds to push our new books lists to other places – like the Web and Twitter. We’ll get on to that stuff later.
An important real reason for doing this was to pilot this approach of data export and processing. Making RSS feeds of new books is low-risk and demonstrates this technique is workable before we start asking other University departments to do development work of their own to reuse our data.
So this is what I wanted to see at the end. I wanted a system that would run without requiring constant attention or manual fettling of data. It was important this didn’t introduce additional, onerous work for our cataloguers. Even more important, I had no budget for any extra software.
I’m just going to assume everyone has a Millennium server. I wanted to make use of the excellent database and Web hosting platform provided by Durham’s IT department, so my choice of technologies was made. Of course you could use different scripting languages and databases. You might even run it all on Windows… but friends, why punish yourself?
I mentioned the featured list being created each week. This was based on marking items as “new” by changing their item status. “New books” as an idea was already integrated into the cataloguing workflow. It was an easy next step to export the contents of the review file and reuse it. This might not work for you. At Senate House I’ve found it best to talk to the head cataloguer to work through how to approach this.
Our cataloguer just needs to export these fields into a tilde-delimited text file. This file is saved onto a networked drive that will be accessible to my Linux server for processing.
Sorry for putting this wall of text in front of you. I wanted you to get an idea of what we’re actually working with.
Onwards… several Perl scripts running on a Linux server now do all the work here. Here’s how it works in practice: on a Friday morning, a cataloguer saves a copy of the exported list of new items. Shortly after, the script will run and notice there is new data. This is processed, then loaded into a database. It’s worth looking at the “processing” stage in a bit of detail. I promise not to subject you to pages of Perl script…
This is what “processing” the data means. I want to demystify this. Basically we’re just getting it into a form that can be loaded into MySQL. The program loads the exported data, rewrites it to tidy up the formatting, then writes it out into another file…
This is the processed version of the same item we looked at before. This is loaded straight into a MySQL database by the Perl script.
For clarity I wanted to break this item up to show you where the data has come from in Millennium. [This can be skipped]
A little bit about the database. There are two tables – items contains the new books themselves, whereas fundmap is a table to relate the Millennium fund codes to the subjects they represent. At Durham, the fund codes used can be trusted to always relate to the department they were purchased for. This won’t work very well at Senate House - I’m looking at using item locations instead.
Here’s a snippet of the fundmap database to show you what it looks like. We’re going to use the deptcode (department code) from this database to clump together multiple fund codes into one subject department or subject name. I’ll spare you the gory detail as it involves SQL.
The final step is a PHP application that will actually serve the RSS feeds to the end user. In PHP because that’s what is supported on the IT Service Web server. Down the bottom is the form of the URL for querying the database for new items. We’re using the history department code here.
This is a very broad outline… The PHP program connects to the database and selects items, either all of them or by subject name. The sorting of the list happens at this stage – our feeds are sorted by shelfmark, which is DDC. I don’t want to wade through the whole PHP script telling you what it does, here are some highlights...
This is the finished, formatted RSS feed. Firstly the program writes in what are called the “channel” elements, data that describe the feed as a whole. Then for each entry retrieved from the database, we write out an “item” element. The <link> element is a link to the OPAC bib record display. You can include HTML in an RSS <description>; most clients will render it. I’ve hyperlinked author, shelfmark and LC subject headings to searches in the OPAC. Subject headings are an attempt to provide some “find more like this” functionality. Presentation is meant to be simple. Everything has to make sense displayed out-of-context, away from a desktop PC. The 245 $a is used to present a nice short title for reuse elsewhere, for example.
So here’s the finished product in an RSS news reader. As you can see I’ve not been keeping up with new books at Durham. It’s been working with very few problems since August 2007.
Here’s an example showing reuse of the RSS feeds to provide a display of new books on the Library Web site. The feeds can be reused elsewhere, such as in-house screensavers and flat screen displays.
This is a summary of the process. Start with a review file. Export the bib and item data to a flat file. Process it, then load into a database. Use this as a basis for creating RSS feeds.
Some lessons learned during this process. It’s easiest just to make everything use Unicode end-to-end from the very beginning. It’s polite and quite easy to write valid RSS or Atom feeds. Use feedvalidator to provide tips on good practice even if yours are already valid. We had very few complaints, except from one or two people who’d been using the featured lists extensively. Only one person really ranted on about it…
Yes indeed, we can automate the review file and export stage. I recommend Expect.
Following this trial I implemented more automated export and processing of Millennium data. Any of these could easily be a separate presentation…! The Course Reserves feed creates an XML feed of reading list items which is read in by Blackboard. The e-resources feed creates chunks of HTML which are reused in the CMS to list databases and e-journals information. The fines feed securely uploads patron data direct to the university treasurer for end-of-year fines clearing purposes.