Lessons from the Bazaar
Open Source Software Use and Development in Libraries
Mary Ellen Petrich
November 29, 2009
Seminar in
Information Science: The Open Movement and Libraries
LIBR 287-05 Fall 2009
School of Library and
Information Science San Jose State University
Published in Open and
Libraries Class Journal, Fall 2009
Abstract
Open source software has had a big
impact on libraries. This paper discusses the importance of evaluating and
participating in OSS development communities, and why libraries may want to
start their own OSS projects. When libraries use OSS software and fail to
participate in the user community, the library risks of losing sight of the
project direction, which may ultimately make the product unsuitable for that
specific library. While library staff may not be able to participate as
programmers, other types of participation can assist the community such as evaluating
software and writing documentation for best practices. Participating in OSS
communities can educate library staff in the practical skills needed to develop
in-house software.
Lessons
from the Bazaar: Open Source Software Use and Development in Libraries
Open source software has
become more common in the library and in everyday life. Two of the most successful
open source software projects are the GNU/Linux operating system and the Apache
web server. Apache (http://www.apache.org) has been used by more domains than
all the other web servers combined from 1999 to the present (October 2009 Web
Server Survey, 2009). More familiar to most people is the Firefox web browser (http://www.mozilla.com)
which in 2008 had about a quarter of the browser market worldwide, but as much
as half the market in some European countries (Lilly & Beltzner, 2009, min.
12-15). One that should be better known to the general public is OpenOffice.org
(http://www.openoffice.org) which just announced the 100,000,000th
download on October 28th, 2009 (from the OpenOffice.org home page). Each
of these open source software projects has large successful communities.
For libraries, major
open source integrated library system (ILS) software projects are: Koha,
Evergreen, and OPALS. Major open source online public access catalog (OPAC)
projects are: Scriblio, VuFind, SOPAC, and eXtensible Catalog. Libraries and
archives also use open source digital repository software such as DSpace,
Fedora, and EPrints. In addition there are many other types of open source
software that are used by libraries.
Open source software attracts
attention from many people in the library field because the philosophy of free
and open access to information tools is one which mirrors the philosophy of
libraries to provide free information. Ameral (2008) asserts, Libraries and
the open movement mesh together logically (p. 6). Chudnov proclaims,
Use, study, modify, and
copy is an immensely powerful summary of both what free software is and why
free software works. In my mind, the only more powerful statement of freedom
apart from our founding documents is the statement emblazoned on hundreds of
older libraries, many built nearly 100 years ago with Carnegie funds: Free to
All. (p. 23)
Many opinions about open
source software exist in the library community. Karen Schneider (2008), of
freerangelibrarian.com, inspires with her statement that open source software
enables librarians to be leading innovators (p. 45) who develop their own
tools. Others, like Marshall Breeding, propose caution when speaking of the
largest piece of software that any library is likely to own, the integrated
library system (ILS): As
the open source movement matures, these systems will need to compete on their
own merits and not solely on a philosophical preference (Breeding, 2008). Many
writers, both advocates and critics, caution that while
open source software is free of charge, libraries must evaluate, deploy, and
maintain the software, and those services must be paid for either by employing
staff in the library or by employing the services of a commercial vendor. Most
recently Steven Abram, a highly visible speaker in the library field who is
employed by proprietary software maker SirsiDynix, wrote an extremely negative position
paper on open source software which has garnered considerable criticism, but has
also generated some discussion about the strengths and weaknesses of the open
source versus the proprietary model of distributing software (Hadro, 2009).
While there are a small
number of superstar open source projects, some studies suggest that the
majority of open source projects are written primarily by one person and have not
gathered a self-sustaining community (Krishnamurthy, 2002). Even the most
vigorous advocates for open source software still use some proprietary products
when necessaryHoward County Library, well known for using open source software
on hundreds of public access computers uses some computers with Microsoft and
Apple software for staff computers (DeGroff, 2008).
It is difficult to
evaluate what percentage of libraries are using library-specific open-source
software. A literature survey by Houser (2009a) shows that the use of Linux as
an operating system in libraries has expanded substantially between 1999 and
2009, but the types of articles cited indicate that using Linux is much more of
an exception than a rule. In a related article, Houser (2009b) says, While the
topic may still be open to debate, there is clearly a contingent of library
professionals who feel their user experience is better with open source
software (p. 7), wording which indicates that the contingent is not large. Breedings
(2009) review of four open source discovery interfaces shows that there are a
variety of open source products available to libraries. He declares that
without a doubt, the number of libraries using open source discovery interfaces
is still much smaller than that of those using proprietary products. But he
goes on to say that the high quality of these products and their broad
geographical acceptance leads him to believe that their popularity will
increase rapidly. Breeding (2008) outlines a study that he performed which
indicated a relatively low level of interest in open source ILS options (p.
36). Open source software relevant to libraries can also be classed as Web 2.0
software, as the open source model is used to develop and distribute more
experimental types of software. A study on the availability of Web 2.0 tools on
library web sites demonstrates that it is typically the largest, most popular,
and best funded libraries that are able to experiment with new types of
software (Lietzau, 2009). Based on this subjective information it does not seem
that there are a large number of libraries that are using primarily library-specific
open-source software.
Using open source software
For most libraries the
primary perceived disadvantage of open source software is that the institution
will need additional technical expertise, and additional staff time. Because
open source software development tends to focus on features and the fun process
of software development, open source software can be more difficult to install,
configure, and maintain. Schneider suggests that this goes with the territory:
to get what you want for your library needs, you need a product that has a
higher implementation and maintenance curve (p.46). Software should be evaluated
point by point to ensure that it does get you what you need.
While some of the appeal
of open source software to libraries is the perception of a common mission with
libraries, there are other advantages, although they tend to favor more technical
users. DeGroff (2008) points out that Linux does not attract the viruses that
Microsoft operating systems do. Like Chudnov (2009), many writers extol the
four virtues of being free to use, study, modify, and copythese freedoms allow
library staff to easily and legally evaluate and use the software. Finally, many
people use open source software just because its the simplest and easiest
thing to do. As Bisson (2007b) says, sometimes its a no-brainer.
While using OSS is not
free because of the expense of the hardware and support, it does allow some
flexibility for deciding what the cost is. Many open source software packages can
either be used out-of-the-box, or substantially customized for the
institution. As an example, the University of Vermont is using DSpace with
little customization (http://badger.uvm.edu/dspace/): the user interface is the
default; the logo for the DSpace organization remains in the upper left hand
corner; the link to About DSpace goes to the DSpace web site, and the link to
Help opens default help files. On the other hand the University of Illinois
has extensively customized their version of DSpace (https://www.ideals.illinois.edu/):
the user interface is customized; the site has been rebranded from DSpace to
IDEALS; the link to About goes to a page with extensive links to policies and
procedures for using the site; and the link to Help opens extensive custom
help files, with screen shots of the customized interface.
Ultimately, as Morgan
(2009) says, using open source software means that the user has more control
since the software is open and accessible. However, in addition, the user also
takes on greater risk. Because open source software is so configurable there
are more unknowns about whether it will be possible for an institution to use
it as needed.
Participating in open source software communities
While most libraries
considering open source software are aware that the maintenance costs may be
greater than proprietary software, most consider only the benefits of a related
open source community, and not the quality or commitment of the community.
Understanding the strengths and weaknesses of the community should be part of
evaluating an open source product. As Tansley, Smith, and Walker (2005) say, a
system which still needs work but has a bustling community around it is likely
to be a better long-term bet than a more technically developed system with no
visibly active community(p. 252). Understanding how to interact with the
community of an open source project is significant to successfully selecting, deploying
and maintaining the software. Ignoring the community means a significant
increase in risk for open source product users. Van den Berg (2005) outlines a
set of criteria against which open source software can be evaluated, including
the qualities of the community, such as frequency and quality of posts, number
of users, and friendliness of respondents.
In her article, Griggs
(2009) argues that libraries have to make an effort to find and evaluate open
source software since such software rarely has the marketing materials most
proprietary software does. She suggests that libraries should contribute back
their product evaluations, even if the format is informal, so that future
customers can benefit from the effort (p. 47). Hoppenbrouwers (2007) goes
further by saying that open source product selection and assessment might be
significantly more expensive (p. 515). In the open source model, these
marketing costs are shifted to the customer, but can be spread out if multiple
evaluators contribute information.
Hoppenbrouwers (2007) asserts
that, while a user of open source software is not a customer of a vendor, that
user is the customer of a community. He says, a customer they are, with
associated real costs to pay and real benefits to enjoy (p.511). As a
community customer, an organization has a responsibility to give back to the
community. Most articles focus on community participation in the form of
software development, but there are many other roles that a community customer
can play, the simplest of which is helping other users and organizations on the
communitys discussion boards. Others include performing quality assurance, writing
documentation, and translating.
While it is easy to say
that community customers have an obligation to give back, it is unlikely that
such a motivation will be enough to drive an institution to contribute. What
other advantages does contributing return to the institution? Stam and Wendel
de Joode (2007) speculate on the motivations of individuals and institutions contributing
to an OSS community. The focus of their study is on the participation of
commercial firms in open source communities. While the motivations of a library
may not be to make money, there are many similarities in the motivations of a
library and a commercial firm in reducing costs and maximizing return on
investment. Commercial firms, like libraries, will only invest their time and
effort in the communities if they believe it will lead to additional benefits or
reduce costs (Stam & Wendel de Joode, 2007, p. 498). The motivations of
individuals are outlined briefly as developed in many other articles: building
a reputation, improving programming skills, improving software functionality, and
having fun. For institutions, a danger of not participating in the community is
that patches, releases and upgrades can come and go without notice. A problem
could arise with the product, and without the goodwill of continued
interaction, the responses of the community could be slower and less helpful. Without
active involvement, an institution could find that product development has
taken a turn which makes a valued tool unusable in future releases.
The advantages to
participating in the community are many. Reading about the challenges of other
software users educates and informs the IT staff of an organization, improving
their understanding of the software. Active community members are more aware of
the sometimes frequent software releases, upgrades, and patches of open source.
Active participants have greater influence on product development. Organizations
may participate in the community to ensure that mundane tasks are performed
which individual contributors may not enjoy, such as documentation,
localization, and packaging. Participating
in OSS communities can educate libraries and library staff in the practical
skills needed to develop in-house software. Some institutions
may choose open source just because they can influence development, even taking
part in bug fixes or improvements. On a larger scale, participating in the
community keeps it active and vital. Without activity and interaction, the
development of new functionalities and ideas slows down and the community
begins to fade away (Dahlander & Magnusson, 2008, p. 646). Asking
questions and answering questions, making bug reports, and requests for
functional improvements, contributing software evaluations, and best practices
stirs the pot of the community and keeps it active.
Bonfield (2009) has
written an excellent review of the major library specific open source projects.
Some of the characteristics that he evaluates are whether the software was
built from scratch, what language it uses for customization, how good the
documentation is, and how stable the organization is based on current
leadership, funding and the size and support of the community. Only Koha and
Evergreen have communities that he considers sustainable. The other five
projects that he evaluated: Blacklight, Kochief, Scriblio, SOPAC, and VUFind,
are primarily driven by a lead developer or a sponsoring department that
provides ongoing support to the project. Without that key individual, the
project could disappear.
Producing open source software
When asked why they
decided to start development of an open source project, the first thing most
librarians will list is the technical capabilities that they had in place. Its
the equivalent of saying, because we could. Looking back it is easy to see
the advantages that certain institution had that lead to success. Along with technical
skills, a technically oriented institutional culture and forward thinking
administrators are also cited as important factors in success. However, the
driving force is often a profound dissatisfaction with existing products.
At Oregon State
University (OSU) the librarians found that their patrons were unable to use the
vended federated search producteven the librarians were having trouble using
it. They evaluated the available products and couldnt find one that did what
they wanted, so in mid-2005 they decided to develop a product that became known
as LibraryFind. The institutional culture at the OSU libraries was vital in
supporting the production of this project: since the campus has an open source
lab, the administration understood the open source philosophy; and the library
had a staff member, Jeremy Frumkin, whose role was to pursue innovative
services and technologies, who could drive the project. Finally, the library
was able to get a grant to improve the software and share it with other
institutions. In January of 2007, it was released to the world as open source
software. Frumkin explains that developing LibraryFind was not the cheap way to
go, but rather it was the better solution for the library. In his opinion there
are times when proprietary software will be the better solution for a library
and other times when using available open source software or developing new
software will be the better solution. We felt the return on our investment versus
the cost of building the software was greater than our return on investment vs.
the cost of purchasing the available vended products (Bisson, 2007a, p. 38).
While the project
members had hoped to develop a community as a sustaining force for the project,
the community for LibraryFind has been small. The mailing list is active as of
this writing and averages around 15 messages per month, mainly from users
installing or configuring the software. Even these contributions assist the
project in evaluating problems and bugs in the software.
At the same time that
Jeremy Frumkin was working on LibraryFind, Kim Griggs, the lead programmer for
the OSU libraries, began a project to create a content management system for
the libraries web site called Library la Carte (Griggs, 2009). The software
was intended to assist librarians in creating interactive course assignment
pages and subject research guides, without knowing how to write code. It was
released as open source software in December of 2007, and the project was moved
to RubyForge to improve community interaction. As of this writing, the
community is still active, and averages about 10 messages per month, mainly
from users.
The Simon Frasier
University (SFU) library was a relatively early entrant to the library specific
OSS field. The reSearcher project started in 1992, and was funded in 1997 by a
consortium of Canadian academic libraries (Stranack, 2007). This project to produce
a link resolving program eventually grew into four products, including dbWiz, a
federated search product. The projects were developed in concert with other regional
academic libraries and a community of users collected around the products. Stranack
outlines the opportunities and advantages of community involvement, concluding
that participation in an open source project demonstrates to administrators
and other stakeholders that their libraries are innovative and taking
leadership positions in the critical area of information technology (p. 22). Currently,
the community for reSearcher seems to be winding down. While there was a lot of
activity on the discussion boards in 2006, as of this writing there are just a
couple of posts per month (http://lib-forums.lib.sfu.ca/viewforum.php?f=1).
In 2005, the SFU library
became a partner in the Public Knowledge Project (PKP). This project was
started by John Willinsky in 1998 while he was at the University of British
Columbia. Like the reSearcher project, this project also started with one open
source projectsoftware for creating online peer-reviewed journalsand grew
into four related projects. As of this writing, the community boards for PKP
are active, so much so that it is difficult to estimate the number of posts per
month (http://pkp.sfu.ca/support/forum/).
It often seems that
academic libraries are able to lead the way in implementing new technologies
and processes. Academic libraries often have more resources and the academic
culture is more likely to encourage staff education and experimentation. But several
exceptional public libraries have also been able to collaborate to produce open
source software. The Georgia Public Library Service (GPLS) developed Evergreen,
which is open-source integrated library system (ILS) software (Molyneux, 2009).
Howard Library is well known for moving all their patron computers completely
to open source software and even went so far as to create their own branch of
Linux, which eventually proved too time-consuming to maintain (DeGroff, 2008). John
Blyberg, previously from the Ann Arbor District Library and currently the
Darian Library in Darian Connecticut, has developed SOPAC (also called the
Social OPAC) and two related open source projects (Sheehan, 2009). Although
software development continues, the community for SOPAC is not very active (http://thesocialopac.net/forum)
(Bonfield, 2009).
Conclusion
Open source software has already had
a big impact on libraries and has the potential to become even more important. Organizations
that use open source software must consider the community of an OSS package
because of the impact it can have on the quality and support of the product. Libraries
also need to consider whether they should participate in these communities. While
library staff may not be able to participate as programmers, they can assist
the community by performing tasks such as quality assurance, writing
documentation, translating, and assisting other users. The advantages to
contributing to an open source community are: improved support when needed,
improved knowledge of the product, and an invigorated community. Not participating
means having less influence in the community and being uninformed about the future
roadmap of the product. Participating in OSS communities can educate library
staff in the practical skills needed to develop in-house software. For libraries with technical resources, developing and
sharing new software applications is a way to demonstrate leadership in the
library community.
References
Amaral, M. (2008)
Institutional Repositories, Open Source Options, and Libraries. Open and Libraries Class Journal, 1(1).
Retrieved November 1, 2009 from http://infosherpas.com/ojs/index.php/openandlibraries/article/view/11/20
Bisson, C. (2007a
May/June). Building open source. Library
Technology Reports, 43(3), 36-39.
Bisson, C. (2007b
May/June). No-Brainer. Library Technology
Reports. Retrieved November 4, 2009 from http://maisonbisson.com/oss4lib/no-brainer/
Bonfield, B. (2009, July
22). W-E-B-S-I-T-E, find out what it means to me. In the Library with the Lead Pipe. Retrieved November 3, 2009 from
Library, Information Science & Technology Abstracts with Full Text. Also
available at http://inthelibrarywiththeleadpipe.org/2009/w-e-b-s-i-t-e-find-out-what-it-means-to-me/
Breeding, M. (2008,
March). Making a business case for open source ILS. Computers in Libraries, Retrieved November 4, 2009 from
http://www.librarytechnology.org/ltg-displaytext.pl?RC=13134.
Breeding, M. (2009,
June/July). Open discovery interfaces. American
Libraries, 40(6/7) 40.
Cana, M. (2003, July 5).
Open source and Ranganathans Five Laws of Library Science. Public infosophy: socio-technological
rendering of information. Retrieved October 25, 2009, from http://www.kmentor.com/socio-tech-info/2003/07/open-source-and-ranganathans-f.html
Chudnov, D. (2009,
February). What librarians still dont know about free software. Computers in Libraries, 29(2), 22-24.
Dahlander, L., &
Magnusson, M. (2008). How do firms make use of open source communities. Long Range Planning, 41, 629-649.
DeGroff, A.B. (2008,
Winter). Free and Open Source Software at Howard County Library. Nebraska Library Association Quarterly,
39(4), 11-15.
Griggs, K. (2009,
February). Library information made to order: an open source project built for
and with librarians. Computers in
Libraries, 29(2), 13-14, 46-47.
Hadro, Josh. (2009,
October 30). Marketing materials from ILS vendor touches off a flurry of reaction.
Library Journal. Retrieved November
2, 2009 from http://www.libraryjournal.com/article/CA6704622.html
Houser, J. (2009a). Open
source operating systems in libraries: research in context. Library Technology Reports, 45(3), 8-10.
Houser, J. (2009b). Open
source public workstations and todays library world. Library Technology Reports, 45(3), 5-7.
Hoppenbrouwers, J.
(2007). Community customers. In St.Amant, K. & Still, B. (Eds.), Open source software: technological,
economic, and social perspectives (pp. 510-521). Hershey, Pennsylvania:
Information Science Reference.
Krishnamurthy, S. (2002,
June 3). Cave or community? An empirical examination of 100 mature open source
projects. First Monday, 7(6),
Retrieved November 4, 2009 from
http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/960/881
Lilly, J. &
Beltzner, M. (Speakers). (2009, September 24). Firefox, Mozilla, & open
source: software design on scale [1:09:54]. Stanford
University YouTube Channel. Video retrieved November 2, 2009 from
http://www.youtube.com/watch?v=S2771Tlr_38
Lietzau, Z. (2009
October). U.S. public libraries and Web 2.0: whats really happening? Computers in Libraries, 29(9), 6-10.
Molyneux, R.E. (2009,
January). Evergreen in context [Special section]. Bulletin of the American Society for Information Science and Technology,
35(2), 26-30.
Morgan, E.L. (2009,
March 28). Open source software: controlling your computing [Presentation for
Computers in Libraries Conference]. Retrieved November 2, 2009 from http://infomotions.com/musings/oss4cil/
October 2009 Web Server
Survey. (2009, October 17). Retrieved November 5, 2009 from
http://news.netcraft.com/archives/2009/10/17/october_2009_web_server_survey.html
Owen, G.W.B. &
Stranack, K. (2007). The Public Knowledge Project and the Simon Fraser
University Library: a partnership in open source and open access. Retrieved
November 3, 2009 from http://eprints.rclis.org/15613/
Schneider, K. (2008).
Free for all: open source software. School
Library Journal, 54(8), 44-46.
Sheehan, K. (2009,
February). Creating open source conversation. Computers in Libraries, 29(2), 8-11.
Stam, W., & Wendel
de Joode, R. von (2007). Analyzing firm participation in open source
communities. In St.Amant, K. & Still, B. (Eds.), Open source software: technological, economic, and social perspectives
(pp. 495-509). Hershey, Pennsylvania: Information Science Reference.
Stranack, K. (2007,
July). The reSearcher software suite: a case study of library collaboration and
open source software development. Retrieved November 2, 2009 from http://eprints.rclis.org/15612/
Tansley, R., Smith, M.,
& Walker, J. (2005). The DSpace open source digital asset management
system: challenges and opportunities. In A. Rauber et al. (Eds), Lecture Notes on Computer Science, Volume
3652, 242-253. Retrieved November 2, 2009 from http://hdl.handle.net/1721.1/29462
van den Berg, K. (2005).
Finding open options. Masters thesis, Tilburg University. Retrieved November
3, 2009 from http://www.karinvandenberg.nl/Thesis.pdf