essence of library
I like the flow of the google / library discussion: what’s the essence of library? and suspect I’ll be thinking about that one for a long time to come. (It sounds like a delightful perfume: a bit musty with an sweet undernote of decaying paper and an overnote of astringent preservative, maybe.)
Just picking out a few of the responses & adding a few more comments:
Michael Madison laid out a best-case defense for google based on google’s added-value of meta-information, and then termed the discussion: “is there an ‘essence’ of library?” And points out that we ought to focus “more what Google does than on what Google is“.
Siva Vaidhyanathan responded that Google doesn’t come close to the ‘essence of a library’.
This is the heart of the discussion that really intrigues me. Not because I truly am arguing that Google is a library, but because I suspect that the ways that information is being transmitted might start to render moot our current definitions of “library”. In my earlier post, I wasn’t really suggesting that Google take advantage of the warm feelings towards libraries; I doubt it would be a very helpful strategy, because most judges, like everyone else, would intuitively distinguish between the classical public library and Google. Rather, I was suggesting that library exceptionalism is only going to work so long as libraries are conceptually distinct.
Michael M then responded to Siva with some discussion of the essence of a library, ultimately concluding that we really have to talk about libraries in terms of information flows. And then he brings it back to Google:
Do we experience Google Print content as we experience other collections that we regard as libraries, or do we experience that content as we experience the Web — a functionally unlimited aggregation of data? Right now, the answer to that question has to rely on intuition and speculation. My money is on the second option, but in the end: who knows?
I’d like to suggest two basic functions for libraries: One is warehousing and archiving physical collections; serving effectively as a museum of information. The second function is providing information services. Storage, and access.
In the past and even today these two functions are, practically, inseparable. And each implicates a whole host of sub-functions many of which serve both masters — e.g., cataloging, which organizes the stored collections.
But these functions have been splitting and will continue to. Digitizing projects, like Google Print, will put the physical artifacts on the same plane with museum artifacts: nice if you’re a scholar and need the original, but for most people, the digitized content will suffice. [Google Print is not the only digitizing project, of course; there are plenty of others on smaller scales that have gotten less attention. I would be interested to get some examples of public-private partnerships because I suspect Google Print isn't the only one.]
As more of the information content becomes digital, the subfunctions used to service both the storage and access functions will shift. Two examples: cataloging and preservation. Electronic information needs much less in the way of cataloging; full-text searching obviates a lot of cataloging needs. (No, not all; I believe in subject headings and hierarchical thesauruses — although I’m not sure they’re ultimately scalable if we’re talking about organizing all information.) Digital media have their own preservation problems, fairly distinct from those relevant in most special collections. The central problem in preserving digital media collections is shifting formats; the central problem in preserving physical collections is preserving the original artifact.
So as these transitions within libraries move forward, the easy and obvious distinctions present today between libraries and Google Print will erode.
Now, Eric Goldman in a comment here said another of his maxims was never build a business on fair use. Google Print, of course, relies entirely on fair use (17 USC 107), so far as I can see. One way we might distinguish libraries at present is that most libraries, operating in the book-warehousing business today, rely not very much at all on fair use, and rather a lot on first sale (17 USC 109). Libraries vary with respect to the library exemptions in 108, which are used principally, so far as I know, to (a) establish reserves collections; and (b) make backups of software, videos, records, etc.
But the bedrock library provisions we rely on today, 108 and 109, won’t be enough for some collections that need to be built in the future. For instance, I don’t know what libraries are currently archiving popular digital ephemera (besides the Internet Archive). But just as libraries have begun to collect popular culture media in DVDs, CDs, comic books, and zines, so there will have to be archiving projects dedicated to archiving purely digital media, including digital media that are distributed for free via the web. I’m thinking of things like JibJab’s “This Land Is Our Land”, Mark Fiore’s shockwave commentaries, and similar such materials.
Let’s consider the Mark Fiore shockwave animated cartoons. [This is purely my example, because I love Mark Fiore; I have no idea if he has been approached by any libraries or what his response might have been.] The cartoons are distributed for free over the Internet; but they are not (so far as I know) licensed for free reproduction & distribution, and they author retains copyright. If a library wanted to begin collecting them, how would they analyze this collection & provision of access to it? 109 protects the rights of “the owner of a particular copy or phonorecord lawfully made under this title … to sell or otherwise dispose of the possession of that copy or phonorecord”. But “computer programs” are exempted. Are shockwave files “computer programs”? Maybe we have to resort, at last, to fair use. Now what do American Geophysical Union, Kelly v. Arriba, MP3.com, et al, tell us? Michael Madison talked about it, but I think it was summed up by Eric Goldman: “Don’t build a business on fair use … multi-factor tests lead to complete unpredictability.”
This is obviously not a fullbore analysis of the relevant provisions as applied to publicly distributed shockwave files, but it does make my point: digital media and new ways of distributing content are already troubling the current copyright categories that are designed around brick-and-mortar libraries and physical artifacts.
And that’s just one example looking at only one aspect of the question of collecting & providing access to Mark Fiore shockwave animations. Consider the reams of problems that digital media pose in the realm of licensing, DRM, and DMCA-type technical protection measures, notwithstanding the protections allegedly offered by 109, 108, and the 1201(d). (Is there any point in even citing to 1201(c)? I feel it’s been effectively read out of the statute the same way, and perhaps for similar reasons, the 9th Amendment to the Constitution has been politely ignored.)
Libraries qua libraries — well, libraries qua public and academic libraries, anyway — will always have recourse to Congress, and I predict they will prove as popular there in the future as they have in the past: not popular enough to sway Congress from granting very broad rights to copyright holders that end up hurting libraries, but popular enough to get some limited library-specific protections.
But most librarians, myself included, want to preserve BOTH today’s model of the library: the brick-and-mortar warehouse-and-cataloger-of-physical-media (which I do think will always be around) — AND the idea of the library: the collector and provider of information. So the question is, how, or why, do we copyfighters / librarians / information activists / legal scholars distinguish Google Print in a way that doesn’t hurt Essence of Library down the line? And why, tactically, should we? Maybe, we should focus on building a more robust fair use, fixing 109 so it works with digital media, or even adding in more 108 exemptions. Or maybe on the DMCA Library of Congress anticircumvention comment rounds that are coming up again.
Further reading on this discussion at copy this blog and copy this blog again. copyfight is following the debate and a number of people are commenting: See google print is as google print does and google print library shoulda coulda woulda. More from “real librarians” and others responding on Siva’s blog: Eileen Snyder, 8/17; Siva responding to Michael Madison, and including comments from other folks too.
I’d like to link to some good discussions on 109 (I seem to recall Derek Slater recently talked about 109 and digital music files, for instance, but can’t find his post — is there a search function I’m missing? Derek?) but will need to do some more digging … later.
As I write I follow one of those social sciences rules about mobs or group discussions or something: I make myself more firm in my opinions the longer I write. This is why it would be much better if I had time to write a long post, then sit on it for a while — my tone could be measured & even the whole way through. But I was already delayed in responding, so wanted to get some thoughts out in a hurry.
update 8/18: a few last posts on this discussion: madisonian.net 8/17; siva 8/18 and siva again 8/18;
siva also posted about an aspect of this issue which i didn’t really touch at all in this discussion, which is the trustworthiness of private actors in general and google in particular. my interest was piqued by the essence-of-library question, but this was a significant thread in comments & subtexts in various discussions. See siva 8/17; copy this blog (previously cited) linked to a post & comment discussion of the google / library contract on the library law blog; and seth finkelstein wrote about what’s in it for google.
update 9/1: the best response to it all came from the onion: Google Announces Plan To Destroy All Information It Can’t Index …
The new project, dubbed Google Purge…. The company’s new directive may explain its recent acquisition of Celera Genomics, the company that mapped the human genome, and its buildup of a vast army of laser-equipped robots. ‘Google finally has what it needs to catalog the DNA of every organism on Earth,’ said analyst Imran Kahn of J.P. Morgan Chase. ‘Of course, some people might not want their DNA indexed. Hence, the robot army. It’s crazy, it’s brilliant—typical Google.’ … ‘This announcement is a red flag,’ said Daniel Brandt, founder of Google-Watch.org. ‘I certainly don’t want to accuse of them having bad intentions. But this campaign of destruction and genocide raises some potential privacy concerns.’
related posts: interesting reading early saturday morning 8/13 … google & not-for-profit libraries 8/13
algorithmically similar posts:
» google print postscript, 2005-12-06 (score:35)» google print: google’s evilness is beside the point (Bonus Rant Included), 2005-10-26 (score:29)
» google & not-for-profit libraries, 2005-08-13 (score:28)
» Google Print: whither goest this debate …, 2005-10-31 (score:26)




August 17th, 2005 at 10:14 pm
Laura, another thought-provoking post, but one that leaves me a little confused. I recognize you know a heckuva lot more about libraries than I do (although, as the son of a librarian, I’m not completely clueless), but I’m not yet convinced that your definition of libraries is well-accepted or even correct.
Libraries serve a multitude of functions that reflect both their physical structure (i.e., a place to hang out) and the intangible content they store (a repository of knowledge). With the explosion of content, both printed and electronic, libraries face an increasing challenge with the latter.
For many of us, now, (I’m sad to say) libraries are a way to get state-subsidized content–free DVD rentals, free best-seller books. Sure, I could go to Blockbuster and rent a DVD, or buy the bestseller on Amazon, but why do so when I’ve paid for it already through my taxes? This is hardly the paradigmatic repository of knowledge; it’s just a state-mandated reallocation of wealth to fund free content in competition with pay options.
I recognize libraries serve many other roles, including a public space for gathering and, increasingly, a place to get free Internet. However, until we nail down what makes a library a library, I’m not sure we can know if Google is or isn’t one.
In any case, I’d be happy to statutorily confirm that Google and other search engines are libraries for purposes of the copyright act. (Not politically realistic, but we can dream) Until then, we know that Google isn’t a library (or an archive), although we can’t say with precision what it would have to do different to be one.
Eric.
August 17th, 2005 at 10:49 pm
Very interesting thoughts. I’m just writing something down quickly before I forget. ^^; I guess my first question is: if not for the popularity reason, why would it be beneficial for Google to be considered a library in the present, if not for the warm feelings? The Section 108 exemptions for digitization just aren’t good enough for them to be worthwhile under copyright law. IE, I don’t think that a Section 108 library would be able to do something like Google Print without relying on fair use.
I do believe that copyright law hasn’t caught up with “born digital” objects. I’ve gotten some interesting advice from UT’s General Counsel on that subject…
The essence of a library- I’m planning to dust off my LIS text books to see what I can find when I get the chance, and see what the people in my School are teaching, at least. ^_^;
August 17th, 2005 at 11:58 pm
A couple of points:
1) I disagree that Blockbuster-level DVDs isn’t part of a paradigmatic repository of knowledge. It’s not the whole, but it is a part. I’m one of those folks who thinks libraries ought to collect everything, including zines, posters, pamphlets, ephemera of all sorts, and even Son of Hollywood Blockbuster XIV. The only real justification for failing to do so is the pragmatic considerations of shelf space, cataloging resources, etc. So Mark Fiore’s animated cartoons really should be collected, and I really hope that some enterprising special collections librarian is working on subject-specific archives of Yahoo! Groups, listserves, UseNet, and the like. The Internet Archive can’t do be the only archive of the Internet.
2) I must come across as much more definitive than I think I do. I do know that libraries aren’t easily defined, and I certainly know that a lot of librarians would “But—!” in outrage at my proposed Essence of Library. What the heck, it’s just a model. <grin;>
3) Eric completely got me on one point: a huge, huge aspect of libraries is the “place to hang out”; that’s an essential function that I didn’t discuss at all. So, I’ll modify my (proposed) Essence of Library: a) information warehousing; b) information accessing; and c) social space. Right now, GooglePrint and libraries look pretty different on all 3 categories. But the information landscape is shifting pretty rapidly, between rapid digitization of old media, and rapid generation of new media formats. It seems to me that technology shifts are exacerbating the differences between a) (information warehousing) and b) (information accessing). I still want libraries that do special collections & archives — information warehousing. But in the future, I also want libraries to provide the best possible information access. It seems to me likely that there will be a fair amount of convergent evolution between Google and libraries in their information accessing mode. If so, then we’d better know what is the essence of libraries, as Michael Madison suggested, because there is a significant amount of library exceptionalism within current copyright law. So what is the essence of libraries? Is it the combination of the 3 functions? If (as I bet) functions a and b are diverging, then is it the combination of one of them with c) that makes a library? If Google Print is not a library, then what is the Internet Archive? I’m not saying I know the answers; I’m just saying these questions trouble me. I love libraries, but maybe I’m driving myself to not loving library exceptions in copyright; maybe I just want more reasonable copyright, across the board, that inherently allows things like the libraries of today and the libraries and GooglePrints of tomorrow. Since that seems unlikely, though, maybe library exceptionalism the best we can hope for. And if so, then I come back once again to wondering whether it’s not more fruitful for the library community to define itself more inclusively than exclusively, given the porousness of copyright law and the rapidly shifting media landscape.
Ugh, too long, and I’m too tired; I may be shedding more confusion than light at this point. Probably time to ponder rather than write.
August 18th, 2005 at 12:41 pm
[...] Focusing on libraries- what are some commonalities for these types of libraries? -A formally organized institution that provides information-related services to a defined community -These services are generally provided by a professional and/or paraprofessional staff. ALA differentiates between a library and other information centers based on the presence of a librarian- I’m not sure if they still do, but it would make sense to be that they do. -Community space (noted on derivative works – I think we can consider this type of space a service, maybe?) -More? [...]
August 19th, 2005 at 10:51 pm
[...] Related posts: interesting reading early saturday morning 8/13 … essence of library, 8/17 [...]
August 19th, 2005 at 4:32 pm
Discussed with intelligence and insight at the Derivate Work blog. What’s A Library, and What Is Google Going to do to It? – Likelihood of Confusion by Ron Coleman, Aug. 19, 2005
August 22nd, 2005 at 8:42 pm
[...] lquilter.net follows the flow of the google / library discussion and asks what’s the essence of library? [...]
February 6th, 2008 at 5:05 am
[...] Discussed with intelligence and insight at the Derivate Work blog. [...]