random quotes ... to amuse, inspire, enrage:
  The fact that a believer is happier than a skeptic is no more to the point than the fact that a drunken man is happier than a sober one.

tagged: religion
  —George Bernard Shaw

google print: google’s evilness is beside the point (Bonus Rant Included)

Wednesday, October 26th, 2005 7:30 pm

I’m pleased to see the Google Print issue spurring discussion of the role of corporations in controlling access to information. See, e.g., today’s post @ Gnuosphere [link from sivacracy]

Gnuosphere, Siva, and others point out that Google isn’t doing Google Print out of the goodness of its heart; the company is scanning, indexing, and providing access to information for lots and lots of money. These warnings are a helpful antidote to Google worship.

But the problem with this complaint can be seen in the gnuosphere post:

Personally, I’m not against having an institution be granted the right to create such a database. But I’m wary about handing over such privilege and control to a body that is not working for the people. Should a corporation control what could potentially become the world’s first digital library? What is the purpose of a library? Why do libraries exist? For who do libraries exist? If this project is to become a globally accessible library, should there be someone controlling your right to read?

As the database of books increases in size and therefore scientific and cultural value, is an unregulated for-profit corporation the best choice to manage and control that database?

I think not.

The impulses guiding this post are clearly pro-public access and use, and pro-library, and I wholeheartedly support that. But the issue is couched as “handing over … control” of this information to a corporation. “[G]rant[ing] the right to create such a database.” Making a “choice” of entities to “manage and control” that database.

No, no, no.

The point of people’s support for Google Print is not that we support Google, love Google, or want Google to control our access to information. The point is that Google, and any other entity who wants to do it, should be able to add value to information. Google should not be THE ONE; Google should be ONE OF MANY. Picking and choosing a single entity presupposes that the information is already controlled, and this new use, this new added value, is to be carefully metered as a scarce resource.

We should be concerned about Google Print’s contractual restrictions on holders of its scanned works. But we should not fear Google simply for being the first entrant into the market. Google turns out to be evil? Implementing DRM, gathering and exploiting private personal data, indexing our DNA, imposing restrictive licensing agreements on its source material holders? Fine, criticize the evil practices (and Google too). Some other entity turns out to be evil, and wants to restrict copyright such that only Google’s database is valid? Criticize them, too. But I want to recommend that we resist the conflation of evils. If we’re concerned that Google is going to control a big huge really valuable database, and possibly to the detriment of those who want to use the database, then the answer is, in First Amendment terms, more speech. More databases, more indexers, more more more.

Bonus: Free Rant!

And by the way, you publishers, authors, and copyright-holders. You want to cash in on this market? Why don’t you consider selling the electronic texts to the aggregators and indexers for cheaper than they can scan them in and with reasonable licensing terms? There’s your market, right there. In fact, technology has made that market available to you for MORE THAN THIRTY YEARS. Dialog, Lexis, WestLaw, and other database vendors could have been using the full text of books for a really long time. Libraries would have killed to have full-text access to books.

As it happens, ignoring obvious markets is not new to the publishing industry. Book publishers ignored the market for enriched information content for years before they began ignoring the market for searchable full-text. Libraries and indexers could have used, at any time in the 20th century, a flourishing market for bibliographic and enriched descriptive information about books. Instead, with no such market, librarians CREATED, from scratch, and at very great expense, indexes and catalogs of information about books — with virtually no assistance from publishers. All those major research databases like MedLine, Agricola, and the like? Laboriously created by individual librarians, basically indexing and cataloging research journals by hand. Compare book publishers to research journal publishers. After some time research journal publishers figured out there was a market in enhanced information content, and began figuring out how to take advantage of that market. They facilitated the indexing process by including keywords and abstracts. They began selling tables of contents and journal indexes to the literature indexes. Ultimately they began selling full-text to databases and aggregators. In fact, once they figured it out, research publishers have been incredibly successful at capturing monetary value from information that the authors mostly want to give away for free. (So successful, in fact, that academic authors are having to fight their own publishers to get that valuable research information out of the market — another very interesting topic for another time.)

Could book publishers have done something similar? Sure. But for decades, literally, book publishers ignored this opportunity. As with research journals, individual librarians created the catalogs and indexes of books, hand-examining each book, figuring out what the book is about and how to describe it, etc. Libraries organized consortia and union catalogs to share this information and reduce the expense of creating it. For most of the time that cataloging took place, book publishers weren’t much help. Only in the very last few years have book publishers even begun to scratch the surface of providing enriched content to libraries and information vendors, by providing tables of contents to library systems vendors, and dipping their toes into very limited full-text databases that are scarcely available to anyone.

So the book publishing industry quaked in its boots and sat on its ass and ignored the market for searchable full-text, focusing solely on the market for information packaged as a physical artifact. And now the industry wants to complain that Google is jumping into the market? And doing it, not by licensing the full texts from the publishers, but in the most expedient fashion possible, by scanning? Please. Cry me a river, and while you’re at it, shed a few tears for the recording industry’s failure to jump online in the mid-90s.

algorithmically similar posts:

» Google Print: whither goest this debate …, 2005-10-31 (score:36)
» notes on open scientific publishing, 2004-10-01 (score:30)
» lost licensing revenue & Google Print, 2005-10-27 (score:28)
» authors vs. copyright owners, 2005-10-23 (score:28)

8 Responses to “google print: google’s evilness is beside the point (Bonus Rant Included)”

  1. derivative work » Blog Archive » lost licensing revenue & Google Print Says:

    [...] Once again, if Google Print goes forward, that doesn’t mean that Google Print will be the only big database, and it doesn’t mean that Google is now the arbiter of, well, anything other than its own sweat-of-the-brow compilation of data (the words used in books and the order in which they are used).1 [...]

  2. A Copyfighter's Musings Says:

    Google Print And Libraries; Google Print Or Libraries

    I agree with Siva – libraries are great and important.

    [...]

  3. Karl-Friedrich Lenz Says:

    While you are right that Google is one of many, that does not help the Google cheerleaders.

    Obviously, the burden to opt out multiplies with the number of organizations that have to addressed by opt-out processes. Which only serves to weaken Googles’ opt-out excuse.

    One other point is that the Yahoo project or older search databases like Lexis (object of the American Supreme Court Case of New York Times Co. v. Tasini) clearly show that you can build useful search databases while clearing the necessary rights. Google is not the only one, but they certainly seem very aggressive compared to most other projects in ignoring everyone else’s copyrights.

    I happen to dislike Google intensily because of their failure to address data protection issues adequately. However, my opinion about their copyright violations would not change one bit if they were a respectable company instead of Spam and Spyware Central.

  4. Siva Vaidhyanathan Says:

    Laura Quilter on the Conflation of Evils in the Google Library Debate

    This is important:

    [...]

    On Laura’s first point: Google will use its patents to ensure there is no real competition. We are talking about the next Microsoft here. Why would Google allow competition? I am afraid “more speech” is wishful thinking here.

    On Laura’s second point: Publishers have been doing exactly what she prescribes here. They have been doing it for almost a decade. [...]

  5. Copyfight Says:

    What She Said

    Laura Quilter: “The point of people’s support for Google Print is not that we support Google, love Google, or want Google to control our access to information. The point is that Google, and any other entity who wants to do it, should be able to add value to information. Google should not be THE ONE; Google should be ONE OF MANY.”

  6. gnuosphere Says:

    Gauging Google’s Grip

    Laura Quilter and Siva have brought some clarity to my thinking with posts concerning the Google Print inititative. Laura rightly demonstrates that my last post fails to clearly explain what I mean by “handing over … control” to the for-profit Google corporation. Laura states –

    [...]

    At this point, we’re not sure if the information is “already controlled” – I suppose that’s what the lawsuit will determine. However, this “new added value” is, as Siva points out, under latent patent control. If Google has their way, the other “of many” services will be forced to the sidelines due to mediocre user interfaces/database management crippled by patent law. Of what use will the freedom to exercise “fair use” in the creation of an electronic database be if the deliverance of that database is controlled by software idea patents?

    [...]

  7. derivative work : Google Print: whither goest this debate … Says:

    [...] Derek Slater made the point that supporting Google Print does not mean not supporting libraries, and he did it very well. If I had seen his post, I could have saved myself the trouble of writing mine. The video store / library example is quite right (although, Derek, video stores are not necessarily superior to library rentals; library rentals are in most instances free. And that must be good, since the music industry has told us nothing can compete with free….) [...]

  8. derivative work » Blog Archive » Google Print: whither goest this debate … Says:

    [...] Lenz makes the point that Yahoo!’s opt-in only venture, and older search databases that licensed content, weaken [...]

Leave a Reply

You must be logged in to post a comment.