Webware

Read all 'Book Search' posts in Webware
September 16, 2009 7:30 PM PDT

Report: Google considering book settlement tweaks

by Tom Krazit
  • 15 comments

Google is in talks with the Department of Justice and the plaintiffs in its book search settlement over possible modifications to the deal that could assuage the concerns of the Justice Department, according to a report.

Bloomberg reported late Wednesday that Google, a group of plaintiffs that sued it in 2005, and the Justice Department are in talks over the settlement, which gives Google the legal standing to digitize out-of-print books that are still protected by copyright law. Google was sued by numerous groups including the Authors Guild and several publishers after it began scanning that type of book, and came away with the right to present those books in Google Book Search following an October 2008 settlement.

But that settlement has provoked strident dissent from several authors, libraries, and privacy advocates concerned about Google's unique license to those out-of-print books. Google has promised to act responsibility, and notes that any other company can negotiate scanning deals with the Books Rights Registry set up as part of the settlement, but the deal as structured is apparently not enough to satisfy the Justice Department, according to the report, which doesn't specify which provisions of the agreement are subject to change.

A Google representative declined to comment on any possible negotiations with the plaintiffs and the Justice Department but said of the settlement "if approved by the court, this settlement stands to unlock access to millions of books in the U.S. while giving authors and publishers new ways to distribute their work."

One possibility is that the Justice Department is concerned about the Books Rights Registry, which under the settlement has the power to negotiate deals with other companies, distribute payments to books rights holders, and represent the interest of authors and publishers. The registry has been criticized as a "cartel" by Google's detractors, who believe that although it is a nonprofit run separately from Google it will essentially function as a gatekeeper for other companies that wish to scan and distribute books.

Google CEO Eric Schmidt expressed frustration with the critics of the settlement earlier today in an interview with Search Engine Land's Danny Sullivan, complaining that those criticizing the settlement aren't putting forward their own solutions to create what all parties to the settlement agree would be a tremendous asset: a comprehensive digital library.

"I'm open to a better solution," Schmidt told Search Engine Land. "You will recall, we had our solution, and we were sued over it. And we then had a-god-knows-how-many years of negotiations with 27 parties, and we've actually produced a deal," he said.

Google was ordered Wednesday by Judge Denny Chin, who is overseeing the settlement, to respond by October 2 to the wave of comments he has received regarding the settlement. Supporters and objectors had until last week to file comments with the judge, who will consider them before holding a final hearing on October 7 to decide whether to approve the settlement.

Originally posted at Relevant Results
August 29, 2009 2:19 PM PDT

More questions than answers on Google Books

by Tom Krazit
  • 44 comments

BERKELEY, Calif.--Google's Dan Clancy had patiently answered question after question regarding Google's' Book Search settlement with publishers and authors until late in the afternoon Friday, when he was finally left speechless.

Louis Trager, a reporter from Washington Internet Daily, asked Clancy what kind of message was sent when Google decided to "copy first and answer questions later." The question--for which there's no safe answer, if you're in Clancy's shoes--perhaps underscored the core of the opposition to the settlement, reached in October, after Google was sued in 2005 for scanning out-of-print works without explicit permission.

Google's Dan Clancy is charged with defending Google's position before opponents of its book search settlement.

(Credit: Tom Krazit/CNET)

If the class action settlement is approved, Google stands to gain control of a priceless asset. Jason Schultz, acting director of UC Berkeley's Samuelson Law, Technology, and Public Policy Clinic, called it "the largest copyright-licensing deal in U.S. history:" the right to display the contents of out-of-print books that are still covered by copyright protection.

Google, however, has already scanned more than 10 million books. At the moment, it's not allowed to display more than a few snippets of copyright-protected books for which it doesn't have an explicit agreement with the rights holders. If the settlement is approved, Google will suddenly flip a switch and offer full-text searches of those books, as well as links to bookstores.

Nothing vexes Google's opponents more than the fact that the company assumed that it had the right to digitize nearly 100 years of written material without serious negotiations with those rights holders until it was sued. Authors have until Friday to decide if they want to opt out of the settlement and preserve the right to sue Google on their own for digitizing their book without their permission, though they can tell Google to remove their books from the Book Search archive, even if they remain in the class.

Everyone agrees that a searchable digital library of out-of-print books would be a very valuable asset for the world. As any owner of an e-book reader such as Amazon.com's Kindle will tell you, the way we think about books is changing.

Think about it: libraries offer tons of out-of-print books, so it's not like the collective knowledge of those books is inaccessible. Yet that knowledge exists in millions of hard-bound individual silos.

What if we could make all that knowledge instantly accessible from anywhere in the world? And more importantly, what if researchers have the ability to analyze it?

Amazing gains could be made in fields like linguistics. In dismissing arguments that scale makes a search engine better, Google's Hal Varian told me last month that one area that does seem to increasingly benefit from scale is translation: the more copies of bilingual books that Google has access to, the more it can perfect its translation algorithm.

"The value of the book as data is greater than value of the book itself," said Peter Brantley, director of the Internet Archive and perhaps the most vocal critic of the settlement. And who will control access to a valuable group of books? A for-profit corporation, which, by the way, paid just $125 million for the license to that information. It paid $1.65 billion for YouTube.

Google likes to say that anyone can cut deals with the Book Rights Registry, the nonprofit organization set up after the settlement to handle payments to right holders, to get similar access to out-of-print yet in-copyright books. The thing is, the number of organizations that can afford to duplicate Google's efforts is limited.

Clancy declined to say how much Google has spent on scanning books, but the Internet Archive spends about $30 for each book scanned. If Google's costs are similar, that's $300 million and counting; there are about 23 million books in the WorldCat database. Microsoft folded its book-scanning project, once it realized that Google was aggressively going after that market, said Tom Leonard, the head librarian at UC Berkeley, which had been part of a book-scanning partnership with Microsoft.

This is what frustrates Google, to a certain extent: everyone agrees that digital access to books is important, yet no one else is willing or capable of doing it. And Google insists that it will be a fair steward of the material: the European Commission has backed Google's efforts, and several university libraries, such as that of the University of Michigan, are also fully on board.

But taking Google at its word requires trust, and trust in corporations is in short supply at this point in American history. It's taken perhaps longer than it should have, but Google is gradually realizing that a fair portion of the public no longer sees it as a cute little Silicon Valley start-up with idealistic stars in its eyes, one that insists "you can make money without doing evil."

Google damaged that trust when it began scanning books without permission, arguing that it was allowed to do so under fair-use laws. Publishers and author groups also harmed that trust when they turned over the key to the castle by bringing the lawsuit as a class action, suddenly making plaintiffs out of millions of authors who did not necessarily appreciate the future value of digital books in 2005, nor authorize the negotiation of the rights to their works.

By the time the reporter caught Clancy off guard, he was understandably drained from a long day spent under hot lights fielding questions, and at least one diatribe, from passionate academics and activists.

The thing is, it's a fair question: Google has the financial resources and collective intelligence to do nearly anything it wants in the world. Where will Google turn its information vacuum next? Will it ask permission first?

Corrected August 30, 10 p.m. with the correct identification of the reporter who posed the question to Clancy.

Originally posted at Relevant Results
August 19, 2009 10:54 AM PDT

Challenge to Google Books settlement focuses on class actions

by Tom Krazit
  • 5 comments

A new objection to Google's Book Search settlement with books rights holders plans to argue that the parties to the settlement are "trying to ram this through so that millions of copyright holders will have no idea that this is happening."

Google's plan to digitize books on a grand scale has its fair share of detractors, and Scott Gant will soon join them in opposition to the settlement according to The New York Times. Gant, a lawyer with Boies Schiller & Flexner, says he's acting on his own as an author concerned about the use of class action status to lump all authors into the same pool.

Several groups acting on behalf of publishers and authors sued Google in 2005 over its plan to digitize books, and the suit was granted class action status. That meant that when Google and the publishing groups settled the lawsuit in 2008, publishers and authors that held the rights to books that were out of print but still protected by copyright law had to opt out of the settlement if they didn't want to participate in the project. They have until next month to do so.

Gant will file a brief with the court arguing that this is "an abuse of the class action process," according to the Times, which also noted that some legal scholars believe this is a novel challenge to the settlement. Many of the objections have focused on the fact that with the settlement, there's only one institution in the United States that has the legal authority to scan out-of-print books still protected by copyright: Google.

The class action status meant that Google did not have to individually negotiate with rights holders for "orphan works," a vast undertaking that nonetheless should have been conducted if Google and the parties to the settlement were truly interested in preserving the rights of authors, Gant argues. As part of the settlement, Google has had to show that it is taking as many steps as possible to ensure all possible rights holders have been informed about the settlement and their options under the agreement, but Gant's brief plans to say that licensing agreements should not be established via class action lawsuit.

Google told the Times that regardless of how the deal was reached, rights holders still have control over their destiny, and can opt out of the settlement should they wish to prohibit Google from scanning their work. A final hearing in the case is scheduled for October.

Originally posted at Relevant Results
August 18, 2009 5:20 PM PDT

Report: Google near Book Search deal in France

by Tom Krazit
  • 3 comments

Google is about to sign a deal with the French national library to grant patrons access to Google's archive of scanned books, according to a report.

France's La Tribune reported Tuesday (spotted by The Times Online) that the Bibliotheque Nationale de France has all but given up on its own attempt to create a digital library in partnership with other European countries, paving the way for Google Book Search to get a foothold on the continent. Google declined to comment on whether a deal had been reached, providing this statement: "Like we've always said, we'd be delighted to work with a prestigious institution such as the BNF and are currently in talks with them, but have nothing to announce at this time."

At one point, the BNF had hoped to create a counterweight to Google's digital book ambitions, fearing that English works would dominate those created by non-English speakers and "place interpretation of French and other continental European literature, history, philosophy and even politics in American hands," according to a 2005 New York Times article. But Quarero, the result of that initiative, has not really gotten off the ground and on a limited budget, couldn't really hope to compete with the resources Google has devoted to scanning books.

So, the BNF is taking a "if you can't beat them, join them" strategy, according to the reports. A BNF representative told The Times Online that it won't stop its own digitization efforts but plans to add Google's archives to its own.

Back in the U.S., the waiting period for Google's settlement with book publishers that will enable the project to move forward is almost done, with a September deadline for authors to opt out of the settlement looming. A final hearing is scheduled for October.

Originally posted at Relevant Results
June 18, 2009 11:49 AM PDT

Google Book Search gets a face-lift

by Tom Krazit
  • Post a comment

Google Book Search now calls out the placement of search terms more clearly in books and magazines.

(Credit: Screenshot by Tom Krazit/CNET)

All the talk about Google's Book Search lately has focused more on the law than the page, but Google continues to improve the product at the heart of those discussions.

Google rolled out several improvements to its Book Search product Thursday. Searchers can now immediately see where their search term appears in a given book, see thumbnails of all the pages in a book or magazine, and embed links to books in their blogs, among other things.

The improvements, detailed Thursday in a blog post, apply only to public-domain and those Google has negotiated the rights to publish. The hubbub over the Google Book Search settlement is that it grants Google the right to post copies of out-of-print books that are still under copyright.

June 15, 2009 4:00 AM PDT

Google's digital-book future hangs in the balance

by Stephen Shankland
  • 31 comments

Google, the company best equipped and most motivated to digitize the world's books, wants to offer the world an online Library of Alexandria. The decisions of the Justice Department, authors, book publishers, a federal judge, and Google itself likely will determine whether the company actually does.

Nobody in recent years has accused Google of lacking ambition, but its Google Book Search project is certainly among the company's top projects when it comes to chutzpah. That's not just because of the technical and financial hurdles of scanning, indexing, and displaying online millions of books, it's also because of the tangled intellectual property and legal concerns involved in the controversial project.

After revealing the book-search project in 2003, Google drew copyright infringement lawsuits from the Authors Guild and the Association of American Publishers in 2005, but an October 2008 proposed settlement, now under review by Judge Denny Chin of the U.S. District Court for the Southern District of New York, has converted those groups from adversaries to allies.

The settlement, if approved, could neatly cut a Gordian knot of copyright entanglements though setting Google back $125 million. That's because it would enable Google not only to display books that are out of copyright and those that are in print by cooperating publishers, as it does today, but also those from the vast collection of in-copyright brooks that are out of print--even when those holding rights to those books didn't specifically agree to Google's plan.

The complicated proposed settlement invoked the wrath of some authors concerned it would grant Google monopolistic power over online publishing, and the court extended the deadline for authors to choose whether to opt out of the settlement from May to September. Then the other shoe dropped this month: the Justice Department signaled serious antitrust scrutiny by issuing subpoena-like civil investigative demands, or CIDs, to check into the matter.

AIG and General Motors apparently are too big to fail. But the way the opposition to Google Book Search is shaping up, it looks like some believe Google is too big to succeed.

... Read more
Originally posted at Digital Media
May 4, 2009 5:26 PM PDT

Library groups gripe about Google Book Search

by Stephen Shankland
  • 8 comments

Three groups representing hundreds of libraries lodged a long series of concerns about a proposed settlement of lawsuits over Google Book Search on Monday--but refrained from objecting overall.

Specifically, the American Library Association, the Association of College and Research Libraries, and the Association of Research Libraries expressed some affinity for Google's mission of sharing books with the public, but raised concerns in a legal filing that the settlement would concentrate power in Google's hands and poses pricing and privacy concerns.

Google has patented technology for scanning books.

Google has patented technology for scanning books.

(Credit: U.S. PTO)

Google is scanning millions of books, presenting their contents online at the Google Book Search site and blending some information into its regular search results, but the move triggered lawsuits by the Association of American Publishers and the Authors Guild. A proposed settlement of the class-action suit would permit Google to share some contents of books online--not just those that are in the public domain and those still being sold with which Google has agreements with publishers, but also those that are still in copyright but out of print.

The settlement terms are of high interest to the library world. Under the proposed settlement, Google would offer a free computer terminal in U.S. libraries that would permit people to read books online and a subscription service that would permit paying customers, such as larger libraries, to offer more computer terminals. In cases where readers or institutions pay for access to online books, funds would be divided between Google and a proposed independent non-profit group, the Book Rights Registry, set up to handle payments, find authors and other rights holders, and let them include or exclude their works from the project.

"The library associations do not oppose approval of the settlement. The settlement has the potential to provide unprecedented public access to a digital library containing millions of books," the groups said in their filing. "However, the digital library enabled by the settlement will be under the control of Google and the Book Rights Registry. Moreover, the cost of creating such a library and Google's significant lead-time advantage suggest that no other entity will create a competing digital library for the foreseeable future."

That power could put libraries at a disadvantage, they argue.

"We are concerned that the cost of an institutional subscription may skyrocket, as academic journal subscriptions have over the past two decades," said Erika Linke, president of Association of College and Research Libraries, in a statement.

Added Jim Rettig, president of the American Library Association, the proposed settlement "offers no assurances that the privacy of what the public accessed will be protected, which is in stark contrast to the long-standing patron privacy rights libraries champion on behalf of the public."

In its response, Google looked on the sunny side but didn't address pricing or privacy details: "We're pleased that the thousands of librarians and libraries represented by the ALA recognize the promise of our groundbreaking agreement, and we appreciate that the Library Associations do not oppose approval of the settlement. Google is proud to partner with dozens of libraries around the world as part of our Book Search efforts, and we have consistently maintained that, if approved by the court, our settlement agreement stands to unlock access to millions of books for users in the U.S."

But the library groups struck a more sober tone in their filing. Take, for example, this section on the power Google would get by virtue of obtaining online publishing rights through the class action settlement:

The settlement of copyright class action litigation might well have been the only feasible way this research tool could have been created. A class action settlement provided perhaps the most efficient mechanism for cutting the Gordian knot of the huge transactions costs of clearing the rights of millions of works whose ownership often is obscure. However, the class representatives and Google structured the settlement in such a manner as to give them enormous control over this essential facility. This is not surprising, given their economic interests. Indeed, precisely because of their economic interests, it is unlikely that they would have agreed to a structure that did not grant them such control.

To be sure, nothing in the Settlement prevents another entity from undertaking a mass digitization effort similar to Google's. But given the enormous cost of such an effort, and Google's significant lead time advantage (Google has been digitizing in-copyright books since 2004), no other entity is likely to so in the near future. Hence, there is no foreseeable threat to the control Google and the Registry have over this essential research facility.

Originally posted at Digital Media
May 4, 2009 12:31 PM PDT

Patent reveals Google's book-scanning advantage

by Stephen Shankland
  • 14 comments

Sometimes overlooked in the Sturm und Drang about Google Book Search is any consideration of the mechanics of economically scanning the books in the first place, but a patent awarded to Google gives insight into how the search behemoth accomplishes the task.

In short, Google has come up with a system that uses two cameras and infrared light to automatically correct for the curvature of pages in a book. By constructing a 3D model of each page and then "de-warping" it afterward, Google can present flat-looking pages online without having to slice books up or mash them onto a flatbed scanner.

This diagram shows patented Google technology for correcting for curved pages while scanning books.

This diagram shows patented Google technology for correcting for curved pages while scanning books.

(Credit: Google)
... Read more
Originally posted at Cutting Edge
April 27, 2009 3:01 PM PDT

Google seeks more time in Book Search case

by Stephen Shankland
  • 4 comments

This post was updated at 5:53 p.m. PDT with details about an extension request of the May 5 deadline filed by seven authors.

Google said Monday it's seeking 60 more days to find authors and persuade them of what it believes are the merits of a settlement involving its online Book Search service.

The proposed settlement of the 2005 case filed by the Authors Guild and the American Association of Publishers involves Google's right to show information from books online--in particular "orphan" works that are still covered in copyright but that are in limbo, for example being out of print or written by authors who can't be located. Currently, authors must respond by a May 5 deadline to opt out, otherwise authors will be included in the settlement.

"The settlement is highly detailed, and we want to make sure rightsholders everywhere have enough time to think about it and make sure it's right for them. That's why we've asked the court for permission to extend the opt-out deadline for an extra 60 days," said Alexander Macgillivray, Google's associate general counsel for products and intellectual property, in a blog post Monday.

"It's pretty easy for credit card companies to contact their cardholders--they send bills to them all the time. The world's authors, publishers and their heirs are much more difficult to find," Macgillivray said. "So, as the New York Times recently reported, the plaintiffs hired notice campaign specialists Kinsella Media Group to tell them about this exciting settlement, and Google has devoted millions of dollars to fund this notice campaign. Kinsella started by launching a website for authors and publishers and a direct-mail effort. Beginning in January, Kinsella published ads in newspapers and other publications all over the world from Fiji to the Cook Islands to Greenland. And of course, they also placed ads right here at home in the U.S., in publications as diverse as Writer's Digest and USA Today."

Google is facing resistance to the settlement.

Seven authors last week requested a four-month extension (PDF) of the May 5 deadline due to the complexity of the proposed settlement, among other reasons.

"First, two months' time is insufficient to understand the implications of a settlement of this scope," the appeal letter reads. "Second, substantial defects in notice of the settlement undermine authors' ability to assess their rights; and third, more time is required simply to understand the complex terms of the agreement."

Originally posted at Digital Media
February 5, 2009 6:40 PM PST

Google going mobile with Book Search

by Steven Musil
  • 2 comments

Google is opening a new chapter in its book digitization saga, this time taking on the likes of Amazon.com's Kindle and Sony's eReader.

The search giant on Thursday launched a mobile version of its Google Book Search, giving iPhone and Android users instant access to more than 1.5 million public domain books. The works of authors such as William Shakespeare, Jane Austen, and Charles Dickens were optimized to be read on the small screen, a challenge the Google Book Search team called "daunting" in a blog post announcing the launch:

There's an interesting backstory about the work involved to prepare so many books for mobile devices. If you use Google Book Search, you'll notice that our previews are composed of page images made by digitizing physical copies of books. These page images work well when viewed from a computer, but prove unwieldy when viewed on a phone's small screen.

Our solution to make these books accessible is to extract the text from the page images so it can flow on your mobile browser just like any other web page. This extraction process is known as Optical Character Recognition (or OCR for short).

However, as the team notes, there are frequently obstacles that keep the printed word from being accurately extracted, such as smudges, fancy fonts, old fonts, and torn pages. As an example of an "extreme case," the team presented the this page image from Lewis Carroll's Alice's Adventures Under Ground:

(Credit: Google.com)

...and the resulting extraction:

=> "lV~e.il!" .ÍAoHyU- AUte. U brstty/affc. su.it a. f o.tl as ~tk¿* , I s&O.IL .éfiiíjz tiotkun-) of-ttmlr1¿*y ¿i^n. sta¿rs ! Jfo» ura.ve ...

The e-book reader market has exploded in the past year, with analysts estimating Amazon sold 500,000 Kindles in 2008. Last month, Amazon CEO Jeff Bezos credited the "unusually strong demand" for the unit in helping the e-tailer beat Wall Street's fourth-quarter revenue and earnings expectations.

Google, which last year settled a 3-year-old lawsuit over its scanning project, may have timed this launch to upstage Amazon, which is rumored to be releasing its next-generation Kindle later this month.

Originally posted at Digital Media
advertisement
Click Here

About Webware

Say No to boxed software! The future of applications is online delivery and access. Software is passé. Webware is the new way to get things done.

Add this feed to your online news reader

Webware topics

15 sites that went kaput in 2009

Web sites launch all the time, but they also shut their doors. We highlight 15 that bit the dust this year.

Top 10 news stories of the decade

Let the debate begin: Was the iPhone more important than iTunes? Was anything bigger than Google finding a great business model? CNET offers its list of the 10 most important stories of the '00s.

Most Discussed

Inside CNET News

Scroll Left Scroll Right