A super scanner and a major university have agreed to work on Microsoft's book digitization project.
Kirtas Technologies, a maker of high-speed scanners and digitization software, signed a deal Tuesday with Microsoft to scan works for its Windows Live Book Search project.
The Cornell University Library also signed on Tuesday with Microsoft as a partner, agreeing to let its collection be scanned. The project, when complete, will make public domain works, as well as copyrighted material from publishers who opt-in, freely available through Microsoft's online Web application.
Kirtas' robotic book scanners, according to the company, can scan 2,400 pages per hour and offer "book handling that is gentler than the human hand."
The works scanned by Kirtas will become available via Windows Live Book Search starting in early 2007. Cornell librarians will have a hand in choosing which versions of books to scan and overseeing quality control of the digitization process, according to Cornell.
The program is a direct competitor to Google Book Search, which already has many works available online in full text, and has enlisted libraries including the New York Public Library and Oxford University in its endeavor.
Google, however, has taken the opposite approach to Microsoft, requiring publishers to opt-out if they do not want their copyrighted works to be scanned. The method has resulted in several lawsuits in different countries.
Google has argued that Google Book Search does not allow full access to copyrighted works, as it does with public domain works--many of which are available as free PDF books that can be read or printed in their entirety.
Amazon.com and Yahoo also have book digitizing projects under way. They were
subpoenaed by Google, along with Microsoft, in September as part of Google's defense in one of its lawsuits.
No matter what OS we're talking about, there will certainly come a day when MSFT proprietary formats will be dead and gone (even between MS Office versions, this is already true...)
I just hope Cornell uses a universal, OPEN format that future readers of any type can successfully convert to human-readable text.
My guess as a recent Cornell graduate and my experience with the Cornell Libraries, the files will probably be PDFs.
The Cornell libraries scan a lot of content from books and journals and post them to a secured site for students to access for classes. Everything they post is in the PDF format.
NY professor believes that a word-based algorithm can help bring together those who believe, with one glimpse, that they have found and lost the love of their lives.
The Silicon Valley online payments startup grew by 1,000 percent last year and is hopeful it can repeat that level of growth this year. To do that, it's had to move away from its early friends-and-family roots and embrace small businesses.
Chamtech's spray-on antenna uses a nano material to provide a low-power boost to antenna range. The wireless-in-a-can product may some day bring an end to unsightly cell towers.
EnerG2 opens a plant to make an engineered carbon that will improve performance of energy storage devices and make storage for start-stop hybrid cars less expensive.
I just hope Cornell uses a universal, OPEN format that future readers of any type can successfully convert to human-readable text.
/P
The Cornell libraries scan a lot of content from books and journals and post them to a secured site for students to access for classes. Everything they post is in the PDF format.
I do hope though they are using PDF, it's the best IMHO.