Version: 2008
  • On TV.com: New TV sex symbol: Vintage black PORSCHE

April 11, 2007 7:31 AM PDT

Google backs character-recognition research

  • 2 comments
Google is sponsoring an artificial-intelligence research group's work to develop advanced technologies for character recognition.

The open-source project, called Ocropus, has several goals, including developing a high-level, easy-to-use handwriting recognition system that can convert handwritten documents to computer text, assisting in the creation of electronic libraries, analyzing historical documents and helping vision-impaired people access information. The "ocr" in Ocropus stands for optimal character recognition.

The project is headquartered at the Image Understanding and Pattern Recognition (IUPR) research group at the German Research Center for Artificial Intelligence (DFKI) in Kaiserslautern, Germany. DFKI Professor Thomas Breuel is leading the project.

Breuel made the announcement on Monday through a post on the Google Code blog. In addition to Google's sponsorship, Ocropus is getting funds from several German government agencies and other public and private entities.

The Ocropus team expects the project to last three years, and it will support three Ph.D. students or postdoctoral students. IUPR is basing the software primarily on two research projects: one, a handwriting recognition system developed in the mid-1990s for use by the U.S. Census Bureau; and two, newer layout analysis methods for character recognition.

Other resources include Tesseract, a decades-old engine for optimal character recognition originally developed by Hewlett-Packard Labs and re-released by Google last year as an open-source system.

A preview of the Ocropus system is available on the project's Web site under an Apache license, and the IUPR is soliciting open-source contributions in order to complete a number of goals. These include creating a desktop application for the system, adding third-party tools and adapting Ocropus to a variety of languages. It's currently English-only.

See more CNET content tagged:
handwriting recognition, open source, project, Google Inc., Germany

Add a Comment (Log in or register)
Optimal or Optical?
by mrjam32 April 11, 2007 9:03 AM PDT
I have never heard of "Optimal Character Recognition". Maybe that's a new term for this project, but I believe all the old ones are "Optical..."
Reply to this comment
Should be Optical
by _Seffer_ April 11, 2007 11:11 AM PDT
See <http://en.wikipedia.org/wiki/Optical_character_recognition>.
Reply to this comment

Latest tech news headlines

RSS Feeds

Add headlines from CNET News to your homepage or feedreader.

More feeds available in our RSS feed index.

Markets

Market news, charts, SEC filings, and more

Related quotes

Google (2.30%) 13.10 583.06
Dow Jones Industrials (1.25%) 129.38 10,447.54
S&P 500 (1.34%) 14.60 1,105.98
NASDAQ (1.34%) 28.66 2,174.70
CNET TECH (1.57%) 24.72 1,601.97
  Symbol Lookup
advertisement

Inside CNET News

Scroll Left Scroll Right