• On CBS MoneyWatch: The perfect car for a teenager
September 18, 2008 5:29 PM PDT

Two new semantic engines: Cognition and Eeggi

by Rafe Needleman
  • Font size
  • Print
  • 4 comments

Two companies recently pitched me on their semantic engines. These are not search engines, which is what most people think. Rather, they are databases and algorithms that hold the structure of language (in both cases, the English language). At the most basic level semantic engines tell you what's synonymous with what. At the advanced end of the spectrum they know how grammatically similar phrases like "take a seat," "take a stand," and "take a lollipop," mean completely different things.

These engines can be used by search products to greatly improve results. Powerset, now a part of Microsoft, made a big deal of its semantic chops by showing how vaguely worded search queries would return just the results you wanted. Now, it seems, that raw semantic technology is about to become mainstream.

Cognition recently announced its "world's largest semantic map of the English language," sporting more than "10 million semantic connections." The company is rolling the technology into products like CognitionSearch for the Enterprise, which is a knowledge mining tool, as well as an "eDiscovery" product for the legal industry that enables lawyers to "quickly and efficiently find incriminating, smoking gun documents." The company is also applying its technology to a new advertising engine.

The much smaller and newer company, Eeggi, which I was introduced to at Web 2.0 Expo in New York, is also building an engine for discerning meaning. Founder and chief scientist Frank Bandach told me his model was mathematical (his training is as a prime number theorist) and that his engine goes well beyond understanding synonyms. In his demo, he entered the query "Mary kissed John," and showed how traditional word-matching engines picked up pages there were also about John kissing Mary. His system understands English well enough to filter those out as misses.

Bandach says that he's got most of the English language in his system, and that he did English first, "because it's hard. Only Finnish is harder." He's going to work on German next, by feeding it some German dictionaries, which sounds like a science-fiction way to seed a semantic engine, but he said it's enough to get the system going. Bandach says his algorithms are efficient and not, like Powerset's, CPU hogs.

Unlike Cognition, Eeggi is an early-stage project with only four people working on it. It's far too early to tell if the technology is robust and scalable enough to compete with Cognition or Powerset. But I am encouraged to see small companies working on this problem and claiming intellectual breakthroughs. I really would not be surprised to see "meaning engines" become available to Web developers in the same way spelling checkers and grammar engines are now. I have no idea what developers will build with this technology, but I can't wait to see it.

See also: Cycorp.

Click here for full coverage of Web 2.0 Expo

Rafe Needleman writes about start-ups, new technologies, and Web 2.0 products, as editor of CNET's Webware. E-mail Rafe.
Add a Comment (Log in or register) (4 Comments)
  • prev
  • 1
  • next
by BenjaminWright September 19, 2008 8:21 AM PDT
Rafe: In the legal world, e-discovery is a big topic. One response to e-discovery is to fear it and try to minimize the quantity of e-records you possess. An alternative response is to create and possess oceans of e-records and use advancing tech like semantic engines to tame those oceans in such a way that you achieve advantage over your legal adversaries. --Ben <a href="http://legal-beagle.typepad.com/wrights_legal_beagle/2008/08/e-discovery-strategy-search-artificial-intelligence.html">http://legal-beagle.typepad.com/wrights_legal_beagle/2008/08/e-discovery-strategy-search-artificial-intelligence.html</a>
Reply to this comment
by lgcii September 20, 2008 1:21 AM PDT
Personally, I'd be much more excited if this semantic ability were being used to help my next voice recognition software upgrade not choose nonsensical sequences of words as its best guess of what I said. My brand new version of the supposedly best common voice recognition app for PCs still exhibits this annoying problem about every two to three lines I dictate (and this occurs using one of the highest recommended headsets for that software).

YMMV, but I don't lose nearly as much time finding the right things on my web searches as I do making corrections when I'm dictating letters to my computer. On average the process of making corrections to the program's mistakes eats up perhaps a third of the time I spend composing such letters. That lost time I find very frustrating.
Reply to this comment
by jseville September 20, 2008 4:33 AM PDT
I'm all for better search engines and this is fascinating. More accurate search would only help me in business as a real estate blogger. However when I search Cognition for "Arlington Virginia condos" my site does even show up even though by far it has the most information about the subject for any "searcher" looking for info on the topic. This tells me they still have a long way to go with their algos or indexing.

jay
JustNewListings.com
Reply to this comment
by CharlesSKnight September 29, 2008 7:53 AM PDT
Hello,

We are very close to Cognition and eeggi. May I please republish this on AltSearchEngines.com with full attribution?

Thanks,

Charles Knight, editor
AltSearchEngines.com
Charles@readwriteweb.com
Reply to this comment
(4 Comments)
  • prev
  • 1
  • next
advertisement

About Webware

Say No to boxed software! The future of applications is online delivery and access. Software is passé. Webware is the new way to get things done.

Add this feed to your online news reader

Webware topics

As alternative energy grows, NIMBY greens

With more renewable energy projects trying to come online, the country grapples with the balance between local land use and a national push for clean energy.

Google to remake programming with Go

A Unix co-creator is among those behind a language Google hopes will speed computers and programming. Today, Go becomes open-source software.

Inside CNET News

Scroll Left Scroll Right