• On CBS MoneyWatch: Why Debit Cards Are Dangerous
May 28, 2009 6:00 AM PDT

Semantic technology gains publishing foothold

by Stephen Shankland
  • Font size
  • Print
  • 2 comments

OpenCalais, a Thomson Reuters project to improve electronic publishing by adding computer-readable labels to content, has attracted the attention of several media publishing organizations, including CNET.

The OpenCalais product, available in a free or a more sophisticated paid form, adds labels to content through a technology called semantic analysis. By adding descriptive labels, computers at least theoretically can understand what they're processing beyond just the raw text in a news story or photo caption, for example by recognizing addresses or names.

CNET, publisher of CNET News, is using OpenCalais' service to augment its product reviews and news, the companies plan to announce Thursday. CNET will use the technology to improve features such as searching, spotlighting content related to what a reader is viewing, and enabling programmatic use of its content over the Web.

Others using the technology include The HuffingtonPost and DailyMe, two other online news sites. DailyMe automatically sends its content through OpenCalais' servers, which labels the content with categories such as people, medical conditions, or companies and with specific elements of those categories, said Neil Budde, president and chief product officer.

"It allows us to build picture of news user's behavior to implicitly personalize the site for them," Budde said, adding that automated personalization features are scheduled to arrive in about a month. The company plans to license its service to other news sites, he added, and improve advertising targeting based on the same personalization information.

A closely related technology, the semantic Web, in which elements of Web pages are labeled with computer-readable coding to help computers better understand the meaning of the content, has been around for years. It's only now beginning to gain adoption as a real-world technology because of two big reasons, though: Yahoo and Google.

A year ago, Yahoo announced its search engine had begun recording semantic Web tags and could spruce up those pages' appearance in search results through Yahoo's SearchMonkey technology. Then, in May, Google announced a similar move with both indexing and display of pages in search results. OpenCalais, however, offers technology that creates online content that search engines discover through conventional means of analyzing text.

The tagging seems to help search engines find the company's content and spotlight it in search results, Budde said. "We create a lot of topics pages on the fly based on entities that come in from Calais, and those get pretty good pickup through search engines definitely," he said.

Paul Perry, The Huffington Post's chief technology officer, has begun using OpenCalais' service in the company's publishing system. When a story mentions a specific location or company, for example, OpenCalais' service suggests to editors the ability to associate the story with a specific geographic location or to add a specific company's stock ticker, Perry said.

That explicit labeling makes it easier for local editors--Chicago so far is the only city with localized Huffington Post news, though more areas will arrive this summer--to spot geographically relevant information, he said. "For us, local is super important. We're doing a ton of work for it," he said.

Semantic technology fans will convene starting June 14 for the Semantic Technology Conference in San Jose, Calif., at which Thomson Reuters' Tom Tague is scheduled to deliver a keynote speech.

Stephen Shankland writes about a wide range of technology and products, but has a particular focus on browsers and digital photography. He joined CNET News in 1998 and since then also has covered Google, Yahoo, servers, supercomputing, Linux and open-source software, and science. E-mail Stephen, or follow him on Twitter at http://www.twitter.com/stshank.
Recent posts from Digital Media
AT&T gets Luke Wilson to hit Verizon again
ComScore: Online video scores another big month
The browser battles go on and on
NBA star won't tweet until he has 1 million followers
Judging the top 10 Internet moments of the decade
IKEA's brilliant Facebook campaign
IBM staffer posts pics on Facebook, loses benefits
Google to track TiVo viewing habits
Add a Comment (Log in or register)
by kristadthomas May 29, 2009 5:42 PM PDT
Thanks Stephen -- just a quick note that our OpenCalais Marmoset makes it easy for bloggers, publishers and sites of all kinds to "feed" both Yahoo! SearchMonkey and Google Rich Snippets.

Find it here: http://www.opencalais.com/Crawler

Thanks,
-Krista
The OpenCalais team
Reply to this comment
by kristadthomas August 25, 2009 11:16 AM PDT
Just a quick update. We?ve been busy ramping up the OpenCalais service to keep pace with demand from partners like CNET, HuffingtonPost, DailyMe and more.

That?s why today we?ve increased the daily transaction allowance for OpenCalais to 50,000 transactions per day ? a 25% increase over our previous daily limit.

Of course, OpenCalais continues to be offered at no charge for commercial or non-commercial use.

Best,
-Krista
The OpenCalais team
Reply to this comment
advertisement

The browser battles go on and on

roundup From Firefox to IE and from Chrome to Opera and Safari, there's no sitting still for browser makers looking to keep their products fresh and competitive.

3G wireless still holds promise

The next generation of 4G wireless may get all the headlines, but advanced 3G technology will likely dominate services for the next few years.

About Digital Media

The Web is now the place to go for news and entertainment. Look here for the latest on blogs, music, video, virtual worlds, social networking and more.

Add this feed to your online news reader

Digital Media topics

advertisement
advertisement

Inside CNET News

Scroll Left Scroll Right