May 12, 2009 5:15 PM PDT

Google embraces exercise in Semantics

by Tom Krazit
  • Font size
  • Print
  • 2 comments

Google's adoption of Semantic Web-inspired technologies could help finally spur adoption of the long-discussed concept. In this example, the ratings assigned to a restaurant are shown.

(Credit: Google)

Although Google has traditionally downplayed the importance of the Semantic Web, the company took a noticeable step toward embracing it Tuesday.

Google introduced an enhancement to its search results at Searchology 2009 that uses technologies commonly associated with the Semantic Web, or the concept of making Web pages more discoverable and understandable to computers. Calling it "Rich Snippets," Google's Marissa Mayer and Kavi Goel demonstrated how Google is working with tech publishers (including CNET) to display new types of information in search results such as the number of stars assigned to a particular gadget.

The Semantic Web is a project spawned by World Wide Web Consortium director Tim Berners-Lee in hopes of making the Web smarter, more attune to meaning, and able to deliver information with little human prodding. The idea is to move beyond the limits of common Web pages--which can display information to a Web browser but are unable to broadcast the meaning and significance of that information to search engines and other computers--theoretically making for more relevant search results.

It has proven harder to do that in practice, however, than in theory. There are several formats that have been evangelized for the Semantic Web, and without incentives to add such formats to their Web pages (let alone make a bet on one or the other) publishers haven't made much progress since Berners-Lee began advancing the concept in the late 1990s.

And that's especially true if the 800-pound gorilla of the Web isn't keen on providing those incentives. Google has appeared cold to the idea several times in the past, as recently as earlier this year when researchers opined: "The first lesson of Web-scale learning is to use available large-scale data rather than hoping for annotated data that isn't available." Or, in short, work with what you've got when it comes to organizing the Web.

But rival Yahoo has embraced Semantic Web technology with its SearchMonkey project, encouraging Web publishers to add data to their sites that helps improve the quality of search results. Even if it's had to make a few tweaks, it has started getting Web sites used to the idea of adding structured data, or semantic technology, to their code.

Google's embrace of such technology gives it an immediate boost, and presents a few initial worries. Google's heft on the Web means it can get companies that depend on the quality of its search results (such as CNET and Yelp, featured in the demonstration) to follow its lead and force others to follow suit to compete.

But publishers must be sure that Google has backed the right horse to feel totally confident in its approach. Google's Rich Snippets can decode two different Semantic Web technologies--RDFa and microformats--to produce its enhanced search results, which seem to be popular among those advocating this approach to Web development.

The company said it would not penalize publishers who do not participate in the Rich Snippets program by declaring their Web pages less relevant, which theoretically makes this an opt-in program. But Google's Goel acknowledged that Web pages enhanced with Rich Snippets could see higher click-through rates, which would improve their relevance in Google's algorithm.

Backers of Semantic Web technology will likely find both something to cheer and cause for concern with Google's announcements on Tuesday. That seems par for the course when it comes to gauging the impact of decisions made by one of the most important companies in the tech world.

Tom Krazit writes about the ever-expanding world of Internet search, including Google, Yahoo, online advertising, and portals, as well as the evolution of mobile computing. He has written about traditional PC companies, chip manufacturers, and mobile computers, spending the last three years covering Apple. E-mail Tom.
Recent posts from Digital Media
China arrests thousands in Web porn crackdown
When policemen are caught looking at Web porn
Time Warner Cable shows subscribers how to cut cord
Want to see Google's new phone on YouTube?
AT&T cuts Tiger Woods
Online holiday sales hit $27 billion
Amazon touts top products of 2009
Teen Muziic founder chastised by Vevo
Add a Comment (Log in or register)
by citizencontact May 12, 2009 8:45 PM PDT
Perhaps what Google is actually doing is using XML objects that can be identified within an XHTML document. In this way, Google can process "snippets" separately from the rest of the page. This would sort of be like if someone could find the meat of a page by seeing the <div id="content" ..... portion of a page. I think that this is a step toward a more machine readable use of web pages as opposed to most RSS feeds that are solely machine processable and not human readable.

I am hoping that this pushes toward a trend of web pages using valid XHTML and using standards for expressing parts of the page as well as identify objects within a page. RDFa is the Semantic format, but also allows for an object model (XQuery vs. SPARQL).

What I really think is that we are finally approaching when people will realize that the World Wide Web is a one big object database ( http://advocatehope.org/tech-tidbits/theory-of-the-web-as-one-big-database ).

Daniel Bennett
http://advocatehope.org
Reply to this comment
by t8 May 13, 2009 2:34 PM PDT
My only concern is allowing my web pages to be sliced up and used by other people, thereby rendering my advertising revenue as less, and in turn creating less reason to make my site better.

What do others think?
Reply to this comment
advertisement

15 sites that went kaput in 2009

Web sites launch all the time, but they also shut their doors. We highlight 15 that bit the dust this year.

Top 10 news stories of the decade

Let the debate begin: Was the iPhone more important than iTunes? Was anything bigger than Google finding a great business model? CNET offers its list of the 10 most important stories of the '00s.

About Digital Media

The Web is now the place to go for news and entertainment. Look here for the latest on blogs, music, video, virtual worlds, social networking and more.

Add this feed to your online news reader

Digital Media topics

advertisement
advertisement

Inside CNET News

Scroll Left Scroll Right