• On TechRepublic: 10 cool USB flash drive tricks
May 20, 2009 8:07 AM PDT

Wolfram Alpha and its architecture of failure

by Matt Asay

Please see the response from Wolfram|Alpha at the bottom of this post.

One thing has become clear: to succeed on the Web and in the next generation of software, you need to invite, not dissuade, outside participation. Tim O'Reilly calls it an "architecture of participation," but whatever you call it, the best software strategies are those that encourage outside contributions, rather than discourage it.

This makes Wolfram Alpha's terms of service mind-boggingly backward at best, and troubling at worst. Some have pointed to the quasi-search engine's sometimes weird results as a reason to give the service a pass, but there's a far more fundamental reason to reject Wolfram Alpha , as Groklaw suggests.

Wolfram Alpha demands citation when using the results of its "searches," which is a distinct departure from Google's "use pretty much as you please" attitude, and will almost certainly curb the appeal of Wolfram Alpha, no matter how good its output becomes. Groklaw writes:

Wolfram's Terms of Use are not at all what I would expect from a search engine, probably because that isn't exactly what Wolfram Alpha is providing. It's a computational service, at least in some cases providing computational output from various sources of data that perhaps never existed until you asked your question. So, they claim copyright on the results and require attribution. That's fine with me, so long as the information provided really is uniquely theirs and not just the answer to what is meaning of life and everything, but it is different from what I'm used to from Google and other search engines, so it is counter-intuitive, something to be aware of before I include Wolfram Alpha output in a presentation on Groklaw or in a book.

In other words, Wolfram Alpha requires: "If you make results from Wolfram Alpha available to anyone else, or incorporate those results into your own documents or presentations, you must include attribution indicating that the results and/or the presentation of the results came from Wolfram Alpha." It's a fair request, but it may not be a reasonable request. Not if Wolfram Alpha wants people to actually make widespread use of the service.

Groklaw concludes that this requirement "means Wolfram Alpha will never replace Google," which is absolutely correct. Even if Wolfram Alpha delivers better "search" results, the burden of figuring out and delivering proper citation is going to keep people using Google, which doesn't make the same fetish of proclaiming its ownership of search results.

Wolfram Alpha may well prove to be the best computational search engine on the planet. But until it learns to lose the heavy hand of enforced citation, it's going to struggle to become a first-choice search tool.

UPDATE @ 12:48 PT. Wolfram|Alpha's Theodore Gray contacted me with the following commentary on my post above.

Hi, I'm the person who wrote most of the language in the Wolfram|Alpha terms of use, and I want to correct a couple of things you got from Groklaw.... There are two basic confusions they have perpetrated: First, Google does copyright its results pages just like we do in all comparable cases (e.g. images.google.com, news.google.com, etc).

And second, Wolfram|Alpha is not a search engine: We don't return "search results", we generate original content including plots and graphs. A more comparable situation would be Google Maps, which Google claims copyright on just like any other map provider. Similar, look at pretty much any site that generated financial trend charts, weather charts, you name it, they are always copyrighted. We are not doing anything more aggressive or grabby than Google or other widely used websites: Groklaw simply got it wrong.

I would be happy to answer any further questions you might have, but please, if you're going to criticize my beautiful terms of use language, please at least read it carefully first, and understand that it's not talking about search results. I'm happy to entertain criticism, but not if the starting point is that we're more restrictive than Google, because that's simply not the case.

My response? This misses the point of my post. As I wrote back to Theodore:

Thanks, Theodore. I quoted your terms of use directly - not sure how I can do more than that? Reading Groklaw's take and your note below, I stand by what I said. I absolutely think you have the *right* to require attribution, and I understand the reasons for doing so....

My argument is that by insisting on this, or at least not making it brain-dead easy for your users, you're going to a) make it difficult to enforce because you'll spend all your time chasing infringers and b) induce people to try to use an imperfect replacement for your service (like Google or other) because citation becomes too cumbersome.

You can resolve a and b by simply making citation automated in some way. I would be absolutely for that: I'd love to use results from Wolfram|Alpha in my work, but I doubt I'm going to want to chase down the citation every time. I don't think you want to burden "distribution" of your results: I think you therefore need to find a way to make it easy for users to show that the results came from your service, without making them do all the legwork. I'm not smart enough to know how to do that, but I think you/your team probably are/is.

The point, in short, is not whether Wolfram|Alpha has the right to do this, but simply that doing so may negatively impact adoption of its service. It's similar to copyright assignment in open source: David Neary recently made the compelling argument that copyright assignation can hurt community adoption of open-source software, but that companies may need to do it, anyway.

It's a trade-off, but in Wolfram|Alpha's case, I believe there are ways for it to make this trade-off less burdensome on users, thereby inviting participation and not unduly hampering it.


Follow me on Twitter @mjasay.

Matt Asay brings a decade of in-the-trenches open-source business and legal experience to The Open Road, with an emphasis on emerging open-source business strategies and opportunities. Matt is vice president of business development at Alfresco, a company that develops open-source software for content management. He is a member of the CNET Blog Network and is not an employee of CNET. Disclosure. You can follow Matt on Twitter @mjasay.
Recent posts from The Open Road
Mobile: Still waiting to see what sticks
Google privacy controls: Most people won't care
Amazon's move mocks EU's fear of Oracle
Skype to open-source far too little
The difference a few years makes to open source
Novell cuts 3 percent of its workforce, plus benefits
Data's one-two punch in open-source business models
Open source as an antitrust strategy
Add a Comment (Log in or register) Showing 1 of 2 pages (24 Comments)
by wperry1 May 20, 2009 8:53 AM PDT
Call it what you will... a "computational service", a search engine or a widget. If Wolfram Alpha is used to find information on other websites it will have a very difficult time laying copyright claim to the results.
Reply to this comment
by pentest May 25, 2009 6:01 PM PDT
You obviously don't know what wolfram alpha is.
by waynerifish May 20, 2009 8:56 AM PDT
I don't think Wolfram Alpha was ever intended to replace or even directly compete with Google. They are totally different services. The Terms of Service are different because the service is different.
Reply to this comment
by sdf0013 May 20, 2009 10:06 AM PDT
I've wondered this myself. Like you, I see them as very different services. I can understand their desire, especially from a marketings perspective, the desire to be sited as the source. But, certainly there cannot be a copyright issue since the results are not their own - merely regurgitated. Also, if anyone were to use these results in a presentation, you need to quote the source, not the delivery -- meaning if it came from government survey data your audience needs to know that rather than you found it on Google or Wolfram, etc.

Still. I do like the concept of what they are trying to do. It's very Star Trek-ish in simply asking the computer a question and having the answer pop-out. If you think about it, as we continue to gather more and more information, data, etc. finding it and being able to produce an answer is going to get harder and harder. We simply have to have better tools to be able to do this. We've come a very long way from the card catalog and the encyclopedia. Our research tools have to be able to keep up with the data that we're able to collect and maintain.
by aMUSICsite May 20, 2009 9:03 AM PDT
It's only version 1 and it's not supposed to be a Google killer, Wiki would be a better comparison anyway. As it's trying to give answers to answerable questions without the fear the some kid might have just changed the whole article (Wiki) or getting noting but sales pitch/latest new stories (Google).

I think it will grow over time and be a valuable tool like Wiki, but it's not going to replace either Google or Wiki it will be something in between that get's used when you are after the kind of facts it turns up well.
Reply to this comment
by divisionbyzero May 20, 2009 9:22 AM PDT
Uh, Wolfram Alpha could just auto-generate a proper citation when it does a computation that results in a unique result rather than a collation of disparate results. It should be pretty easy.
Reply to this comment
by Amyaz May 20, 2009 9:26 AM PDT
it's actually fairly stupid to post this type of information without a citation. if, say, i post info based on a calculation in a report, and you later want to know "what the number really means," you'll waste quite a bit of time if i haven't noted the original source....
Reply to this comment
by vamman May 20, 2009 9:44 AM PDT
Its a search engine designed to keep idiots out.

I have no issue with citing them in a report.

What the hell is your problem CNET?
Reply to this comment
by J. Blow May 20, 2009 1:51 PM PDT
How do you know your not considered an idiot? Pompus at best.
by Hunnter2k3 May 20, 2009 9:51 AM PDT
The instant the first person gets in trouble for not citing them, the instant it dies.
Reply to this comment
by sroussey May 20, 2009 9:58 AM PDT
Are you serious? This is the "failure"? They create, essentially, a research report for a someone that wants to claim it as their own, and can't because they have to cite it? Sheesh...

Gosh, soon sites like this will put something into their TOS that forbids site scrapers from copying articles without license (or citation!)... What WILL the world come to...
Reply to this comment
by orlandorr May 20, 2009 10:16 AM PDT
What is really terrible about Wolfram Alpha is it's NAME.

Too long and strange. They've basically alienated 90% of the world audience with it.
Reply to this comment
by Maximus_000 May 20, 2009 10:17 AM PDT
Are you serious? Means if I am give input 2+2 then result 4 is copyright property?
I am going to get copy rights of 10 so every time they display 10 they should attribute it to me.....
Lolx Great Idea Wolfram.... I assume you were sleeping when God was distributing common sense....
Reply to this comment
by Magallanes May 20, 2009 11:17 AM PDT
The main trouble with Wolfram Alpha is the lack of market study. Wolfram Alpha is pointed for some academic and scientific level of user, but those people have better tools to calculate a single 1+1 or to do a search of Yahoo Finance to check about some share.
Reply to this comment
by ueno54 May 20, 2009 12:16 PM PDT
You say:
"the burden of figuring out and delivering proper citation is going to keep people using Google"

But it's pretty clear from the terms of use:
"Whenever possible, such attribution should take the form of a link to Wolfram|Alpha"

You're complaining about linking.
Reply to this comment
by albertsoler May 20, 2009 12:36 PM PDT
"Terms of Use" issues seem to be popping up all over these days. I never bothered to read the terms for WolframAlpha. Apparently, only journalists and lawyers are the only ones who do anyway. Copyrighting the result does seem to be a strange concept regardless on how it was determined.

I wanted to know which element had the highest melting point and was impressed on the speed and representation of the result. But, do they now have a copyright on the answer (which was carbon at 6422 degrees)? Is every single article, web page, periodical, text book, science report/term paper, (*or even this post?), etc., with this bit of information suddenly infringing on this copyright? Very strange indeed. But, I don't think so. Citing the source of information is not unreasonable and a very old practice; except for plagiarists.

Still, we now have one more complication to add to old, out-dated, copyright, trademark, and patent laws.

*If this post does get DMCAed by WolframAlpha -- please let us all know. It would be high-larious!
Reply to this comment
by vikinzer May 20, 2009 12:37 PM PDT
I've taken a look at Wolfram Alpha after reading this and I have to say I don't see a lot of legitimacy in your argument. Here is why. The service bears little to no resemblance to Google. It is not a search engine of other people's content. It is a search engine of a database of information they hold. You cannot copyright information. So if you input 2+2 and get 4 legally they can't actually ask you to cite that. If however, you get a graph, or a chart, or some other presentation of data that the site generated, then they do hold copyright to that content. It isn't a link to external content someone else created, it's unique content.

Asking for credit isn't really a big deal. This is basically the same as an electronic searchable encyclopedia that makes use of modern dynamic generation technologies. While I think that their legal ability to lay claim to say the population of Bloomington, IN (my hometown) is questionable, I also don't think they are going to prosecute anyone who just lists the number.

I also think this mentality of citation comes from an academic culture. Someone who would invest in a research resource like this likely comes from an academic background. At the very least even if the founders aren't academics they involved a lot of academics in collecting the information used and deciding what was worth including. The net is a place where people read and accept, a lot of stupid rumors have circulated as a result of that aspect of net culture. I personally have no problem with an organization that wants to promote a culture of citation and making sure your facts are indeed correct. It lacks the instant gratification of net culture, and takes a little more time out of our day, but maybe we should all slow down and realize that might not be such a bad thing.
Reply to this comment
by ikjadoon May 20, 2009 1:19 PM PDT
EBSCOHost has a fantastic citation system.

If you find an article, you basically click the button and a smooth citation dialog comes up with all the available types of citation are able to be copied/pasted. It has MLA, APA, AMA, Chicago, Vancouver,etc.

It works fantastically well. :D
Reply to this comment
by jmfb_k7 May 20, 2009 1:33 PM PDT
Wolfram Alpha is not a search engine and has never claimed to be one. Only CNET and Wired call it a search engine (because they didnt read the FAQ's posted on the site. They just saw the interface and made an assumption).
Wolfram Alpha does not have aspirations to replace Google. But it would be wise for Google to licence the service (providing that they are willing to properly citate the results).
Reply to this comment
by monkeyfun14 May 20, 2009 5:49 PM PDT
"Groklaw concludes that this requirement "means Wolfram Alpha will never replace Google," which is absolutely correct. Even if Wolfram Alpha delivers better "search" results, the burden of figuring out and delivering proper citation is going to keep people using Google, which doesn't make the same fetish of proclaiming its ownership of search results."

How does anyone perceive its trying to compete with Google in the first place? They are totally different search engines...
Reply to this comment
by Sir_Sid May 20, 2009 7:27 PM PDT
Your comparison is very flawed. Wolfram Alpha should have a different policy because it takes commonly known data and puts it in a presentable form. If you use that data you should cite it. What is so hard about citation anyway
Reply to this comment
by chaseneb May 21, 2009 8:02 AM PDT
How hard can it be to add a auto-cite linK? Wikipedia already does it for you. Wikipedia even lets you choose the formatting style you'd like to use for your citation. It can't be that hard. The only concern would be that depending on your search parameters the search results would change for the person who checked the citation. If you search "here" or "today" you get results based off of where or when you search. Ideally the autocitation would lead you back to the EXACT results that are cited instead of a link that would change daily.
Reply to this comment
by mnemopolous May 22, 2009 12:52 PM PDT
Would the Wolfram Alpha terms of service make more intuitive sense if the service were offered on a subscription-based (rather than ad-based) revenue model?

I haven't played with Wolfram Alpha too much; does it rely on crowdsourcing (like Google or Wikipedia) for its data? Does Wolfram Alpha cite its sources? Google does (and that's really the point of the thing)...
Reply to this comment
Showing 1 of 2 pages (24 Comments)
advertisement
Click Here

FAQ: Buying the right Windows 7 upgrade

Readers still have lots of questions on just which version of the software they need to buy in order to upgrade their PC. CNET News tries to offer some answers.

N.Y. lawsuit details Intel's 'largesse' toward Dell

Attorney General Andrew Cuomo's federal antitrust case filed Wednesday alleges a longstanding symbiotic relationship between Intel and Dell.

advertisement

About The Open Road

Matt Asay brings a decade of in-the-trenches open-source business and legal experience to the Open Road, with an emphasis on emerging open-source business strategies and opportunities. Matt is general manager of the Americas division and vice president of business development at Alfresco, a company that develops open-source software for content management. He is a member of the CNET Blog Network and is not an employee of CNET. Disclosure.

Add this feed to your online news reader

The Open Road topics

advertisement
advertisement

Inside CNET News

Scroll Left Scroll Right