• On CHOW: His burger will EAT your burger
March 18, 2008 11:01 AM PDT

BookLamp is a 'Pandora for books'

by Josh Lowensohn
  • Font size
  • Print
  • Post a comment

BookLamp is a project that the people at Amazon.com would be idiots to pass up buying.

It's a machine learning tool that's been designed to go through books and analyze not only how they're written, but also help group together novels that share similar structures and styles. The hope is to help people discover books they may like based on previously read novels, or what kind of reading experience they're going for. Internet radio recommendation service Pandora does something similar, employing a thumbs up and down system combined with listening history.

Because BookLamp's system uses machine learning, it skips the three major aspects of each book that humans usually tally: story line and plot, the characters, and writing style. Instead, it figures out bits of these three items by using written cues and quantifiers like word density, pacing, action, character dialogue (as noted by quotations), and level of description. The system also blends in one to five star ratings from Amazon.com.

(click to enlarge) BookLamp's stats analysis looks at a book and figure out how it's written based on five factors.

(Credit: CNET Networks)

So far, the database has 179 books, but is tracking more than 700,000 data points over 30,000 scenes from those titles. If it were to scale to track more works, in theory the results for related items would be even more precise. In its current state, users can go in and pick from one of the titles and get recommendations for similar titles, or view the graphs of what the system has recorded for its pacing, density, and other characteristics.

One of the coolest features, and the one I think is the killer app is the pacing analysis. It will go through and figure out when the pace of a book speeds up or slows down.

In the video demo (embedded after the break), creator Aaron Stanton picks Michael Crichton's Jurassic Park as an example, and demonstrates that BookLamp was smart enough to detect when the pace ramps up, including on what page that change occurs. I could see this being a great way to check and see if you're wasting your time on a read that's off to an incredibly slow start and potentially going nowhere. Instead of giving up, you could simply give the chart a quick look.

The project has been around since 2003 and continues to build up its database. There's a sign-up form to request a work to be added. You can also play around with the browsing and stats tool by registering. Be sure to hit the read more button to check out the video walk-through.

[via Digg]

Josh Lowensohn writes for Webware.com, CNET's blog about Web applications and services. E-mail Josh, or follow him on Twitter at http://www.twitter.com/Josh.
advertisement

About Webware

Say No to boxed software! The future of applications is online delivery and access. Software is passé. Webware is the new way to get things done.

Add this feed to your online news reader

Webware topics

Tech at the Olympics: 'No room to fail'

Q&A The Olympics relies on thousands of servers and PCs to manage all the athletes and scores. Magnus Alvarsson is the guy who must make sure everything works.

How CoverItLive lost it on iPad day

The live-blogging tool fell apart under the strain of a Steve Jobs keynote. Here's what happened, and what comes next for the company.

Inside CNET News

Scroll Left Scroll Right