Google Claim: Make Algorithms Smart through Data, not Complexity

Google researchers Alon Halevy, Peter Norvig, and Fernando Pereira have an article in IEEE Computer magazine entitled “The Unreasonable Effectiveness of Data”.  The article continues a theme that has been running strong within Google circles for the past half decade, namely that training a simple algorithm with larger amounts of data is more effective than having a smart algorithm that tries to generalize or draw inferences from smaller amounts of data:

Continue reading

Posted in Information Retrieval Foundations | 3 Comments

Controversial Views and Web Search

Daniel Tunkelang continues to raise provocative and interesting questions over on his blog.  I would like to point readers to the comments section of a recent post.  In one of my own comments there, I raise a question about ad-supported web search engines (as typified by, though by no means limited to, Google) and their willingness and ability to switch business models.  In particular, I express the following consternation:

Continue reading

Posted in Information Retrieval Foundations, Social Implications | Leave a comment

Media Gatekeepers and Transparency

PBS has an interesting article on the new media gatekeepers and the need for transparency in the process by which they promote media.  Here is an excerpt:

The problem for these new gatekeepers is that they are providing the old editorial functions, but there’s a key difference between the way they operate and the way that movie critics, music reviewers and video store clerks operate: They are making editorial decisions without telling us who they are, what they like and how they are making those decisions. Otherwise, we will be left to wonder, left to come up with our own conspiracy theories, and we will lose trust in these services.

I believe this need for transparency is true not only for Twitter, Apple and YouTube, but for all types of search, including general web search.  Search engines need to get better at explaining why results were retrieved, lest users begin losing trust in those engines, or find themselves ultimately unable to locate the information they desire because they cannot correctly express their information needs.
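To make “explaining why results were retrieved” concrete, here is a minimal sketch of what such an explanation could look like: a toy TF-IDF ranker that returns, alongside each score, the per-term contributions that produced it. The corpus, query, and weighting are all invented for illustration; a production engine obviously combines far more signals than this.

```python
# Sketch: attach a per-term "why did this match?" breakdown to each result.
# Toy TF-IDF scoring over an in-memory corpus; purely illustrative.
import math
from collections import Counter

docs = {
    "d1": "exploratory search interfaces for music discovery",
    "d2": "collaborative filtering for music recommendation",
    "d3": "transparency in search result ranking",
}

def idf(term):
    df = sum(term in text.split() for text in docs.values())
    return math.log((1 + len(docs)) / (1 + df)) + 1

def explain(query):
    for doc_id, text in docs.items():
        tf = Counter(text.split())
        contributions = {t: tf[t] * idf(t) for t in query.split() if tf[t]}
        score = sum(contributions.values())
        if score:
            yield doc_id, score, contributions  # expose *why*, not just the rank

for doc_id, score, why in sorted(explain("music search"), key=lambda r: -r[1]):
    print(doc_id, round(score, 2), {t: round(w, 2) for t, w in why.items()})
```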

Posted in Explanatory Search, Exploratory Search, Social Implications | 1 Comment

Music Retrieval: Algorithms or Explanatory Context?

At SXSW this year, Paul Lamere of The Echo Nest and Anthony Volodkin of Hype Machine engaged in a head-to-head panel about the utility of:

  1. Using computer algorithms (e.g., collaborative filtering, tag-based, or content-based techniques) to automatically recommend music, versus
  2. Using computers to (a) connect people who can directly recommend music to each other and (b) provide contextually relevant information around any shared songs

Perhaps I don’t fully appreciate the subtlety of the conflict, but I find myself wondering: Why can’t you do both?
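As a rough illustration of what “both” could mean in code, here is a toy sketch that blends an algorithmic similarity score with recommendations from people you follow, and keeps the surrounding context attached to each track. Every name, weight, and data structure below is hypothetical; the point is only that the two approaches compose rather than compete.

```python
# Sketch: why not both? Blend an algorithmic score with recommendations
# from people you follow, and carry the explanatory context along.
# All names and weights are made up for illustration.
from dataclasses import dataclass, field

@dataclass
class Recommendation:
    track: str
    algo_score: float = 0.0                            # e.g. from collaborative filtering
    recommenders: list = field(default_factory=list)   # people who shared it with you
    context: str = ""                                   # blog post, review, scene, etc.

    def blended_score(self, w_algo=0.6, w_social=0.4):
        # Cap the social signal at five recommenders so it stays in [0, 1].
        return w_algo * self.algo_score + w_social * min(len(self.recommenders), 5) / 5

recs = [
    Recommendation("Track A", algo_score=0.92,
                   context="similar listeners also played this"),
    Recommendation("Track B", algo_score=0.40,
                   recommenders=["anthony", "paul"],
                   context="shared on a music blog you follow"),
]

for r in sorted(recs, key=lambda r: -r.blended_score()):
    print(f"{r.track}: {r.blended_score():.2f} -- {r.context}")
```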

Continue reading

Posted in Explanatory Search, Exploratory Search, Music IR | Leave a comment

Good Interaction Design Trumps Smart Algorithms

Over on the new CACM blog, researcher Tessa Lau has an interesting post on three common misconceptions that folks have about HCI.  I recommend reading the full article, but I would like to call attention to her provocative opening statement (emphasis mine):

I come to the field of HCI via a background in AI, having learned the hard way that good interaction design trumps smart algorithms in the quest to deploy software that has an impact on millions of users. Currently a researcher at IBM’s Almaden Research Center, I lead a team that is exploring new ways of capturing and sharing knowledge about how people interact with the web.  We conduct HCI research in designing and developing new interaction paradigms for end-user programming.

One of my biggest grievances with web-scale search engines is that they have made the assumption that smart algorithms (or, at least, simple algorithms trained with enough data to be made smart) are more important than good interaction design.

Continue reading

Posted in Information Retrieval Foundations | 4 Comments

Content-Based Audio Search

Long-time Music Information Retrieval researcher Pedro Cano has a new book out, based on his dissertation: “Content-based Audio Search: From Audio Fingerprinting to Semantic Audio Retrieval”.  From the review:

Music search sound engines rely on metadata, mostly human generated, to manage collections of audio assets. Even though time-consuming and error-prone, human labeling is a common practice. Audio content-based methods, algorithms that automatically extract descriptions from audio files, are generally not mature enough to provide the user-friendly representation that users demand when interacting with audio content. This dissertation has two parts. In the first part we explore the strengths and limitations of a pure low-level audio description technique: audio fingerprinting. In the second part, we hypothesize that one of the problems that hinders closing the semantic gap is the lack of intelligence that encodes common sense knowledge, and that such a knowledge base is a primary step toward bridging the semantic gap. We present a sound effects retrieval system which leverages both low-level and semantic technologies.
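For readers unfamiliar with the low-level end of that spectrum, here is a deliberately crude sketch of what an audio fingerprint can look like: hash the strongest spectral peaks of each frame into coarse time/frequency landmarks, then compare sets of landmarks. This is not Cano’s method, just a toy in that spirit; real systems add robust peak picking, peak pairing with time offsets, and an inverted index for matching.

```python
# Sketch: a crude audio fingerprint in the spirit of spectral-peak hashing.
# Illustrative only -- not the book's method.
import numpy as np

def fingerprint(samples, frame=2048, hop=1024, peaks_per_frame=3):
    """Hash the strongest spectral peaks of each frame into a set of landmarks."""
    hashes = set()
    window = np.hanning(frame)
    for i, start in enumerate(range(0, len(samples) - frame, hop)):
        spectrum = np.abs(np.fft.rfft(samples[start:start + frame] * window))
        top_bins = np.argsort(spectrum)[-peaks_per_frame:]   # strongest frequency bins
        for b in top_bins:
            hashes.add((i // 4, int(b) // 8))                 # coarse time/frequency buckets
    return hashes

def similarity(a, b):
    """Jaccard overlap of two fingerprints as a toy matching score."""
    return len(a & b) / max(1, len(a | b))

# Toy usage: a tone and a slightly noisy copy of itself should match well.
t = np.linspace(0, 2.0, 44100)
clean = np.sin(2 * np.pi * 440 * t)
noisy = clean + 0.05 * np.random.default_rng(0).standard_normal(len(t))
print(similarity(fingerprint(clean), fingerprint(noisy)))
```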

Continue reading

Posted in Music IR | 3 Comments

Evolutionary Thinking and IR Design

Just the other day I observed that Google, by thinking only evolutionarily and being unable to make leap-based changes, long ago fell into a local maximum trap.  The following blog post from a designer who is leaving Google appears to reinforce this conjecture:

When a company is filled with engineers, it turns to engineering to solve problems. Reduce each decision to a simple logic problem. Remove all subjectivity and just look at the data. Data in your favor? Ok, launch it. Data shows negative effects? Back to the drawing board. And that data eventually becomes a crutch for every decision, paralyzing the company and preventing it from making any daring design decisions.  Yes, it’s true that a team at Google couldn’t decide between two blues, so they’re testing 41 shades between each blue to see which one performs better. I had a recent debate over whether a border should be 3, 4 or 5 pixels wide, and was asked to prove my case. I can’t operate in an environment like that. I’ve grown tired of debating such minuscule design decisions. There are more exciting design problems in this world to tackle.
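The local-maximum trap I have in mind is easy to reproduce in miniature: greedy, metric-driven steps always climb the nearest hill, and a better hill farther away never even gets tested. The landscape and step size below are entirely made up; the sketch is about the shape of the process, not about any particular Google decision.

```python
# Sketch of the local-maximum trap: greedy, measurement-driven steps
# climb the nearest hill, which may not be the highest one.
import numpy as np

def quality(x):
    # A made-up quality landscape: a local peak near x=1, a higher peak near x=4.
    return np.exp(-(x - 1) ** 2) + 2 * np.exp(-(x - 4) ** 2)

def greedy_climb(x, step=0.1, iterations=100):
    for _ in range(iterations):
        candidates = [x - step, x, x + step]   # small evolutionary tweaks only
        x = max(candidates, key=quality)       # keep whichever "tests better"
        # A leap (say, x + 2.5) is never considered, so distant peaks stay invisible.
    return x

end = greedy_climb(0.0)
print(f"converged at x={end:.2f}, quality={quality(end):.2f} "
      f"(the global best sits near x=4, quality 2.0)")
```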

Continue reading

Posted in Information Retrieval Foundations | Leave a comment

Social ?= Collaborative

There is an interesting comment thread happening over on the FXPAL blog, about the differences between social search and collaborative search:

http://palblog.fxpal.com/?p=350#comments

Posted in Collaborative Information Seeking, Social Implications | Leave a comment

Long Term versus Evolutionary Thinking (Part 2 of 2)

Continued from Part 1.

Now that I’ve fully (perhaps too much so) explained the analogy that I will be using, I’d like to ground this discussion in the subject of information retrieval.  And I’ll start with an example that O’Reilly used in his talk: Google. (This is an Information Retrieval blog, after all, and Google was the example that Tim used.)  The company, he says, successfully exhibits both long term and evolutionary thinking.  It takes the long term view through its very mission statement: “To organize the world’s information”.  What could be more long term, more global, than that?  At the same time, Google has a very evolutionary approach in that it starts with simple, elegant solutions and couples them with ongoing user measurements.  If and when changes to Google’s engine are made, they are made based on small evolutionary steps that become apparent through the actions of the user.  It’s a point of pride within the Google organization that every change to the engine is scrupulously measured and A/B tested so as to be able to tell whether the change was better or worse.  The user provides the fitness function, the arrow that points in the uphill direction, toward which the search engine evolves. 
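For concreteness, the kind of measurement that provides that fitness function looks roughly like the sketch below: compare a control and a variant on some user metric and keep whichever tests better. The numbers and the two-proportion z-test are generic illustrations, not a description of Google’s actual experimentation tooling.

```python
# Sketch: the kind of A/B measurement that supplies the "fitness function."
# Generic two-proportion z-test on click-through rates; numbers are hypothetical.
from math import sqrt
from statistics import NormalDist

def ab_test(clicks_a, users_a, clicks_b, users_b):
    """Return the z statistic and two-sided p-value for CTR(A) vs CTR(B)."""
    p_a, p_b = clicks_a / users_a, clicks_b / users_b
    pooled = (clicks_a + clicks_b) / (users_a + users_b)
    se = sqrt(pooled * (1 - pooled) * (1 / users_a + 1 / users_b))
    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

# Hypothetical numbers: variant B nudges click-through rate from 10.0% to 10.4%.
z, p = ab_test(clicks_a=10_000, users_a=100_000, clicks_b=10_400, users_b=100_000)
print(f"z={z:.2f}, p={p:.4f}  ->  {'ship B' if p < 0.05 else 'keep A'}")
```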

So the question is, does Google suffer from this conflict between long term and evolutionary thinking?  My contention is that it does.

Continue reading

Posted in General, Information Retrieval Foundations, Social Implications | 8 Comments

Long Term versus Evolutionary Thinking (Part 1 of 2)

Last week I attended the O’Reilly eTech conference.  The first night, Tim O’Reilly gave his annual Radar talk, in which he surveys the technology landscape and comments on upcoming and interesting trends. I have heard this Radar talk for years, via the IT Conversations podcast network, but this was the first time I’d seen it in person. O’Reilly always has challenging, thought-provoking things to say, and this year was no different.  He did, however, mention two emerging trends or patterns that I thought contradicted each other, and I want to specifically comment on those.

Continue reading

Posted in General, Information Retrieval Foundations, Social Implications | 1 Comment