<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Information Retrieval Gupf &#187; Collaborative Information Seeking</title>
	<atom:link href="http://irgupf.com/category/collaborative-information-seeking/feed/" rel="self" type="application/rss+xml" />
	<link>http://irgupf.com</link>
	<description>Information Retrieval Research, Issues, and Discussion</description>
	<lastBuildDate>Fri, 25 Jun 2010 18:30:07 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Embark Together</title>
		<link>http://irgupf.com/2010/03/15/embark-together/</link>
		<comments>http://irgupf.com/2010/03/15/embark-together/#comments</comments>
		<pubDate>Mon, 15 Mar 2010 20:52:12 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Social Implications]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=1186</guid>
		<description><![CDATA[I would like to quickly follow up on my previous post on explicitly collaborative information seeking.  My claim in that post was that, despite the shared terminology, a service like Aardvark (or Twitter) is not truly collaborative.
Let me be clear about Aardvark: What that service does  is help you comb through a network of [...]]]></description>
			<content:encoded><![CDATA[<p>I would like to quickly follow up on my <a href="http://irgupf.com/2010/03/15/dont-forget-explicitly-collaborative-information-seeking/">previous post on explicitly collaborative information seeking</a>.  My claim in that post was that, despite the shared terminology, a service like Aardvark (or Twitter) is not truly collaborative.</p>
<p>Let me be clear about Aardvark: What that service does  is help you comb through a network of people to find those individuals  who have the highest likelihood of holding the answer to your  information need.  Somebody has the answer; you just don&#8217;t know who it  is.  So Aardvark helps you find that somebody.  The reason this is  different from what I am talking about with explicit collaboration is  that in this latter case, you already know who it is that you want to  work with on resolving a shared information need.  You want to work with  a relationship partner on finding an apartment.  You want to work with a  business colleague on finding potential markets for a new product.  You  want to work with some buddies on planning a road trip.  In all of  these situations, your partner, your colleague, and your buddies don&#8217;t  already have the answers that you seek.  But you do know that you want  to work with them to find those answers because they have the same need  that you do.  Your partner wants to live with you, your business  colleague wants to work with you, and your buddies want to travel with  you.  This is what explicitly collaborative information seeking is  about, and it&#8217;s not the same thing as the &#8220;collaborative&#8221; category  discussed in the panel.</p>
<p>Case in point: Take a look at the panel&#8217;s  slides: <a href="http://www.slideshare.net/bmevans/introductory-slides">http://www.slideshare.net/bmevans/introductory-slides</a>.   Slide 9 outlines the two main social strategies: (1) Ask the network,  and (2) embark alone.  This misses a third major, but as yet untapped,  strategy: (3) <strong>embark together</strong>.</p>
<p>A good way to think about this is in terms of information  seeking.  In both the (1) ask the network and (2) embark alone  strategies, there is only a single user with an actual information  need, a single person who is actively seeking information.  Using  Aardvark, he or she is asking other people in the network if they are able to  give an answer to satisfy that need.  But those other individuals  do not actively share your information need.  They already either (1) have the  information that you seek, and thus already have a satisfied information need, or (2) do not have the information you seek, but do not care, i.e. they do not share your information need (they aren&#8217;t going to move in with you, or go on that road trip with you).  When you ask the network, you are not actually involved in collaborative information <em>seeking</em>.  There is only a single seeker: You.  You are simply tapping into the network to  find those people who already have the information you need.  It is still the  single individual, not the network, that has the information need and  that is actively engaged in the seeking process.</p>
<p>But <em>embarking together</em> with one or two other  individuals who also lack information, i.e. engaging in explicitly  collaborative information seeking, is a entirely different process.  In this case, there are at least two information <em>seekers</em>, two people who have a shared, as-yet-unsatisfied, information need.  Now, there are a number of different ways you can build systems and design interfaces to support these multiple seekers in their task.  I&#8217;ve written a lot about such systems on this blog and on the <a href="http://palblog.fxpal.com/">FXPAL blog</a>, and will not go into it in further detail right now.  The point is simply that <em>embarking together</em> is an information seeking strategy that was not covered by any of the existing methods.  It is not the same as asking the network.  It is not the same as embarking alone.  It is a third process, a third strategy, and one that remains quite untapped in today&#8217;s marketplace.</p>
<p>Update: I have a final quick example.  On <a href="http://searchengineland.com/live-blogging-sxsw-social-search-a-little-help-from-my-friends-38086">his live blog, Danny Sullivan</a> paraphrases Max from Aardvark: &#8220;We want to do that across communication channels, so you can find  partners to go bike with&#8221;.  That&#8217;s Aardvarkian social search: You want to find the people to go biking with.  Collaborative search is the next phase.  Once you&#8217;ve found the people that you explicitly know that you want to go biking with, how do you find out where you want to go?  You know about all the bike trails around your house.  Your new biking partner knows about all the trails near her house.  But neither of you know about the trails that exist halfway between both of your houses.  Ideally, you&#8217;d like to find one of those trails that is good for both of you, because neither of you is aware of them.  (And why should you have been? Before meeting your partner, you had no reason to venture away from your favorite nearby trails.) THAT is explicitly collaborative information seeking.  When both of you actively look for new bike trails, that is embarking together.</p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2010/03/15/embark-together/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Don&#8217;t Forget Explicitly Collaborative Information Seeking</title>
		<link>http://irgupf.com/2010/03/15/dont-forget-explicitly-collaborative-information-seeking/</link>
		<comments>http://irgupf.com/2010/03/15/dont-forget-explicitly-collaborative-information-seeking/#comments</comments>
		<pubDate>Mon, 15 Mar 2010 17:35:37 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=1181</guid>
		<description><![CDATA[A panel on Social Search is happening at SXSW right now.  Reading Danny Sullivan&#8217;s liveblogging, I came across the panel&#8217;s definition of the three distinct types of social searching.  And I think they left one out:


Collective (gathering advice from a crowd)
Friend Filtered (using your friends)
Collaborative (asking a friend — see also our The  Rise [...]]]></description>
			<content:encoded><![CDATA[<p>A <a href="http://searchengineland.com/live-blogging-sxsw-social-search-a-little-help-from-my-friends-38086">panel on Social Search</a> is happening at SXSW right now.  Reading Danny Sullivan&#8217;s liveblogging, I came across the panel&#8217;s definition of the three distinct types of social searching.  And I think they left one out:</p>
<ul>
<blockquote>
<li>Collective (gathering advice from a crowd)</li>
<li>Friend Filtered (using your friends)</li>
<li>Collaborative (asking a friend — see also our <a href="http://searchengineland.com/the-rise-of-help-engines-16921">The  Rise Of Help Engines: Twitter &amp; Aardvark</a> article)</li>
</blockquote>
</ul>
<p>The version that was left out was the type of search in which you don&#8217;t just ask a friend for an answer (e.g. Twitter and Aardvark), but the type of search in which you actively engage with a specific person to work on on a jointly-shared information need.  For example, imagine a couple looking to rent an apartment.  It&#8217;s not like one person in the couple can ask the other one &#8220;where should we live?&#8221;  The point is that both people do not know.  And so you can imagine an information retrieval system that has, built in, the capability to be multi-searcher aware.  Both people can work on the same task at the same time.</p>
<p>This is not what Aardvark does.  This is not what Twitter does. This is also not friend-filtered; this is also not collective.  It is a fourth type, a distinction that seems to have been missed by the panel &#8212; search in which a small team of people actively work together, and the search system actively mediates between them, helping the group as a whole find information that no individual already knows, and that no individual would have easily found, had that person been working alone.   For more information on this oft-ignored area, please <a href="http://irgupf.com/2009/03/30/collaborative-information-seeking-ongoing-recap/">see our earlier series of posts</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2010/03/15/dont-forget-explicitly-collaborative-information-seeking/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Search versus Recommendation: Not The Only Tension</title>
		<link>http://irgupf.com/2010/01/05/more-tensions/</link>
		<comments>http://irgupf.com/2010/01/05/more-tensions/#comments</comments>
		<pubDate>Tue, 05 Jan 2010 11:14:59 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Exploratory Search]]></category>
		<category><![CDATA[Information Retrieval Foundations]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=1131</guid>
		<description><![CDATA[Greg Linden has an interesting post on Search on a domain like YouTube.  I reproduce it here because I would like to elaborate on it:
The article focuses on YouTube&#8217;s &#8220;plans to rely more heavily on personalization and ties between users to refine recommendations&#8221; and &#8220;suggesting videos that users may want to watch based on what [...]]]></description>
			<content:encoded><![CDATA[<p>Greg Linden has an interesting post on Search on a domain like YouTube.  I reproduce it here because I would like to elaborate on it:</p>
<blockquote><p>The article focuses on YouTube&#8217;s &#8220;plans to rely more heavily on personalization and ties between users to refine recommendations&#8221; and &#8220;suggesting videos that users may want to watch based on what they have watched before, or on what others with similar tastes have enjoyed.&#8221;  What is striking about this is how little this has to do with search. As described in the article, what YouTube needs to do is entertain people who are bored but do not entirely know what they want. YouTube wants to get from users spending &#8220;15 minutes a day on the site&#8221; closer to the &#8220;five hours in front of the television.&#8221; This is entertainment, not search. Passive discovery, playlists of content, deep classification hierarchies, well maintained catalogs, and recommendations of what to watch next will play a part; keyword search likely will play a lesser role.</p></blockquote>
<p>My feeling is that the dichotomy that is being drawn does not exhaustively cover the space.  I would characterize the space using the following two orthogonal dimensions: (1) Information Need Clarity and (2) User Engagement.  The first dimension (clarity) is related to the degree with which the user understands his or her own information need, i.e. has something specific in mind that he is looking for and/or understands what he needs to do to find it.  That need may either be well understood, or (<a href="http://blog.codalism.com/?p=984">to borrow Nick Belkin&#8217;s terminology</a>) &#8220;anomalous&#8221;: The user doesn&#8217;t know what he or she doesn&#8217;t know.  The second dimension is related to the level at which the user applies himself to the information seeking process.  That level may be active or passive.</p>
<p>Greg points out two modes: &#8220;<em>Active Understood</em>&#8221; (typical navigational web search) and &#8220;<em>Passive Anomalous</em>&#8221; (entertainment/discovery/recommendation).  But I believe that there are more than these two modes.  A large, interesting design space opens up when one realizes that information seeking can be &#8220;<em>Active Anomalous</em>&#8221; and &#8220;<em>Passive Understood</em>&#8220;.</p>
<p><img class="aligncenter size-full wp-image-1136" title="dimensions" src="http://irgupf.com/wp-content/uploads/2010/01/dimensions2.png" alt="dimensions" width="966" height="119" /></p>
<p><a href="http://en.wikipedia.org/wiki/Exploratory_search">Exploratory Search</a> is a good example of Active Anomalous seeking.  One doesn&#8217;t yet fully know or understand what it is that one is looking for, but at the same time one is willing to engage with an information system in order to discover what it is that he or she does not yet know.  And the system itself is designed not necessarily toward trying to answer a well understood need, but toward helping the user map out and better comprehend a space.</p>
<p>Collaborative Information Seeking (see <a href="http://irgupf.com/2009/03/30/collaborative-information-seeking-ongoing-recap/">here</a> and <a href="http://palblog.fxpal.com/?p=272">here</a> and <a href="http://workshops.fxpal.com/cscw2010cis/CFP.aspx">here</a>) is a good example of where an need may be well understood, but a user does not necessarily have to actively express every last query detail in order to get more information on a topic.  Why not?  Because when User #1 is explicitly collaborating with User #2, an algorithmic mediation engine can push some of User #2&#8217;s activity on to User #1 without requiring User #1 to make additional effort.  Note that I am not implying that every aspect of collaborative information seeking is passive; quite the contrary, as it requires at least one co-collaborator to be active.  I am only pointing out that it is a domain in which it becomes possible for a user to passively obtain specific information on a well understood need.</p>
<p>There is a lot discussion in the Information Retrieval Community on the similarities and differences between Search and Recommendation.  A fruitful tension opens up as one travels back and forth along the diagonal from Active Understood to Passive Anomalous; the two approaches often end up complementing each other.  Where I see much less discussion is on the tension that opens up along the other diagonal, between Passive Understood and Active Anomalous.  When Exploratory Search meets Collaborative Information Seeking, it yields <a href="http://www.fxpal.com/?p=abstract&amp;abstractID=460">Collaborative Exploratory Search</a> and a whole host of interesting possibilities.  Over the coming year I will be blogging more about the tension along this alternative diagonal (both here at on the<a href="http://palblog.fxpal.com/"> FXPAL blog</a>) and what it means for the Information Retrieval systems I and others are designing.  Happy 2010!</p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2010/01/05/more-tensions/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Exploration, Collaboration, and Open Government</title>
		<link>http://irgupf.com/2009/09/22/exploration-collaboration-and-open-government/</link>
		<comments>http://irgupf.com/2009/09/22/exploration-collaboration-and-open-government/#comments</comments>
		<pubDate>Tue, 22 Sep 2009 14:54:28 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Exploratory Search]]></category>
		<category><![CDATA[Information Retrieval Foundations]]></category>
		<category><![CDATA[Social Implications]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=943</guid>
		<description><![CDATA[What sort of information retrieval system would you build if you knew that all the users of your system would be expert or highly-motivated amateur searchers?  What sort of system would you build when you have a very large collection of unstructured information, and the goal in searching that information is not to find one [...]]]></description>
			<content:encoded><![CDATA[<p>What sort of information retrieval system would you build if you knew that all the users of your system would be expert or highly-motivated amateur searchers?  What sort of system would you build when you have a very large collection of unstructured information, and the goal in searching that information is not to find one document (e.g. navigate to a home page), but to find (a) relationships between documents, or (b) large sets of documents that all pertain to a single topic?  How would your algorithms be different?  How would your interfaces be difference?  How would the process itself (that middle layer in between algorithms and interfaces) be different?</p>
<p>Via <a href="http://thenoisychannel.com/2009/09/21/transparent-text-symposium-day-1/">Daniel Tunkelang&#8217;s recent post</a>, I think that Government information might be a perfect domain in which to ask (and answer) these sorts of questions.  The U.S. <a href="http://www.whitehouse.gov/open/">Open Government Initiative</a> has as its goal the release of loads of raw government data for use by any individual or organization.  How are people going to use this data?  What types of questions will they ask?  What types of questions could they ask, if given the proper tools (i.e. what might they not know that they want to ask, until it becomes possible?)</p>
<p>Two types of information retrieval might be perfect for this domain: Exploratory Search and (Explicitly) Collaborative Search.  <span id="more-943"></span>In exploratory search, the goal of your information seeking is to learn, discover, compare, contrast, etc. In explicitly collaborative search, your goal is to do something similar, but with another set of like-minded partners working with you on the same task/topic.  Each partner may have different expertise; one may be an expert in energy policy, another might understand trade and commerce, and another might have experience with the inner workings of Congress and understand how it works on a practical level.  If you put all these people together right now, the only way they can work together on a shared task is to search separately and then email each other their results.  What if, however, you could design a system that not only mediated between them on an interface level (immediate notification of marked documents and passages, shared highlighting of seen documents, etc.) but mediated between them on an algorithmic level as well?  Algorithmic mediation of the collaborative process would mean that the retrieval system itself has a hand in both combining and partitioning the inputs and actions of the search team members, as necessary.  They might then be able to find important, valuable information that none of the searchers, had they been working alone, could have.</p>
<p>It seems like an interesting domain, and one with real, potentially quite important consequences and societal implications.  It will be interesting to watch as this develops.</p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2009/09/22/exploration-collaboration-and-open-government/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>There is No Crowd</title>
		<link>http://irgupf.com/2009/09/17/there-is-no-crowd/</link>
		<comments>http://irgupf.com/2009/09/17/there-is-no-crowd/#comments</comments>
		<pubDate>Thu, 17 Sep 2009 22:15:00 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Information Retrieval Foundations]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=910</guid>
		<description><![CDATA[Via Xavier Amatriain: The Dirty Little Secret About the &#8220;Wisdom of the Crowds&#8221; &#8211; There is No Crowd:
This is hardly the first time that the so-called &#8220;wisdom of the crowds&#8221; has been called into question. The term, which implies that a diverse collection of individuals makes more accurate decisions and predications than individuals or even [...]]]></description>
			<content:encoded><![CDATA[<p>Via <a href="http://technocalifornia.blogspot.com/">Xavier Amatriain</a>: <a href="http://www.readwriteweb.com/archives/the_dirty_little_secret_about_the_wisdom_of_the_crowds.php">The Dirty Little Secret About the &#8220;Wisdom of the Crowds&#8221; &#8211; There is No Crowd:</a></p>
<blockquote><p>This is hardly the first time that the so-called &#8220;wisdom of the crowds&#8221; has been called into question. The term, which implies that a diverse collection of individuals makes more accurate decisions and predications than individuals or even experts, has been used in the past to describe how everything from Wikipedia to user-generated news sites like Digg.com offer better services than anything created by a smaller group could do.</p>
<p>Of course, we now know that simply isn&#8217;t true. For one thing, Wikipedia isn&#8217;t written and edited by the &#8220;crowd&#8221; at all. In fact, <a href="http://asc-parc.blogspot.com/2007/05/long-tail-and-power-law-graphs-of-user.html">1% of Wikipedia users are responsible for half of the site&#8217;s edits</a>. Even Wikipedia&#8217;s founder, Jimmy Wales, <a href="http://www.aaronsw.com/weblog/whowriteswikipedia">has been quoted as saying</a> that the site is really written by a community, &#8220;a dedicated group of a few hundred volunteers.&#8221;</p>
<p>&lt;snip&gt;</p>
<p>Still, there [has] yet to be a perfect solution to the problem. Perhaps it&#8217;s time we give up the idea that the &#8220;wisdom of the crowds&#8221; was ever a driving force behind any socialized, user-generated anything and realize that, just like in life, there will always be active participants as well as the passive passerbys.</p></blockquote>
<p>I have never quite liked the notion of &#8220;wisdom of crowds&#8221;, and the hype behind it even less, so I&#8221;m glad to see signs that the hype cycle is finally starting to wind down.  However, by having to confront exactly what it was that I didn&#8217;t like about the notion, I was intellectually forced to propose an alternative: Explicit Collaboration in Search.  <a href="http://irgupf.com/2009/04/21/dagstuhl-seminar-on-content-based-retrieval/">As I wrote half a year ago</a>:<span id="more-910"></span></p>
<blockquote><p>[In early 2006], I was having a visceral reaction against all the hype surrounding “collective intelligence” and “wisdom of the crowds” as a primary basis for doing information retrieval. I was not interested in collaboration as massive data crunching on top an anonymous crowd, nor even on top of a known crowd of friends.  I wasn’t interested in crowds of any kind. Rather, I thought (and still do) there is a lot more value left to be extracted from content-based search methods.</p>
<p>However, where these thoughts soon led me was to a notion of collaboration in search quite different from “wisdom of crowds” methods. With <a href="http://fxpal.com/?p=back">Maribeth Back</a> and <a href="http://fxpal.com/?p=gene">Gene Golovchinsky</a>, I envisioned collaboration as a sort of “musical jam session”, where a small set of common-goal searchers got together and “played” their search “tunes” <em>together</em> over a content-based retrieval back end.  The purpose of this jamming wasn’t to repeat each others’ notes (”people who play this note also play that note”) but to play melodies and baselines that were different, but that worked together toward the larger goal of creating a full “song”, a commonly constructed set of relevant information.  To date, this notion of collaboration in search has proven, and continues to be, <a href="http://www.elsevier.com/framework_products/promis_misc/ipmspecialissue.pdf">quite</a> <a href="http://www.fxpal.com/publications/FXPAL-PR-08-460.pdf">fruitful</a>.  There are all sort of research “melodies” left to be played, all sorts of songs left to be sung, by all sorts of researchers.  I continue to be excited about this notion of “search jamming”, and look forward to the solutions that the community will continue to invent.</p></blockquote>
<p>I have not spend too much time blogging about this ongoing collaborative information seeking research on IRGupf.  Most of what (Gene and) I have written on that topic appears on the <a href="http://palblog.fxpal.com/?cat=22">FXPAL Blog</a> (see [<a href="http://palblog.fxpal.com/?p=249">1</a>][<a href="http://palblog.fxpal.com/?p=272">2</a>][<a href="http://palblog.fxpal.com/?p=350">3</a>][<a href="http://palblog.fxpal.com/?p=267">4</a>][<a href="http://palblog.fxpal.com/?p=529">5</a>][<a href="http://palblog.fxpal.com/?p=274">6</a>][<a href="http://palblog.fxpal.com/?p=103">7</a>]).  Still, I found the aforementioned &#8220;There is no Crowd&#8221; post interesting enough to warrant a brief mention here.  I find Perez&#8217;s (the author&#8217;s) conclusion compelling: <em>Perhaps it&#8217;s time we give up the idea that the &#8220;wisdom of the crowds&#8221; was ever a driving force behind any socialized, user-generated anything</em>.  If you want to put the &#8220;social&#8221; into your information seeking systems and algorithms, then it makes sense to really make it social, i.e. there should be explicit interaction with a known person, rather than implicit, passive behavioral aggregation.</p>
<p>[Note that explicit collaboration in information seeking is only one of many possible alternatives to the "wisdom of the crowd".  <a href="http://technocalifornia.blogspot.com/">Xavier</a> proposes a different solution in a <a href="http://technocalifornia.blogspot.com/2009/05/wisdom-of-few.html">paper published at SIGIR 2009</a>.]</p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2009/09/17/there-is-no-crowd/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>A Bird in the Hand&#8230;</title>
		<link>http://irgupf.com/2009/08/24/a-bird-in-the-hand/</link>
		<comments>http://irgupf.com/2009/08/24/a-bird-in-the-hand/#comments</comments>
		<pubDate>Mon, 24 Aug 2009 13:03:42 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Information Retrieval Foundations]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=83</guid>
		<description><![CDATA[As a researcher, I have more questions than answers.  And one of the questions that I have is in regards to the widely-accepted maxim that users are too lazy to give explicit relevance feedback to the search engine.  See Danny Sullivan&#8217;s take, here.
Perhaps I am stuck back in a view of Information Retrieval that is [...]]]></description>
			<content:encoded><![CDATA[<p>As a researcher, I have more questions than answers.  And one of the questions that I have is in regards to the widely-accepted maxim that users are too lazy to give explicit relevance feedback to the search engine.  <a href="http://daggle.com/why-search-sucks-you-wont-fix-it-the-way-you-think-203">See Danny Sullivan&#8217;s take, here</a>.</p>
<p>Perhaps I am stuck back in a view of Information Retrieval that is 10-15 years old, but I tend to find my views heavily shaped or influenced by things like the following bit from<a href="http://people.ischool.berkeley.edu/~hearst/irbook/10/node8.html"> Marti Hearst&#8217;s chapter</a> in <a href="http://people.ischool.berkeley.edu/~hearst/irbook/">Modern Information Retrieval</a>:</p>
<blockquote><p>An important part of the information access process is query reformulation, and a proven effective technique for query reformulation is <em>relevance feedback</em>.  In its original form, relevance feedback refers to an interaction cycle in which the user selects a small set of documents that appear to be relevant to the query, and the system then uses features derived from these selected relevant documents to revise the original query.  This revised query is then executed and a new set of documents is returned.  Documents from the original set can appear in the new results list, although they are likely to appear in a different rank order.  Relevance feedback in its original form has been shown to be an effective mechanism for improving retrieval results in a variety of studies and settings [<a href="http://people.ischool.berkeley.edu/%7Ehearst/irbook/biblio.html#salton90a">salton90a</a>][<a href="http://people.ischool.berkeley.edu/%7Ehearst/irbook/biblio.html#harman92c">harman92c</a>][<a href="http://people.ischool.berkeley.edu/%7Ehearst/irbook/biblio.html#buckley94b">buckley94b</a>].  In recent years the scope of ideas that can be classified under this term has widened greatly.</p></blockquote>
<p>Given that explicit relevance feedback works, why is it essentially non-existent on the web?  A bird in the hand (an explicit relevance judgment) is worth two in the bush (two implied or inferred relevance judgments).  <span id="more-83"></span>Danny Sullivan (above) argues against explicit relevance interaction by pointing to a bunch of complicated, non-intuitive interfaces and laughing;  nothing beats a simple interface, he says.  But clicking thumbs up or down on information that you do or do not find relevant is not complicated.  <a href="http://pandora.com">Pandora</a> uses it all the time in the music information retrieval domain.  By itself, that shows that users are not too lazy to use such a tool.  So why do we not have it in the web information retrieval domain?  Hearst continues:</p>
<blockquote><p>Relevance feedback introduces important design choices, including which operations should be performed automatically by the system and which should be user initiated and controlled.  Bates discusses this issue in detail [<a href="http://people.ischool.berkeley.edu/%7Ehearst/irbook/biblio.html#bates90b">bates90b</a>], asserting that despite the emphasis in modern systems to try to automate the entire process, an intermediate approach in which the system helps automate search at a <em>strategic</em> level is preferable.  Bates suggests an analogy of an automatic camera versus one with adjustable lenses and shutter speeds. On many occasions, a quick, easy method that requires little training or thought is appropriate.  At other times the user needs  more control over the operation of the machinery, while still not wanting to know about the low level details of its operation.</p></blockquote>
<p>I see the same thing today that Bates saw twenty years ago: Modern systems are trying to automate the entire process.  You often don&#8217;t even know that you&#8217;re doing relevance feedback. Full automation may be helpful in certain specific scenarios, in that it might save me a click or two, it also is not transparent enough for me to correct when it goes wrong.  Bates concludes, and I agree, that an intermediate, strategic approach is better.  The system should handle the low-level details, but not remove control of the strategic process from the hands of the user.</p>
<p>One of the reasons I am interested in this topic is related to my work in collaborative information seeking (algorithmic mediation of the explicitly collaborative process).  One of the challenges that we face in designing not only interfaces, but the underlying strategic roles that will be adopted by collaborating searchers, is how much of the process to make explicit and how much to leave implicit.  Certainly we do not want to require that collaborators do everything themselves.  This is the current state-of-the-art, in which the only way to collaborate on the web is to email links back and forth.  This is clearly a non-optimal solution.  But on the other hand, it may also cause the collaborators a good deal of frustration if the system attempts to do everything for them.  There has to be a balance.</p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2009/08/24/a-bird-in-the-hand/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Dagstuhl Seminar on Content-Based Retrieval</title>
		<link>http://irgupf.com/2009/04/21/dagstuhl-seminar-on-content-based-retrieval/</link>
		<comments>http://irgupf.com/2009/04/21/dagstuhl-seminar-on-content-based-retrieval/#comments</comments>
		<pubDate>Wed, 22 Apr 2009 00:48:52 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Information Retrieval Foundations]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=513</guid>
		<description><![CDATA[As a researcher, it is occasionally quite interesting to reread thoughts and positions that I&#8217;ve taken in years and works past. Sometimes I can observe a marked shift from my previous thinking; avenues or approaches that I once considered fruitful I now no longer do. And sometimes I can observe hints and seeds of my [...]]]></description>
			<content:encoded><![CDATA[<p>As a researcher, it is occasionally quite interesting to reread thoughts and positions that I&#8217;ve taken in years and works past. Sometimes I can observe a marked shift from my previous thinking; avenues or approaches that I once considered fruitful I now no longer do. And sometimes I can observe hints and seeds of my current research; avenues of which I only had a vague inkling have blossomed into larger pursuits.</p>
<p>In April of 2006 I had the good fortune to attend a <a href="http://www.dagstuhl.de/en/program/calendar/semhp/?semnr=06171">Dagstuhl Seminar on Content-Based Multimedia Information Retrieval</a> (I am toward the upper left corner of the <a href="http://www.dagstuhl.de/Gruppenbilder/06171.A.B.JPG">seminar group photo</a>).  Ramesh Jain has a <a href="http://ngs.ics.uci.edu/blog/?p=227">good writeup of Dagstuhl Seminars</a>, what they are and how they work.  In the <a href="http://www.dagstuhl.de/Materials/index.en.phtml?06171">abstract of my Seminar presentation</a> I wrote:</p>
<p><span id="more-513"></span></p>
<blockquote><p>The classic problem of ad hoc information retrieval involves a user with an information need, a representation or expression of that information need (the query), and a system or retrieval engine that compares the query against a collection of items in order to return the most relevant items to the user information need. Despite numerous and obvious exceptions, in general text information retrieval has a fairly high correlation between the syntax of a query as expressed in language and the semantics of the information need. Textual similarity is highly correlated with relevance. On the other hand, in content-based multimedia retrieval (images, video, music, 3d models), objects encompass multitudinous semantics in many different dimensions. In music for example there are properties of pitch, tempo, rhythm, timbre, singer characteristics, genre, instrumentation, year of production, and so on. The correlation between similarity and relevance is much lower. Two music pieces might be similar because they both use similar instruments, timbres, tempos and singers, but they are not necessary both relevant to my information need if I am looking for waltzes, and one piece is in 3/4 and the other in 4/4.</p>
<p>The current popular solution to this problem, characterized by buzzwords such as &#8220;collective intelligence&#8221;, &#8220;wisdom of crowds&#8221; and &#8220;Web 2.0&#8243;, is to bypass content altogether. By instead aggregating the media interactions (playlists, tags, click behavior, etc.) of massive numbers of people, the collective intelligence approach hopes to be able to determine relevance directly, without the need for content-based methods. If people are not only the ultimate consumers, but also the ultimate producers of relevance, why waste any effort on a problem as difficult as content-based retrieval? In our presentation we reject this notion of complete reliance on collective intelligence methods and argue that content-based methods are necessary. Aggregate crowd relevance information may be able to tell us what should be retrieved, but it still will not tell us why something was retrieved. For that, we still need to rely on the explanatory power of content. Therefore, we propose the &#8220;cognitive disclosure&#8221; paradigm, in which semantic representations are chosen a priori by designers of a content retrieval system, i.e. content-features necessary to call a piece of music a &#8220;waltz&#8221;, or to call an image a &#8220;landscape&#8221;. These semantic categories are then revealed to users at retrieval time, to allow them more intelligent selection of the types of information that is relevant to them.<span> </span>This problem is still very difficult and there are no easy solutions.<span> </span>However, our purpose is simply to explain why &#8220;wisdom of crowds&#8221; approaches will inevitably fall short, and content-based methods are still going to be necessary.</p></blockquote>
<p>At the time, I was having a visceral reaction against all the hype surrounding &#8220;collective intelligence&#8221; and &#8220;wisdom of the crowds&#8221; as a primary basis for doing information retrieval. I was not interested in collaboration as massive data crunching on top an anonymous crowd, nor even on top of a known crowd of friends.  I wasn&#8217;t interested in crowds of any kind. Rather, I thought (and still do) there is a lot more value left to be extracted from content-based search methods.</p>
<p>However, where these thoughts soon led me was to a notion of collaboration in search quite different from &#8220;wisdom of crowds&#8221; methods. With <a href="http://fxpal.com/?p=back">Maribeth Back</a> and <a href="http://fxpal.com/?p=gene">Gene Golovchinsky</a>, I envisioned collaboration as a sort of &#8220;musical jam session&#8221;, where a small set of common-goal searchers got together and &#8220;played&#8221; their search &#8220;tunes&#8221; <em>together</em> over a content-based retrieval back end.  The purpose of this jamming wasn&#8217;t to repeat each others&#8217; notes (&#8221;people who play this note also play that note&#8221;) but to play melodies and baselines that were different, but that worked together toward the larger goal of creating a full &#8220;song&#8221;, a commonly constructed set of relevant information.  To date, this notion of collaboration in search has proven, and continues to be, <a href="http://www.elsevier.com/framework_products/promis_misc/ipmspecialissue.pdf">quite</a> <a href="http://www.fxpal.com/publications/FXPAL-PR-08-460.pdf">fruitful</a>.  There are all sort of research &#8220;melodies&#8221; left to be played, all sorts of songs left to be sung, by all sorts of researchers.  I continue to be excited about this notion of &#8220;search jamming&#8221;, and look forward to the solutions that the community will continue to invent.</p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2009/04/21/dagstuhl-seminar-on-content-based-retrieval/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Collaborative Information Seeking (Ongoing Recap)</title>
		<link>http://irgupf.com/2009/03/30/collaborative-information-seeking-ongoing-recap/</link>
		<comments>http://irgupf.com/2009/03/30/collaborative-information-seeking-ongoing-recap/#comments</comments>
		<pubDate>Mon, 30 Mar 2009 21:26:35 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=381</guid>
		<description><![CDATA[Now seems as good a time as any to post a quick recap of the series of collaborative information seeking posts that Gene and I have been writing over on Palblog.  We&#8217;re about halfway through the series.

Communicating about Collaboration
Communicating about Collaboration: Intent
Communicating about Collaboration: Synchronization
Social Search
Social Search Redux

I will post another recap once we finish [...]]]></description>
			<content:encoded><![CDATA[<p>Now seems as good a time as any to post a quick recap of the series of collaborative information seeking posts that Gene and I have been writing over on Palblog.  We&#8217;re about halfway through the series.</p>
<ul>
<li><a href="http://palblog.fxpal.com/?p=249">Communicating about Collaboration</a></li>
<li><a href="http://palblog.fxpal.com/?p=272">Communicating about Collaboration: Intent</a></li>
<li><a href="http://palblog.fxpal.com/?p=267">Communicating about Collaboration: Synchronization</a></li>
<li><a href="http://palblog.fxpal.com/?p=350">Social Search</a></li>
<li><a href="http://palblog.fxpal.com/?p=529">Social Search Redux</a></li>
</ul>
<p>I will post another recap once we finish the series.</p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2009/03/30/collaborative-information-seeking-ongoing-recap/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Social ?= Collaborative</title>
		<link>http://irgupf.com/2009/03/20/social-search-vs-collaborative-search/</link>
		<comments>http://irgupf.com/2009/03/20/social-search-vs-collaborative-search/#comments</comments>
		<pubDate>Fri, 20 Mar 2009 17:31:44 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Social Implications]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=280</guid>
		<description><![CDATA[There is an an interesting comment thread happening over on the FXPAL blog, about the differences between social search and collaborative search:
http://palblog.fxpal.com/?p=350#comments
]]></description>
			<content:encoded><![CDATA[<p>There is an an interesting comment thread happening over on the FXPAL blog, about the differences between social search and collaborative search:</p>
<p><a href="http://palblog.fxpal.com/?p=350#comments">http://palblog.fxpal.com/?p=350#comments</a></p>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2009/03/20/social-search-vs-collaborative-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Collaborative Information Seeking: Intent</title>
		<link>http://irgupf.com/2009/03/18/collaborative-information-seeking-intent/</link>
		<comments>http://irgupf.com/2009/03/18/collaborative-information-seeking-intent/#comments</comments>
		<pubDate>Wed, 18 Mar 2009 18:18:05 +0000</pubDate>
		<dc:creator>jeremy</dc:creator>
				<category><![CDATA[Collaborative Information Seeking]]></category>
		<category><![CDATA[Information Retrieval Foundations]]></category>

		<guid isPermaLink="false">http://irgupf.com/?p=232</guid>
		<description><![CDATA[Over on the FXPAL blog, Gene and I added the 2nd of a series of posts mapping out the collaborative information seeking systems domain.  Here is an excerpt:
When we view collaboration from the IR perspective, it becomes clear that collaboration does not merely refer to a decision to work together.  Rather, it refers to a [...]]]></description>
			<content:encoded><![CDATA[<p>Over on the FXPAL blog, Gene and I added the <a href="http://palblog.fxpal.com/?p=272">2nd of a series of posts</a> mapping out the collaborative information seeking systems domain.  Here is an excerpt:</p>
<blockquote><p>When we view collaboration from the IR perspective, it becomes clear that collaboration does not merely refer to a decision to work together.  Rather, it refers to a decision to work together <em>on an information need</em>.  Users who have actively and consciously decided to work with each other on a specific information need are <em>explicitly</em>(intentionally) engaged in collaborative information seeking.  A user whose information seeking activities are influenced (for the better, of course) by others, but without the user’s active involvement, is <em>implicitly</em> or non-intentionally collaborating with others.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://irgupf.com/2009/03/18/collaborative-information-seeking-intent/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
