<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>ia play &#187; search</title>
	<atom:link href="http://www.iaplay.com/category/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.iaplay.com</link>
	<description>the good life in a digital age</description>
	<lastBuildDate>Wed, 21 Dec 2011 11:13:02 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.2</generator>
		<item>
		<title>Giving the user choice over &#8220;Did you mean?&#8221;</title>
		<link>http://www.iaplay.com/2011/02/03/giving-the-user-choice-over-did-you-mean/</link>
		<comments>http://www.iaplay.com/2011/02/03/giving-the-user-choice-over-did-you-mean/#comments</comments>
		<pubDate>Thu, 03 Feb 2011 13:21:24 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=1487</guid>
		<description><![CDATA[Be really really wary of expanding the users queries without telling them. Don&#8217;t just give them results for aubergine and results for eggplant, when they only searched for Aubergine. You think you are being clever and helpful. If you&#8217;re wrong about the expansion then you are just being extremely irritating. Either: a) Suggest the expansion [...]]]></description>
			<content:encoded><![CDATA[<div>Be really really wary of expanding the users queries without telling them. Don&#8217;t just give them results for aubergine and results for eggplant, when they only searched for Aubergine. You think you are being clever and helpful. If you&#8217;re wrong about the expansion then you are just being extremely irritating.</p>
<p>Either:<br />
a) Suggest the expansion but don&#8217;t run it for them. Risks them missing it.<br />
b) Run the expansion but tell them you&#8217;ve done it. Still risks them missing it.</p>
<p>Google&#8217;s experimenting with both approaches over the years. And currently has a bit of a mixed approach. Don&#8217;t assume their approach has &#8220;cracked&#8221; the problem.</p></div>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2011/02/03/giving-the-user-choice-over-did-you-mean/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>search: which features actually help?</title>
		<link>http://www.iaplay.com/2011/02/01/search-which-features-actually-help/</link>
		<comments>http://www.iaplay.com/2011/02/01/search-which-features-actually-help/#comments</comments>
		<pubDate>Tue, 01 Feb 2011 05:47:41 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=1247</guid>
		<description><![CDATA[1. Ranking This is the least visible thing, that you might not consider a feature, that mostly gets ignored and is absolutely the most important thing for you to dedicate time to getting right. If the query isn&#8217;t particularly ambiguous then you need the top results to be right, without asking the searcher to do [...]]]></description>
			<content:encoded><![CDATA[<h2>1. Ranking</span></h2>
<p>This is the least visible thing, that you might not consider a feature, that mostly gets ignored and is absolutely the most important thing for you to dedicate time to getting right.</p>
<p>If the query isn&#8217;t particularly ambiguous then you need the top results to be right, without asking the searcher to do much else.</p>
<p>Ranking isn&#8217;t sexy and it takes care and attention. But isn&#8217;t magic, it&#8217;s just rules. Ask what the rules are. Don&#8217;t be fobbed off. If no-one knows, work it out yourself.</p>
<h2>2. Manual Suggestions (query expansion/narrowing)</h2>
<p>This basically means Best Bets.</p>
<p>I&#8217;m very, very attached to Best Bets. This is mostly because I&#8217;ve been a search product manager as well as an IA on search re-design projects. Once the project team has packed up, the product manager (or web manager/editor) can still improve results and resolve problems using Best Bets. And they will need to. Promise.</p>
<h2>3. Automated Suggestions (query expansion/narrowing)</h2>
<p>We can&#8217;t spell and we can&#8217;t type. And then we blame the poor old search engine when it doesn&#8217;t find what we were looking for.</p>
<p>Any decent search solution needs to have some solution to misspellings (where to put them is a problem for another day!). You can do some of this with Best Bets, but with a big and diverse enough set of users you&#8217;ll probably need something a bit more automatic like Google&#8217;s Did You Mean?</p>
<p>A related but broader concept is suggesting related searches. You might have spelt your query correctly but there&#8217;s a similar term that would get you better results. Ask.com used to do this.</p>
<p>It might seem perverse to prioritise the manual intervention over the automated one. I&#8217;d usually expect to have both but I have a few reasons for picking manual if it comes to a choice:</p>
<ul>
<li>the manual option is probably cheaper to add on if neither comes as standard</li>
<li>automated suggestions often get better over time but might start a bit ropy</li>
<li>automated suggestions may be &#8216;black-box&#8217; you might not be able to do anything with them if they are wrong/misleading. And every system I&#8217;ve worked with and/or used makes mistakes sometimes.</li>
</ul>
<p>It&#8217;s worth asking whether there is any control over the automated suggestions. Is there a dictionary? Is the right language (esp. UK v US English)? Can we edit it? How?</p>
<h2>4. Filters and sort options (after you got search results)</h2>
<p>These tend to get missed by users or interfere with their understanding of the page. Not all users will understand them, especially complex faceted filters. The positioning of filters/facets is very difficult to get right. Users home in on the top results, so above the first result is most likely to get noticed and also most likely to get noticed for being in an annoying position.</p>
<p>If you are doing product search then I&#8217;d probably still prioritise 1-3 but I&#8217;d strongly argue you need 4 as well.</p>
<h2>5. Clever query language</h2>
<p>Quote marks seem to be reasonably widely understood, so I might argue these should be higher up your expectation list.</p>
<p>But unless you&#8217;ll have access to your users and be able to train them all… I wouldn&#8217;t prioritise operators like wildcards, NOT/And/Or etc..</p>
<p>Find out what you get out of the box. Make that information available to interested users. But don&#8217;t invest lots of development effort and money here.</p>
<h2>6. Filters and sort options (before you run the search)</h2>
<p>a) Radio buttons and drop-downs.  These get missed, people don&#8217;t think about using them, they tend to just stick words in and hit go. Other users won&#8217;t use them because they don&#8217;t know they need to use them until they see the search results aren&#8217;t focused enough. So then they have to go backwards. So you might as well go with (4).</p>
<p>If you can sensibly default them then they can be more useful but establishing what the sensible default  is problematic.</p>
<p>b) Advanced search pages.<br />
These are basically a collection of filters for the user to set before you run the search. Search specialists inevitably find advanced search useful but your average end-user doesn&#8217;t. The exception here is power users  but be sure the users actually are &#8220;power&#8221; users.  You are likely to find power users where there are time/cost pressures around searching e.g. staff answering customer calls or researchers using databases where they pay for searches. In these situations even reasonably techno-phobic users are motivated to get to grips with advanced searches including some of the more complex query building ones. </p>
<p>Another reason advanced search might be worthwhile is if your power users are also your most mouthy. If the segment of your audience that blogs/tweets is also the segment that might demand power features then you might consider the feature as marketing. </p>
<p>(Don&#8217;t be worried by people being intimated by the label &#8220;advanced&#8221;.  If they are intimated by the word then they&#8217;ll be intimated by the features. )</p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2011/02/01/search-which-features-actually-help/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Best Bets in SharePoint</title>
		<link>http://www.iaplay.com/2011/01/31/best-bets-in-sharepoint/</link>
		<comments>http://www.iaplay.com/2011/01/31/best-bets-in-sharepoint/#comments</comments>
		<pubDate>Mon, 31 Jan 2011 05:38:26 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[search]]></category>
		<category><![CDATA[sharepoint]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=1423</guid>
		<description><![CDATA[SharePoint search allows you to create Best Bets. They can be created by the Site Collection administrator. If you go to Site Settings, you should see &#8216;Search Keywords&#8217; under the Site Collection Administration heading.  If you don&#8217;t see it you probably haven&#8217;t got the right permissions. You create a keyword, associate some synonyms with it [...]]]></description>
			<content:encoded><![CDATA[<p>SharePoint search allows you to create <a href="http://www.iaplay.com/2009/07/27/best-bets-functionality-for-search-systems/">Best Bets</a>. They can be created by the Site Collection administrator.</p>
<p>If you go to Site Settings, you should see &#8216;Search Keywords&#8217; under the Site Collection Administration heading.  If you don&#8217;t see it you probably haven&#8217;t got the right permissions.</p>
<p>You create a keyword, associate some synonyms with it and then add one or more Best Bet links. You can set it to expire and/or be reviewed.</p>
<p><strong>Keyword:</strong> The search term that will generate the Best Bets and also is displayed above the Best Bet e.g. PenFriend</p>
<p><strong>Synonym:</strong> Other search terms that will also generate the Best Bet. These aren&#8217;t displayed e.g. Pen Friend</p>
<p><strong>Best Bets:</strong> The editorially picked search result e.g. <a href="http://www.rnib.org.uk/shop/Pages/ProductDetails.aspx?cat=RNIBShopBaseCatalog&amp;category=labelling_penfriend&amp;productID=DL7601">Penfriend Audio Labeller</a></p>
<p>I can&#8217;t for the life of me figure out how to delete a keyword (Best Bet, yes. Keyword, no). Maybe it&#8217;s a permission thing again.</p>
<div style="width: 425px; text-align: left;"><a style="font: 14px Helvetica,Arial,Sans-serif; color: #0000cc; display: block; margin: 12px 0 3px 0; text-decoration: underline;" title="SharePoint Best Bets screenshots" href="http://www.slideboom.com/presentations/291602/SharePoint-Best-Bets-screenshots">SharePoint Best Bets screenshots</a><object id="onlinePlayer291602" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="370" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowScriptAccess" value="always" /><param name="quality" value="high" /><param name="bgcolor" value="#ffffff" /><param name="allowFullScreen" value="true" /><param name="src" value="http://www.slideboom.com/player/player.swf?id_resource=291602" /><param name="name" value="onlinePlayer291602" /><param name="allowfullscreen" value="true" /><embed id="onlinePlayer291602" type="application/x-shockwave-flash" width="425" height="370" src="http://www.slideboom.com/player/player.swf?id_resource=291602" name="onlinePlayer291602" allowfullscreen="true" bgcolor="#ffffff" quality="high" allowscriptaccess="always"></embed></object></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2011/01/31/best-bets-in-sharepoint/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>e-commerce: google keywords</title>
		<link>http://www.iaplay.com/2011/01/28/e-commerce-google-keywords/</link>
		<comments>http://www.iaplay.com/2011/01/28/e-commerce-google-keywords/#comments</comments>
		<pubDate>Fri, 28 Jan 2011 17:15:55 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[e-commerce]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=578</guid>
		<description><![CDATA[This article is part of a series about our e-commerce redesign. Analysing your search referrals only tells you about the traffic you were successful in attracting. Even if you are getting lots of traffic for a particular keyword that might be a tiny fraction of the number of people searching for that keyword. And the referrers [...]]]></description>
			<content:encoded><![CDATA[<p><em>This article is part of a series about our <a href="../2009/06/01/charity-e-commerce-project/">e-commerce redesign</a>.</em></p>
<p>Analysing your search referrals only tells you about the traffic you were successful in attracting. Even if you are getting lots of traffic for a particular keyword that might be a tiny fraction of the number of people searching for that keyword. And the referrers says nothing about what you missed out on completely.</p>
<p>So it helps to look at search engine traffic for keywords in the kind of space your website sits in. The free tools like <a href="https://adwords.google.co.uk/select/KeywordToolExternal">Google AdWords keyword tool</a> have generated lots of debate about how useful they are but I tend to see them as worth a look if you&#8217;re just looking for rough ideas about language and relative popularity.</p>
<p>With our shop research, I didn&#8217;t get much data for easy to see, easy to read, giant print, big print, canes, liquid level indicators, and (my favourite) bumpons. I couldn&#8217;t find information about Moon (the alphabet) because it was drowned by references to the satellite and all the other things called moon.</p>
<p>What I&#8217;ve learnt:</p>
<p>Generally people refer to concrete properties of the product rather than their condition. So it is &#8216;big button phone&#8217; rather than &#8216;easy to see phone&#8217; or &#8216;low vision phone&#8217;.</p>
<p>Singular is much more important than plural for objects like clocks and watches but the opposite is true for book formats e.g large print books. Which is kind of obvious&#8230;you only want one watch but you may want many books. This might have a bit of effect on our labelling policy, but not much as Google doesn&#8217;t seem to make a huge deal about singular verus plural.</p>
<p>There&#8217;s clearly a big opportunity around low vision products. The interest in products for blind people (like Braille) is less significant, which makes perfect sense when you compare the size of the audiences.</p>
<p>And loads of people are interested in magnifiers.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2011/01/28/e-commerce-google-keywords/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SharePoint search administration via SSP</title>
		<link>http://www.iaplay.com/2011/01/28/sharepoint-search-administration-via-ssp/</link>
		<comments>http://www.iaplay.com/2011/01/28/sharepoint-search-administration-via-ssp/#comments</comments>
		<pubDate>Fri, 28 Jan 2011 12:20:34 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[search]]></category>
		<category><![CDATA[sharepoint]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=1345</guid>
		<description><![CDATA[SharePoint search features are managed at 3 levels Farm level (configure the search service, configure crawler timeout settings etc&#8230;) SSP (Shared Services Provider) level Site collection level The SSP functions are accessed via the Shared Services Administration. SSP search functions: add sources to the crawl block URLs and URL patterns from the crawl define crawl schedules [...]]]></description>
			<content:encoded><![CDATA[<p>SharePoint search features are managed at 3 levels</p>
<ol>
<li>Farm level (configure the search service, configure crawler timeout settings etc&#8230;)</li>
<li>SSP (Shared Services Provider) level</li>
<li>Site collection level</li>
</ol>
<p>The SSP functions are accessed via the Shared Services Administration.</p>
<p>SSP search functions:</p>
<ul>
<li>add sources to the crawl</li>
<li>block URLs and URL patterns from the crawl</li>
<li>define crawl schedules</li>
<li>inspect crawl logs and troubleshoot crawls</li>
<li>emergency removal of items</li>
<li>install IFilters to support non-default file types</li>
<li>add/remove file types from the crawl</li>
<li>specify authoritative pages</li>
<li>create scopes for all site collections (you can also create at a site collection level)</li>
</ul>
<p>And in theory specify noise words and create a custom thesaurus.  See <a href="http://www.amazon.co.uk/gp/product/0735625352?ie=UTF8&amp;tag=ip04-21&amp;linkCode=as2&amp;camp=1634&amp;creative=19450&amp;creativeASIN=0735625352">Inside the Index and Search Engines</a><img style="border: none !important; margin: 0px !important;" src="http://www.assoc-amazon.co.uk/e/ir?t=ip04-21&amp;l=as2&amp;o=2&amp;a=0735625352" border="0" alt="" width="1" height="1" /><br />
chapter 5 for more.</p>
<p>You can by default index these types of content source:</p>
<ul>
<li>SharePoint sites</li>
<li>Non-SharePoint websites</li>
<li>Windows file shares</li>
<li>Microsoft Exchange Server public folders (you can index exchange mailboxes with a 3rd party add-on)</li>
</ul>
<p>Crawl management:</p>
<ul>
<li>Full crawl: indexes all content</li>
<li>Incremental crawl: only accesses content that has been updated since last crawl. Faster, but slow if  accessing an external website</li>
<li>Crawl schedules can be specified for each content source</li>
<li>Crawls should be scheduled for low usage times</li>
</ul>
<p>Crawl rules</p>
<ul>
<li>content can be excluded by defining a rule</li>
<li>rules are applied in the specified order so you usually need to move exclude rules in front of include rules.</li>
<li>a URL can be excluded by adding it as an exclude rule</li>
<li>URL patterns can also be excluded and help keep the management of rules neat e.g. http://www.bbc.co.uk/* or http://www.amazon.co.uk/*/dp/*</li>
<li>Exclude rules will remove any matched URLs during the next crawl</li>
<li>If you need to remove a URL in an emergency you do this via &#8220;Search Result Removal&#8221; instead</li>
</ul>
<div style="width: 425px; text-align: left;"><a style="font: 14px Helvetica,Arial,Sans-serif; color: #0000cc; display: block; margin: 12px 0 3px 0; text-decoration: underline;" title="Sharepoint search admin" href="http://www.slideboom.com/presentations/242060/Sharepoint-search-admin">Sharepoint search admin screens</a><object id="onlinePlayer242060" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="370" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowScriptAccess" value="always" /><param name="quality" value="high" /><param name="bgcolor" value="#ffffff" /><param name="allowFullScreen" value="true" /><param name="src" value="http://www.slideboom.com/player/player.swf?id_resource=242060" /><param name="name" value="onlinePlayer242060" /><param name="allowfullscreen" value="true" /><embed id="onlinePlayer242060" type="application/x-shockwave-flash" width="425" height="370" src="http://www.slideboom.com/player/player.swf?id_resource=242060" name="onlinePlayer242060" allowfullscreen="true" bgcolor="#ffffff" quality="high" allowscriptaccess="always"></embed></object></div>
<p>Resources elsewhere:<br />
<a href="http://blogs.technet.com/b/askcore/archive/2009/05/11/introduction-to-sharepoint-search-indexes-for-dpm-administrators.aspx">Introduction to SharePoint Search Indexes for DPM Administrators</a><br />
<a href="http://technet.microsoft.com/en-us/library/cc303425(office.12).aspx"> Enterprise Search administration</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2011/01/28/sharepoint-search-administration-via-ssp/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>why your search engine (probably) isn&#8217;t rubbish</title>
		<link>http://www.iaplay.com/2010/03/05/why-your-search-engine-probably-isnt-rubbish/</link>
		<comments>http://www.iaplay.com/2010/03/05/why-your-search-engine-probably-isnt-rubbish/#comments</comments>
		<pubDate>Fri, 05 Mar 2010 05:04:26 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=1156</guid>
		<description><![CDATA[Now all search engines struggle,  to varying degrees,  with the knotty mess that is natural language. But they don&#8217;t generally don&#8217;t get called rubbish for not succeeding with the meaty search challenges. Rubbish search engines are the ones that can&#8217;t seem to answer the most basic requests in a sensible manner. These are ones that [...]]]></description>
			<content:encoded><![CDATA[<p>Now all search engines struggle,  to varying degrees,  with the knotty mess that is natural language. But they don&#8217;t generally don&#8217;t get called rubbish for not succeeding with the meaty search challenges.</p>
<p>Rubbish search engines are the ones that can&#8217;t seem to answer the most basic requests in a sensible manner. These are ones that get mocked as &#8220;random link generators&#8221;, the jibbering wrecks of their breed.</p>
<p>Go to  <a href="http://www.homebase.co.uk">Homebase</a> and search for &#8220;rabbit hutch&#8221; (we need another one as two of our girls are about to produce heaps of bunnies at the same time).</p>
<p>The first result is &#8220;Small plastic pet carrier&#8221;. There&#8217;s a number of other carriers and cages. Then there&#8217;s a &#8220;Beech Finish Small Corner Desk with Hutch&#8221;. Finally there&#8217;s a Pentland Rabbit Hutch at result no #8.  This is a rubbish set of results. I asked for &#8220;rabbit hutch&#8221; and they&#8217;ve got a rabbit hutch to sell me but they&#8217;re showing me pet carriers and beech finish corner desks.</p>
<p>This is a rubbish set of results. But it doesn&#8217;t mean the search engine is rubbish.</p>
<p>Somebody made a rubbish decision. They&#8217;ve set it up shonky.</p>
<p>So before you reach for the million pound enterprise search project, try having a quick look under the bonnet with a spanner.</p>
<h2>Is it AND or OR?</h2>
<p>This is reasonably easy to test, if you can&#8217;t ask someone who knows.</p>
<p>Pick a word that will be rare on your site and another word that doesn&#8217;t appear with the rare one  e.g.  &#8221;Topaz form&#8221; for my intranet.  A rare word is one that should only appear one or two times in the entire dataset so you can check that the other word doesn&#8217;t appear with it.  You may need to be a bit imaginative but unique things like product codes can be helpful here.  If the query returns no results you&#8217;ve probably got an AND search.  More than a couple of results (and ones that don&#8217;t mention Topaz) and you&#8217;ve probably got OR.</p>
<p>(this can get messed up if there is query expansion going on but hopefully the rare word isn&#8217;t one whatever query expansion rules there are will work on).</p>
<p>AND is more likely to be problematic as a setting. You&#8217;ll get lots of &#8220;no results&#8221;. You&#8217;ll need your users to be super precise with their terminology and spell every word right.  If they are looking for &#8220;holiday form&#8221; and the form is called &#8220;annual leave form&#8221; they&#8217;ll get no results.</p>
<p>OR will generate lots of results. This is ok if the sort order is sensible. Very few people care that Google returned 2,009,990 results for their query. They just care that the first result is spot-on.</p>
<p>So most of the time you probably want an OR set-up.</p>
<p>(preferably combined with support for phrase searching so the users can choose to put their searches in nice speech marks to run an AND search if they want to and know how to).</p>
<h2>Is there crazy stemming/query expansion going on?</h2>
<p><a href="http://en.wikipedia.org/wiki/Query_expansion">Query expansion</a> is search systems trying to be clever,  often getting it wrong and not telling you what they&#8217;ve done so you can unpick it. Basically the search system is taking the words you gave it and giving you results for those words, plus some others that it thinks are relevant or related.</p>
<p>Typical types of expansion are stemming (expand a search for fish to include fishes and fishing), misspellings and synonyms (expand a search for cockerel to include rooster).</p>
<p>This is probably what is happening if you are getting results that don&#8217;t seem to include the words you searched for anywhere on the page (although metadata is another option). </p>
<p>Now this stuff can be really, really helpful. If it is any good.</p>
<p>Have you got smart sophisticated query expansion like Google?  Or does it do silly (from a day-to-day not a Latin perspective) stemming like equating animation with animals? If it is the silly version then definitely switch it off (or tweak it if you can). </p>
<p>Even if you&#8217;ve got smart expansion options available, it&#8217;s generally best practice to either give the user the option of running the expanding (or alternate) query, or at the very least of undoing it if you&#8217;ve got it wrong. They won&#8217;t always spot the options (Google puts lots of effort into coming up with the right way of doing this) but it&#8217;s bad search engine etiquette to force your query on a user.</p>
<h2>Is the sort order sensible?</h2>
<p>That Homebase example. The main problem here is sorting by price low-high. That&#8217;d be fine (actually very considerate of Homebase) if I&#8217;d navigated to a category full of rabbit hutches. But I didn&#8217;t. I searched for rabbit hutches and got a mixed bag of results that included plenty of things that a small child could tell you aren&#8217;t rabbit hutches. </p>
<p>The solution? Sort by relevancy. </p>
<p>I&#8217;ve seen quite a lot of bad search set-ups recently where the search order was set to alphabetical. Why? Unless as <a href="http://twitter.com/currybet/status/9926143860">Martin said</a> when I bemoaned this on Twitter your main use case is &#8220;to enable people to find stuff about aardvarks&#8221;.</p>
<p>News sites sometimes go with most recent as the sort order. Kinda makes sense but you need to be sure the top results are still relevant not just recent.</p>
<p>Interestingly sort order doesn&#8217;t matter so much if you&#8217;ve gone for AND searches and you haven&#8217;t got any query expansion going on. If you&#8217;re pretty sure that everything in the result set is relevant, then you&#8217;ve got more freedom over sort order.  If not,  stick with relevancy.</p>
<p>(I don&#8217;t need to tell you that you want relevancy is high-low, do I?)</p>
<p>So people stop giving me grief over navigation.  Let&#8217;s talk about that rubbish search engine you&#8217;ve got.  I could probably fix that for you.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2010/03/05/why-your-search-engine-probably-isnt-rubbish/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Search Solutions 2009</title>
		<link>http://www.iaplay.com/2009/10/05/search-solutions-2009/</link>
		<comments>http://www.iaplay.com/2009/10/05/search-solutions-2009/#comments</comments>
		<pubDate>Mon, 05 Oct 2009 05:54:52 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[events]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=1103</guid>
		<description><![CDATA[Last week I went to the Search Solutions event, held by BCS in their lovely office in Southampton Street. There were maybe 50 people, 6 or 7 women and seemingly even less laptops (which rather made it stand out from the more web-focused events I usually attend &#8211; because of lack of laptops not the [...]]]></description>
			<content:encoded><![CDATA[<p>Last week I went to the Search Solutions event, held by BCS in their lovely office in Southampton Street. There were maybe 50 people, 6 or 7 women and seemingly even less laptops (which rather made it stand out from the more web-focused events I usually attend &#8211; because of lack of laptops not the male-female ratio).</p>
<p>I didn&#8217;t make masses of notes but I did capture a few points and reminders:</p>
<p>Vivian Lin Dufour from Yahoo talked about <a href="http://help.yahoo.com/l/us/yahoo/search/searchpad/">Search Pad</a>, an attempt to make search more &#8220;stateful&#8221;.</p>
<p>Richard Russell from Google explained how the auctions for Google Ads work. Always interesting to hear more about the money side of things.</p>
<p>Dave Mountain, a geographer (another example of <a href="http://www.iaplay.com/2008/06/06/nomen-et-omen/">Nominative Determinism</a>?) talked about geographical aspects of searching. He explained that if the task is &#8220;finding the nearest cafe&#8221;, then the &#8216;near&#8217; isn&#8217;t a simple statement. There are types of near: as the crow flies, in travel time, in the direction I&#8217;m already going. After all you may not be interested in a cafe that&#8217;s already 5 miles behind you on the motorway. He had some good slides covering this, so hopefully they&#8217;ll be made available.</p>
<p>Tony Russell-Rose discussed Endeca&#8217;s impending pattern library. Should be interesting &#8211; public version to be available in the new year.</p>
<p>David White of Web Optimiser talked amongst other things about the importance of cross-media optimisation. He asked why don&#8217;t more companies, especially b2b ones, have phone numbers in title/description of search results? He also touched on the growth of twitter as a substantial source of referrals (in response to a question about whether Bing was increasing referrals and thus changing optimisation tactics).</p>
<p>Richard Boulton, as well as discussing his efforts with open source search, introduced us to the marvelous concept of <a href="http://devfort.com/">dev/fort/</a>.</p>
<blockquote><p>&#8220;Imagine a place of no distractions, no IM, no Twitter — in fact, no internet. Within, a group of a dozen or more developers, designers, thinkers and doers. And a lot of a food.</p>
<p>Now imagine that place is a fort.&#8221;</p></blockquote>
<p>Well marvellous to me but I wanted to get married in a Napoleonic fort so perhaps I&#8217;m not typical. He also mentioned <a href="http://searchevent.org/">searchevent.org</a>, a day dedicated to open source search systems, which will hopefully happen again sometime.</p>
<p>Andrew Maisey talked about a school of thought that search will increasingly become less important on the site. Dynamic user journeys will encourage more browsing.</p>
<p>(Food was pretty good as usual for the venue.  I&#8217;m hoping that we&#8217;re going back to BCS for our team away-day later in the year and then I can have more of the strawberry tarts.)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2009/10/05/search-solutions-2009/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>SharePoint search: more insights</title>
		<link>http://www.iaplay.com/2009/09/30/sharepoint-search-more-insights/</link>
		<comments>http://www.iaplay.com/2009/09/30/sharepoint-search-more-insights/#comments</comments>
		<pubDate>Wed, 30 Sep 2009 05:56:13 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[search]]></category>
		<category><![CDATA[sharepoint]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=1048</guid>
		<description><![CDATA[Surprisingly this white paper on building multilingual solutions in SharePoints provides a good overview of how the search works, regardless of whether you are interested in the multilingual aspect. White paper: Plan for building multilingual solutions. Read page 15, titled &#8220;overview of the language features in search&#8221; for a description of content crawling and search [...]]]></description>
			<content:encoded><![CDATA[<p>Surprisingly this white paper on building multilingual solutions in SharePoints provides a good overview of how the search works, regardless of whether you are interested in the multilingual aspect.</p>
<p><a href="http://technet.microsoft.com/en-us/library/cc262942.aspx">White paper: Plan for building multilingual solutions</a>.</p>
<p>Read page 15, titled &#8220;overview of the language features in search&#8221; for a description of content crawling and search query extraction. Then 16-18 provide a good overview of individual features and what they are doing. </p>
<blockquote>
<p class="MsoNormal"><a name="_Toc149832531"><strong><span lang="EN-US">Word breakers</span></strong></a><span lang="EN-US"> A word breaker is a component used by the query and index engines to break compound words and phrases into individual words or tokens. If there is no word breaker for a specific language, the neutral word breaker is used, in which case word breaking occurs where there are white spaces between the words and phrases. At indexing time, if there is any locale information associated with the document (for example, a Word document contains locale information for each text chunk), the index engine will try to use the word breaker for that locale. If the document does not contain any locale information, the user locale of the computer the indexer is installed on is used instead. At query time, the locale (HTTP_ACCEPT_LANGUAGE) of the browser from which the query was sent is used to perform word breaking on the query. Additional information about the language availability of the word breaker component is available in <em>Appendix B:</em> <em>Search Language Considerations</em>.</span></p>
<p class="MsoNormal"><a name="_Toc149832532"><strong><span lang="EN-US">Stemming</span></strong></a><span lang="EN-US"> Stemming is a feature of the word breaker component used only by the query engine to determine where the word boundaries are in the stream of characters in the query. A stemmer extracts the root form of a given word. For example, ”running,” ”ran,” and ”runner“ are all variants of the verb ”to run.” In some languages, a stemmer expands the root form of a word to alternate forms. Stemming is turned off by default. Stemmers are available only for languages that have morphological expansion; this means that, for languages where stemmers are not available, turning on this feature in the Search Result Page (CoreResult Web Part) will not have any effect. Additional information about language availability for the Stemmer feature is available in <em>Appendix B:</em> <em>Search Language Considerations</em>.<strong> </strong></span></p>
<p class="MsoNormal"><a name="_Toc149832533"><strong><span lang="EN-US">Noise words dictionary</span></strong></a><strong><span lang="EN-US"> </span></strong><span lang="EN-US">Noise words are words that do not add value to a query, such as ”and,” ”the,” and ”a.” The indexing engine filters them to save index space and to increase performance. Noise word files are customizable, language-specific text files. These files are a simple list of words, one per line. If a noise word file is changed, you must perform a full update of the index to incorporate the changes. Additional information about the noise words dictionary and how to customize it is available at www.microsoft.com.</span></p>
<p class="MsoNormal"><a name="_Toc149832534"><strong><span lang="EN-US">Custom dictionary</span></strong></a><strong><span lang="EN-US"> </span></strong><span lang="EN-US">The custom dictionary file contains values that the search server must include at index and query times. Custom dictionary lists are customizable, language-specific text files. These files are used by Search in both the index and query processes to identify exceptions to the noise word dictionaries. A word such as “AT&amp;T,” for example, will never be indexed by default because the word breaker breaks it into single noise words. To avoid this, the user can add ”AT&amp;T” to the custom dictionary file; as result, this word will be treated as an exception by the word breaker and will be indexed and queried. These files contain a simple list of words, one per line. If the custom dictionary file is changed, you must perform a full update of the index to incorporate the changes. By default, no custom dictionary file is installed during Office SharePoint Server 2007 Setup. Additional information about the custom dictionary file and how to customize it is available at www.microsoft.com. </span></p>
<p class="MsoNormal"><a name="_Toc149832535"><strong><span lang="EN-US">Thesaurus</span></strong></a><span lang="EN-US"> There is a configurable thesaurus file for each language that Search supports. Using the thesaurus, you can specify synonyms for words and also automatically replace words in a query with other words that you specify. The thesaurus used will always be in the language of the query, not necessarily the server’s user locale. If a language-specific thesaurus is not available, a neutral thesaurus (tseneu.xml) is used. Additional information about the thesaurus file and how to customize it is available at www.microsoft.com.</span></p>
<p class="MsoNormal"><a name="_Toc149832536"><strong><span lang="EN-US">Language Auto Detection</span></strong></a><strong><span lang="EN-US"> </span></strong><span lang="EN-US">The Language Auto Detection (LAD) feature generates a best guess about the language of a text chunk based on the Unicode range and other language patterns. Basically, it’s used for relevance calculation by the index engine and in queries sent from the Advanced Search Web Part, where the user is able to specify constraints on the language of the documents returned by a query. </span></p>
<p class="MsoNormal"><a name="_Toc149832537"><strong><span lang="EN-US">Did You Mean?</span></strong></a><strong><span lang="EN-US"> </span></strong><span lang="EN-US">The Did You Mean? feature is used by the query engine to catch possible spelling errors and to provide suggestions for queries. The Did You Mean? feature builds suggestions by using three components:</span></p>
<p class="Bullet1"><!--[if !supportLists]--><span style="font-family: Symbol;" lang="EN-US"><span>·<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal; -x-system-font: none;"> </span></span></span><!--[endif]--><strong><span lang="EN-US">Query log</span></strong><span lang="EN-US"> Information tracked in the query log includes the query terms used, when the search results were returned for search queries, and the pages that were viewed from search results. This search usage data helps you understand how people are using search and what information they are seeking. You can use this data to help determine how to improve the search experience for users.</span></p>
<p class="Bullet1"><!--[if !supportLists]--><span style="font-family: Symbol;" lang="EN-US"><span>·<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal; -x-system-font: none;"> </span></span></span><!--[endif]--><strong><span lang="EN-US">Dictionary lexicon</span></strong><span lang="EN-US"> A dictionary of most-used lexicons provided at installation time.</span></p>
<p class="Bullet1"><!--[if !supportLists]--><span style="font-family: Symbol;" lang="EN-US"><span>·<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal; -x-system-font: none;"> </span></span></span><!--[endif]--><strong><span lang="EN-US">Custom lexicon</span></strong><span lang="EN-US"> A collection of the most frequently occurring words in the corpus, built at query time by the query engine from indexed information. </span></p>
<p class="MsoNormal"><span lang="EN-US">The Did You Mean? suggestions are available only for English, French, German, and Spanish.</span></p>
<p class="MsoNormal"><a name="_Toc149832538"><strong><span lang="EN-US">Definition Extraction</span></strong></a><span lang="EN-US"> The Definition Extraction feature finds definitions for candidate terms and identifies acronyms and their expansions by examining the grammatical structure of sentences that have been indexed (for example, NASA, radar, modem, and so on). It is only available for English.</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2009/09/30/sharepoint-search-more-insights/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>BCS IRSG &#8211; Search Solutions 2009</title>
		<link>http://www.iaplay.com/2009/09/24/bcs-irsg-search-solutions-2009/</link>
		<comments>http://www.iaplay.com/2009/09/24/bcs-irsg-search-solutions-2009/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 05:42:13 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[events]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=979</guid>
		<description><![CDATA[I&#8217;m going to &#8220;Innovations in Web and Enterprise Search&#8221; at BCS next week Search Solutions is a special one-day event dedicated to the latest innovations in web and enterprise search. In contrast to other major industry events, Search Solutions aims to be highly interactive and collegial, with attendance limited to 60-80 delegates. Provisional programme 09:30 [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m going to &#8220;Innovations in Web and Enterprise Search&#8221; at BCS next week</p>
<blockquote><p>Search Solutions is a special one-day event dedicated to the latest innovations in web and enterprise search. In contrast to other major industry events, Search Solutions aims to be highly interactive and collegial, with attendance limited to 60-80 delegates.</p></blockquote>
<p>Provisional programme</p>
<p>09:30 &#8211; 10:00 Registration and coffee</p>
<p>Session 1: (Chair: Tony Russell-Rose)</p>
<p>* 10:00 Introduction &#8211; Alan Pollard, BCS President</p>
<p>* 10:10 &#8220;Enterprising Search&#8221; &#8211; Mike Taylor, Microsoft</p>
<p>* 10:35 Accessing Digital Memory: Yahoo! Search Pad &#8211; Vivian Lin Dufour, Yahoo</p>
<p>* 11:00 &#8220;How Google Ads Work&#8221; &#8211; Richard Russell, Google</p>
<p>11:25 &#8211; 11:45 COFFEE BREAK</p>
<p>Session 2: (Chair: Andy MacFarlane)</p>
<p>* 11:45 &#8220;Location-based services: Positioning, Geocontent and Location-aware Applications&#8221; &#8211; Dave Mountain, Placr</p>
<p>* 12:10 &#8220;Librarians, metadata, and search&#8221; &#8211; Alan Oliver, Ex Libris</p>
<p>* 12:35 &#8220;UI Design Patterns for Search &amp; Information Discovery&#8221;- Tony Russell-Rose, Endeca</p>
<p>13:00 &#8211; 14:15 LUNCH</p>
<p>Session 3: (Chair: Leif Azzopardi)</p>
<p>* 14:15 &#8220;Search-Based Applications: the Maturation of Search&#8221; &#8211; Greg Grefenstette, Exalead</p>
<p>* 14:40 &#8220;How and why you need to calculate the true value of page 1 natural search engine positions&#8221; &#8211; Gary Jennings, WebOptimiser</p>
<p>* 15:05 &#8220;Search as a service with Xapian&#8221; &#8211; Richard Boulton, Lemur Consulting</p>
<p>15:30 &#8211; 16:00 TEA BREAK</p>
<p>Session 4: (Chair: Alex Bailey)</p>
<p>* 16:00 &#8220;The Benefits of Taxonomy in Content Management&#8221;, Andrew Maisey, Unified Solutions</p>
<p>* 16:25 Panel: &#8220;Interactive Information Retrieval&#8221; &#8211; details to follow</p>
<p>17:00 &#8211; 19:00 DRINKS RECEPTION</p>
<p>via <a href="http://irsg.bcs.org/SearchSolutions/2009/sse2009.php">BCS IRSG &#8211; Search Solutions 2009</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2009/09/24/bcs-irsg-search-solutions-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>search forms on online shops</title>
		<link>http://www.iaplay.com/2009/09/04/search-forms-on-online-shops/</link>
		<comments>http://www.iaplay.com/2009/09/04/search-forms-on-online-shops/#comments</comments>
		<pubDate>Fri, 04 Sep 2009 05:34:17 +0000</pubDate>
		<dc:creator>Karen</dc:creator>
				<category><![CDATA[e-commerce]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.iaplay.com/?p=954</guid>
		<description><![CDATA[I&#8217;ve been thinking about the search functionality for our online shop this week. I&#8217;ll write up our approach to search properly at a later date but for now I thought I share the variety of search forms I&#8217;ve seen on other online shops. Some things of note: The longer search boxes were mostly on book [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been thinking about the search functionality for our online shop this week. I&#8217;ll write up our approach to search properly at a later date but for now I thought I share the variety of search forms I&#8217;ve seen on other online shops.</p>
<p><a title="E-commerce search forms: simple boxes by mog1et, on Flickr" href="http://www.flickr.com/photos/mog1et/3880494265/"><img src="http://farm3.static.flickr.com/2532/3880494265_c4335b7100_o.gif" border="0" alt="E-commerce search forms: simple boxes" width="429" height="229" /></a>
</p>
<p><a title="E-commerce search forms: labelled boxes by mog1et, on Flickr" href="http://www.flickr.com/photos/mog1et/3881290870/"><img src="http://farm4.static.flickr.com/3447/3881290870_eab9a0f25f_o.gif" border="0" alt="E-commerce search forms: labelled boxes" width="542" height="127" /></a></p>
<p><a title="E-commerce search forms: scope drop-downs by mog1et, on Flickr" href="http://www.flickr.com/photos/mog1et/3880494209/"><img src="http://farm4.static.flickr.com/3437/3880494209_890cf98851_o.gif" border="0" alt="E-commerce search forms: scope drop-downs" width="584" height="259" /></a></p>
<p><a title="E-commerce search forms: guidance text by mog1et, on Flickr" href="http://www.flickr.com/photos/mog1et/3880494185/"><img src="http://farm3.static.flickr.com/2594/3880494185_d44abd3c08_o.gif" border="0" alt="E-commerce search forms: guidance text" width="588" height="289" /></a></p>
<p>Some things of note:</p>
<ul>
<li> The longer search boxes were mostly on book sites.</li>
<li>3 sites also offered &#8220;suggestions as you type&#8221; (Amazon, Borders, Ocado)</li>
<li>Only 1 site had an obvious link to an advanced search</li>
<li>All sites handled scopes with a dropdown</li>
</ul>
<p>(Visio stencil is from <a href="http://www.guuui.com/issues/02_07.php">GUUUI</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.iaplay.com/2009/09/04/search-forms-on-online-shops/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

