<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>IREvalEtAl</title>
	<atom:link href="http://blog.codalism.com/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://blog.codalism.com</link>
	<description>William Webber's Research Blog</description>
	<lastBuildDate>Wed, 21 Jul 2010 11:09:40 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How (and why) not to rank academics</title>
		<link>http://blog.codalism.com/?p=1220</link>
		<comments>http://blog.codalism.com/?p=1220#comments</comments>
		<pubDate>Wed, 21 Jul 2010 11:09:40 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1220</guid>
		<description><![CDATA[
The recently-launched Microsoft Academic Search, a product of Microsoft Research Asia, has made a bit of a splash as a potential competitor to Google Scholar.   Although its coverage does not seem as detailed as Google Scholar quite yet, MS Academic Search has a number of additional features, such as author and conference pages, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/badwsky/2051097005/"><img src="http://farm3.static.flickr.com/2132/2051097005_cb600906fd_t.jpg" align="left" style="margin-right: 10px; margin-bottom: 3px"></a>
<p>The recently-launched <a href="http://academic.research.microsoft.com/Default.aspx">Microsoft Academic Search</a>, a product of Microsoft Research Asia, has made a bit of a splash as a potential competitor to Google Scholar.   Although its coverage does not seem as detailed as Google Scholar quite yet, MS Academic Search has a number of additional features, such as author and conference pages, publication activity graphs, and the like.  (It also has a really unwieldly, eight-syllable name; let me abbreviate to MSAS.)</p>
<p><span id="more-1220"></span></p>
<p>Amongst its features, MSAS provides rankings of &#8220;Top Authors&#8221; within various fields of computer science.  This is, needless to say, not something to be done lightly: academia is built around status and reputation, and to rank academics is to make a statement about their status, which a <a href="http://academic.research.microsoft.com/About/Help.htm#Ranking">disclaimer</a> buried in the explanatory notes does little to diminish.  And in this case, MSAS should have thought bit more carefully before they decided to publish these rankings.</p>
<p>In MSAS, authors are ranked by the number of in-domain citations they receive within a given time period.  This is a dubious enough metric in itself.  But even bearing the questionable methodology in mind, MSAS comes up with some slightly surprising top-ten entries in their list of <a href="http://academic.research.microsoft.com/CSDirectory/author_category_8.htm">top information retrieval authors of all time </a>, and some even more surprising ones in their ranking of <a href="http://academic.research.microsoft.com/CSDirectory/author_category_8_last5.htm">top authors in the last five years</a>.</p>
<p>There are, I suspect, a number of problems with how MSAS is calculating citation counts, but a quick browse through the cited-paper lists of some highly-ranked authors shows that their most egregious aberration within the field of information retrieval is that they are <a href="http://academic.research.microsoft.com/Detail.aspx?entitytype=2&#038;searchtype=2&#038;id=161644&#038;orderBy=1">including TREC overview papers</a> in their citation counts.</p>
<p>Now, TREC overview papers attract a lot of citations, because whenever anyone uses the test collection developed at a TREC task, they cite the corresponding overview paper.  But TREC is not a peer-reviewed conference.  And the overview papers in particular are not research publications at all; they are, rather, <a href="http://trec.nist.gov/pubs/trec16/papers/BLOG.OVERVIEW16.pdf">a summary of organizational information, participant descriptions, and result statistics</a>.  Track organizers put a lot of effort into the tracks, and deserve recognition for this effort in other ways; but counting citations to their overview papers is like ascribing conference citations to the person who wrote the preface to the proceedings.  </p>
<p>To be fair, Google Scholar also includes TREC papers in their list of academic publications.  But then Google Scholar does not attempt to rank academics.  This may simply be laziness on Google&#8217;s part, but it might also be a recognition that this is tricky ground to be stepping on.  Providing a slight specialization of a search interface is one thing; taking it upon yourself to summarize and rank the publication career of researchers is another.  If you&#8217;re going to do the latter, you had better pay some attention to doing it correctly.</p>
<p>MSAS has some interesting and useful features.  But, as it stands, the author ranking is not one of them.  It should be taken down until they get their data and methodology right.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1220</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>EVIA and NTCIR</title>
		<link>http://blog.codalism.com/?p=1211</link>
		<comments>http://blog.codalism.com/?p=1211#comments</comments>
		<pubDate>Wed, 07 Jul 2010 06:07:35 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1211</guid>
		<description><![CDATA[ The latest iteration of the NTCIR effort, NTCIR-8, has concluded.  NTCIR is a collaborative information retrieval forum, focusing on tasks in East Asian languages (currently Chinese, Japanese, and Korean, plus English); it is otherwise known as &#8220;the Asian TREC&#8221;.   The proceedings are available online.


This year was my first at NTCIR.  [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/inkhong/3501123776/"><img src="http://farm4.static.flickr.com/3631/3501123776_f2221df401_t.jpg" style="margin-right: 10px; margin-bottom: 3px" align="left"></a> The latest iteration of the NTCIR effort, NTCIR-8, has concluded.  NTCIR is a collaborative information retrieval forum, focusing on tasks in East Asian languages (currently Chinese, Japanese, and Korean, plus English); it is otherwise known as &#8220;the Asian TREC&#8221;.   The proceedings are <a href="http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings8/NTCIR/index.html">available online</a>.
<p>
<span id="more-1211"></span></p>
<p>This year was my first at NTCIR.  It&#8217;s a reasonably sized community, drawn largely but not exclusively from East Asia, engaged in an interesting and diverse range of tasks.  It is, therefore, a good opportunity to meet people in this geographic and scientific area.  There are also some interesting cultural differences.  The audience was very reluctant to ask questions of speakers, and once Charlie Clarke and Ian Soboroff left (on the last day), it was noticeably quieter.
<p>The first day of NTCIR was given over to EVIA (the international workshop for evaluating information access), for which I was co-chair.  There were a number of interesting papers, not least from Charlie Clarke and Ian Soboroff, and (repeatedly) from Tetsuya Sakai.  We&#8217;ve informally renamed it the &#8220;Tetsuya Sakai Memorial Colloquium&#8221;.  The proceedings for EVIA are also <a href="http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings8/EVIA/index.html">available online</a>.
<p>The organization of NTCIR is changing from this year, with a stronger focus upon community-led initiatives.  The new Evaluation Co-Chairs are Hideo Joho, of the University of Tsukuba, and the famous Tetsuya Sakai, of Microsoft Research Asia.  The <a href="http://research.nii.ac.jp/ntcir/ntcir-ws9/Call_for_NTCIR-9_Task_Proposals.pdf">Call for NTCIR-9 Task Proposals</a> has been released.  If you&#8217;re interested in proposing a task for the next iteration of NTCIR, concluding in December 2011, please grab the Call, and contact the organizers.  And once the tasks have been decided and announced, please consider taking part in one of the efforts. I recommend NTCIR as an approachable, interesting, and energetic community to get involved with.<br />
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1211</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CIKM reviewing: too much and too little</title>
		<link>http://blog.codalism.com/?p=1195</link>
		<comments>http://blog.codalism.com/?p=1195#comments</comments>
		<pubDate>Mon, 05 Jul 2010 11:58:14 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1195</guid>
		<description><![CDATA[
I&#8217;ve just finished my CIKM 2010 reviewing assignments, and have come out of the process with a number of questions:

Is nine papers too many to review?
This year, I was assigned nine papers to review.  Last year it was five.  For SIGIR (if my own local cache of reviews is to be believed) it [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/18789396@N00/144251306/"><img src="http://farm1.static.flickr.com/53/144251306_03d289b8eb_t.jpg" style="margin-right: 10px; margin-bottom: 3px" align="left"></a><br />
I&#8217;ve just finished my CIKM 2010 reviewing assignments, and have come out of the process with a number of questions:</p>
<p><span id="more-1195"></span></p>
<h3>Is nine papers too many to review?</h3>
<p>This year, I was assigned nine papers to review.  Last year it was five.  For SIGIR (if my own local cache of reviews is to be believed) it was only two this year (so few!) and eight the year before.</p>
<p>Nine papers is quite a heavy reviewing load.  It used to take me the best part of a day to give a responsible review.  Now, with experience, I seem to have got it down to four hours or so.  Even so, a nine-paper stack takes a few days to get through.  Another colleague, himself a responsible reviewer, opted out this year because of the load.</p>
<p>The advantage with having a large number to review, though, is that it gives you a sufficient sample to calibrate your own reviewing.  This is particularly important for a conference with as low an acceptance rate as CIKM, whose rate has hovered around 16% in recent years.  With nine papers, you have a ration of one accept and one maybe.  With only, say, four papers, it is more difficult to know whether you are being overly indulgent or overly stingy.</p>
<h3>Is 16% of submissions too few to accept?</h3>
<p>There was a very interesting article in the most recent issue of the Communications of the ACM, by JiLin Chen and Joseph A. Konstan, entitled <a href="http://doi.acm.org/10.1145/1743546.1743569">&#8220;Conference Paper Selectivity and Impact&#8221;</a>.  The authors surveyed the ACM Digital Library, comparing the acceptance rates and impact factors of ACM conferences and journals.  They found that, on average, journal articles achieve a similar citation rate as papers from conferences with an acceptance rate of 35%.  This is interesting in itself, as it implies (as people in CS always claim) that top conferences are more prestigious than journals (although part of the cause might be the ACM&#8217;s glacial turnover rate in getting journal submissions to print).  </p>
<p>More interestingly for our current discussion, though, was the finding (on slightly questionable methodology, it is true) that the most-cited 10% of submissions at conferences with acceptance rates below 15% receive fewer citations than the most-cited 10% of submissions at conferences with between 15% and 25% acceptance rates.  I say that the methodology was slightly questionable because it compared all the papers from a conference with 10% acceptance with, say, the most highly cited half of papers from a conference with 20% acceptance, which elides random effects (more citeable papers from the stricter conference may have been rejected by chance).  Nevertheless, the finding gives quantitive form to the often-expressed rule-of-thumb that an acceptance rate over 20% means accepting bad papers, whereas one under 20% means rejecting good ones.</p>
<p>Is it the case (as the authors speculate) that conferences with too-tight acceptance rates are squeezing out innovative papers in favour of formulaic ones?</p>
<h3>Are big conferences eating small conferences?</h3>
<p>The main reason, of course, why CIKM reviewers have such a heavy workload, and CIKM acceptance rates are so low, is that CIKM receives a huge and swelling number of submissions, as the following table shows:</p>
<p><center></p>
<table>
<tr>
<th>Year</th>
<th>Location</th>
<th>Submissions</th>
<th>Acceptance rate</th>
</tr>
<tr>
<td>2006</td>
<td>Arlington, VA, US
<td>537</td>
<td>15%</td>
</tr>
<tr>
<td>2007</td>
<td>Lisbon, Portugal</td>
<td>512</td>
<td>17%</td>
</tr>
<tr>
<td>2008</td>
<td>Napa Valley, CA, US</td>
<td>772</td>
<td>17%</td>
</tr>
<tr>
<td>2009</td>
<td>Hong Kong, China</td>
<td>847</td>
<td>15%</td>
</tr>
<tr>
<td>2010</td>
<td>Toronto, Canada</td>
<td>945</td>
<td>??</td>
</tr>
</table>
<p></center></p>
<p>(By the way, the above table also gives quantitive justification to my qualitative assesment that no-one in their right mind wants to go to a conference in Europe.)</p>
<p>Meanwhile, my anecdotal impression is that smaller conferences are struggling to attract submissions; certainly, this has been the case for a number of smaller conferences I have been involved with recently. </p>
<p>There are many reasons why big conferences could be eating small conferences.  One of them might be a growing emphasis upon prestige of venue in assessing publication impact.  Whatever the reason, it is hardly a desireable trend.  The large conferences are sausage factories for academic publications &#8212; CIKM churned through 55 presentations on the opening day last year, ignoring posters.  There is also both anecdotal (and, given the previous point, now quantitative) evidence that the large conferences suppress innovation in favour of formula.  Plus the sheer scale of the larger conferences make them questionable venues for actually meeting and getting to know other people in your area &#8212; unless, that is, you are a senior researcher meeting up with your peers.  </p>
<p>In contrast, smaller conferences provide more of the opportunities that conferences properly provide (leaving aside their dubious role as high-status publication venues).  The lack of parallel tracks means that there is more unity to the proceedings, and more of an opportunity to hear a range of work.  The smaller number of attendees means there is more of an opportunity to get to know other researchers.  And the higher acceptance rate means there is more room for novel, off-beat, and experimental ideas.</p>
<h3>Why do people publish incrementally at conferences?</h3>
<p>One of my pet hates is researchers who publish their work incrementally at conferences: the basic idea as an innovation at one conference, then a series of tweaks to it at successive ones.  Two or three of the CIKM submissions I reviewed this year were in this category: one group of authors gave additional experiments for a previous publication; another developed a framework for choosing the parameters for their earlier optimization method.  (For the third, double-blind reviewing meant I couldn&#8217;t be sure whether they were minimally extending their own work, or someone else&#8217;s.)  This is a real perversion of the purported point of conferences as venues for the fast publication of new ideas.  Once you have your SIGIR publication for your research program, surely that is enough validation?  Isn&#8217;t the next step to develop it over the next year or so into a fully-fledged journal article, rather than dribbling out new ideas as they occur to you?  Such incremental submissions should be stamped out with all vigour.  These minor evolutions clutter up high-profile conferences like CIKM and SIGIR, squeezing out newer, fresher work.</p>
<h3>Why am I repeatedly given my own submission to review?</h3>
<p>This is the second CIKM at which I have been assigned my own submission to review.  The first time appeared to be a chance occurrence, but the second began to make me suspicious.  How many others does this happen to?  Let&#8217;s assume for simplicity that people review only in the research sub-field they submit to; that there are as many sub-fields at CIKM as there are sessions; and that submissions are evenly divided amongst sessions.  There have been around 900 submissions to CIKM for each of the past two years, and 25 or so sessions per conference.  This means 36 submissions per sub-field.  If you review 8 submissions, then there is a (35/36) * (34/35) * &#8230;  * (28/29) = 7/9 chance that you will <b>not</b> randomly be given your own paper to review, or a 2/9 chance that you <b>will</b>.  Therefore, the chance of it happening two years in a row is 4/81 (although of course there&#8217;s a selection bias here: you only take notice of the two-year sequence that it does happen in).  So it&#8217;s not that improbable an occurrence.  But given it must be happening to over 20% of the program committee each year, surely the allocation algorithm would have been fixed to make sure reviewers weren&#8217;t assigned their own paper?  Or am I the only one who lets the chairs know when it happens?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1195</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Evaluating keyword search in databases</title>
		<link>http://blog.codalism.com/?p=1171</link>
		<comments>http://blog.codalism.com/?p=1171#comments</comments>
		<pubDate>Fri, 18 Jun 2010 00:31:33 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1171</guid>
		<description><![CDATA[I was recently invited to contribute a short article to a special issue of the IEEE Data Engineering Bulletin on keyword search in databases.  Since database keyword search is not an area I have worked on previously, I decided the most worthwhile contribution I could make was to survey evaluation practice in the area, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/toepfer/3674430468/in/set-72157594583739979"><img src="http://farm4.static.flickr.com/3649/3674430468_e0abec73e0_t.jpg" align="left" style="margin-right: 10px; margin-bottom: 3px"></a>I was recently invited to contribute <a href="ftp://ftp.research.microsoft.com/pub/debull/A10mar/webber-paper.pdf">a short article</a> to a special issue of the <a href="http://tab.computer.org/tcde/bull_about.html">IEEE Data Engineering Bulletin</a> on <a href="ftp://ftp.research.microsoft.com/pub/debull/A10mar/issue1.htm">keyword search in databases</a>.  Since database keyword search is not an area I have worked on previously, I decided the most worthwhile contribution I could make was to survey evaluation practice in the area, and compare it to what is done in mainstream information retrieval.
<p>
<span id="more-1171"></span></p>
<p>Keyword search in databases is not what I originally thought it was.  That is, it is not full-text search on individual text fields.  Rather, the search is done over the content of whole objects in the database, in the form of (joined) tuples.  And since there are many ways that tuples can be joined in a database, one of the immediate questions is how to realize these joins, what level of object granularity to consider for retrieval, and what effect tuple structure has upon the similarity score matching queries to results.
<p>The field of keyword search on databases (or, more generally, on structured data) is still relatively new; the initial work in the area was done in 2002.  And one of the aspects of the field that is still being developed is its evaluation resources and methodology.  As I describe in my article, different research groups develop their own test sets.  A few structured collections are commonly used, such as DBLP.  But almost invariably, each research group specifies their own retrieval task, creates their own test queries (or selects them from a log), and performs their own evaluation.  Perhaps as a result, a common pattern is that each paper reports almost perfect effectiveness, doubling or tripling that of the previous method &#8212; which itself had achieved almost perfect effectiveness, doubling or tripling that of <i>its</i> predecessor.
<p>As I suggest in my article, such evaluation results suggest a lack of objectivity in the evaluation process.  In particular, it is likely that if a research group has set out to solve one particular problem in keyword search (for instance, matching of keywords to schema terms), then they are likely to formulate test tasks that illustrate the problem they are trying to solve.  Such targetted evaluations have their place, but what is needed in addition is a common set of retrieval tasks, intended to be representative of general retrieval needs, which serves as a neutral benchmark against which each group&#8217;s methods can be tested.  That is to say, what is needed is a public test collection, the core evaluation tool in IR.<br />
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1171</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Easter eggs in academic books</title>
		<link>http://blog.codalism.com/?p=1166</link>
		<comments>http://blog.codalism.com/?p=1166#comments</comments>
		<pubDate>Mon, 05 Apr 2010 01:20:47 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1166</guid>
		<description><![CDATA[I&#8217;m currently reading Nonsampling Error in Surveys, by Judith Lessler and William Kalsbeek (Wiley, 1992) for a project I&#8217;m working on.  It is, for the most part, an informative but necessarily rather dry treatment of statistical questions in survey design and interpretation.  The segue at the end of Chapter 10 (p276), however, reads:

If [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/ruprechtl/70189414/sizes/o/"><img src="http://farm1.static.flickr.com/35/70189414_da8b49aee9_t.jpg" align="left" style="margin-right: 10px; margin-bottom: 3px"></a>I&#8217;m currently reading <a href="http://au.wiley.com/WileyCDA/WileyTitle/productCd-0471869082.html">Nonsampling Error in Surveys</a>, by Judith Lessler and William Kalsbeek (Wiley, 1992) for a project I&#8217;m working on.  It is, for the most part, an informative but necessarily rather dry treatment of statistical questions in survey design and interpretation.  The segue at the end of Chapter 10 (p276), however, reads:</p>
<blockquote><p>
If the pancake does not decide to enter the Olympics as a disk, causing the athletes to fall far below records set previously by Russian athletes because of the sticky maple syrup, we will tell you next, dear reader, about measurement variability.
</p></blockquote>
<p>This is even referenced in the index under &#8220;pancake(s)&#8221;.  I wonder whether this was intended as an Easter egg for the reader, or to test the attention of the editor&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1166</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>EVIA deadline extended</title>
		<link>http://blog.codalism.com/?p=1160</link>
		<comments>http://blog.codalism.com/?p=1160#comments</comments>
		<pubDate>Tue, 30 Mar 2010 09:44:02 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1160</guid>
		<description><![CDATA[ We&#8217;ve received a couple of requests for more time to complete EVIA submissions, and so have extended the deadline to 23:59, GMT-12 on Friday, 2nd April.  (Yes, I realize that the consequent does not necessarily follow from the antecedent, but we&#8217;re just naturally accommodating.) Note the timezone: the submission system is open as [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/tranchis/3638033597/"><img src="http://farm4.static.flickr.com/3651/3638033597_5da28081c1_t.jpg" align="left" style="margin-right: 10px; margin-bottom: 2px"></a> We&#8217;ve received a couple of requests for more time to complete EVIA submissions, and so have <a href="http://tetsuyasakai.blogspot.com/2010/03/evia-2010-deadline-extended.html">extended the deadline</a> to 23:59, GMT-12 on Friday, 2nd April.  (Yes, I realize that the consequent does not necessarily follow from the antecedent, but we&#8217;re just naturally accommodating.) Note the timezone: the submission system is open as long as it is 2nd April anywhere in the world.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1160</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SIGSIGIR</title>
		<link>http://blog.codalism.com/?p=1148</link>
		<comments>http://blog.codalism.com/?p=1148#comments</comments>
		<pubDate>Fri, 26 Mar 2010 22:20:45 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1148</guid>
		<description><![CDATA[One of the most important emerging fields within information retrieval is that of complaining about SIGIR.  It is a field of work with a very wide participation.  Researchers complain about the upcoming deadline in January; reviewers complain about their reviewing load and the quality of submissions during February; the senior program committee complains [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/myloonyland/364530488/"><img src="http://farm1.static.flickr.com/145/364530488_57e83f1f11_t.jpg" align="left" style="margin-right: 10px; margin-bottom: 3px"></a>One of the most important emerging fields within information retrieval is that of complaining about SIGIR.  It is a field of work with a very wide participation.  Researchers complain about the upcoming deadline in January; reviewers complain about their reviewing load and the quality of submissions during February; the senior program committee complains about reviewers, the review process, the acceptance criteria, and each other throughout March; accepted and rejected authors alike <a href="http://palblog.fxpal.com/?p=3277">complain about the quality of reviews</a>; conference attendees complain about the quality of the accepted pages; all of these complaints are rehearsed at the conference business meeting; and bloggers complain about all the complaining.<br />
<span id="more-1148"></span><br />
Alongside the cycle of impassioned disaffection with SIGIR is a parallel industry in recycling rejected SIGIR papers.  Traditionally, these were bounced to CIKM in June, and CIKM rejects back to SIGIR the following January.  As SIGIR submission numbers climb, though, and the acceptance rate ebbs away, other publication venues are increasingly being fertilized with SIGIR&#8217;s nutritious run-off.  There is even now <a href="http://nonrel.wordpress.com/2010/03/25/welcome-to-not-relevant/">an online journal devoted to SIGIR rejects</a>, launched two days ago by the market-savvy Ian Soboroff, as this year&#8217;s rejection slips were being emailed out.</p>
<p>On an unrelated topic, the submission deadline for <a href="http://research.nii.ac.jp/ntcir/ntcir-ws8/EVIA-2010/">EVIA 2010</a> (of which I am co-chair) is this coming Wednesday, March 31st.  We happen to be using the same paper format as SIGIR.  If anyone has work on information retrieval evaluation they&#8217;re looking for a venue for, we&#8217;d be very happy to consider it for publication at EVIA.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1148</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Come to EVIA, see California for free!</title>
		<link>http://blog.codalism.com/?p=1137</link>
		<comments>http://blog.codalism.com/?p=1137#comments</comments>
		<pubDate>Mon, 08 Feb 2010 05:56:38 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1137</guid>
		<description><![CDATA[ I&#8217;ve recently developed a morbid interest in the literature of financial disasters.  In this vein, I&#8217;ve just finished reading Kindleberger et al., Manias, Panic, and Crashes: A History of Financial Crises &#8212; an interesting book, but remarkable for the confused way the writing is organized, with constant re-iterations, digressions, and non-sequiturs, as if [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/hyperlemon/319319310/"><img src="http://farm1.static.flickr.com/123/319319310_cf5b820188_t.jpg" align="left" style="margin-right: 10px; margin-bottom: 2px"></a> I&#8217;ve recently developed a morbid interest in the literature of financial disasters.  In this vein, I&#8217;ve just finished reading Kindleberger et al., <i>Manias, Panic, and Crashes: A History of Financial Crises</i> &#8212; an interesting book, but remarkable for the confused way the writing is organized, with constant re-iterations, digressions, and non-sequiturs, as if someone had taken several working revisions, cut them up, dropped them on the floor, and then re-assembled the pieces without too much care for their coherence.<br />
<span id="more-1137"></span><br />
Anyway, one of the chapters of the original book (spread out over several fragments in the re-assembled version) describes the boom turned bust in Japan in the late 1980s.  Kindleberger relates, for instance, the story of the Japanese bank which paid $600 million-odd for an office complex in the states, when the asking price was $300 million, merely so that they could gain the record for the world&#8217;s most expensive office purchase.  Another anecdote is that at the height of the boom, the real estate covered by the Imperial Palace in Tokyo had a value greater than all of the real estate in California.</p>
<p>If such stories of dizzying excess captivate you, then you&#8217;ll not want to miss the EVIA 2010, the Evaluating Information Access workshop, held at NII in Tokyo, just across the expressway from the Imperial Palace itself!  This is the third iteration for EVIA, the first international workshop dedicated to information retrieval evaluation issues.  I&#8217;ve been invited to join Tetsuya Sakai, the <i>eminence grise</i> of Asian information retrieval research, and Mark Sanderson, as co-chair this year.  It&#8217;s an interesting time for evaluation research, with increasing interest in HCIR issues and the emergence of crowd-sourced evaluation using Mechanical Turk and the like.</p>
<p>Deadline for submission is March 31st; please see the <a href="http://research.nii.ac.jp/ntcir/ntcir-ws8/EVIA-2010/index.html">Call For Papers</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1137</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Guilty pleasures</title>
		<link>http://blog.codalism.com/?p=1128</link>
		<comments>http://blog.codalism.com/?p=1128#comments</comments>
		<pubDate>Fri, 08 Jan 2010 08:04:55 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1128</guid>
		<description><![CDATA[Bertrand Russell&#8217;s practically-minded grandmother used to disapprovingly refer to his studies of higher mathematics as &#8220;this life you are leading&#8221;.  I have my own bookish degeneracy, namely, books.  When at work I think with guilty pleasure of getting home, getting into my armchair, and reading.  Besides reading, my preferred pastime is &#8212; [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/southarmstudio/2679961615/"><img src="http://farm4.static.flickr.com/3206/2679961615_349cc8fa1a_t.jpg" align="left" style="margin-right: 10px; margin-bottom: 2px"></a>Bertrand Russell&#8217;s practically-minded grandmother used to disapprovingly refer to his studies of higher mathematics as &#8220;this life you are leading&#8221;.  I have my own bookish degeneracy, namely, books.  When at work I think with guilty pleasure of getting home, getting into my armchair, and reading.  Besides reading, my preferred pastime is &#8212; being at work.  Staying home, on holiday, renders me liable to impressment into the sort of extended, aimless social activity that leaves me feeling vaguely psychopathic.  My partner and I are in the happy equilibrium of being able to return home, me from work, her from the beach, and commiserating each the other on the rigours endured.  My officemate Anh artfully combines the two activities; he feigns work so as to get time away from his family at the weekend, in the office, reading novels.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1128</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Accidental publication</title>
		<link>http://blog.codalism.com/?p=1125</link>
		<comments>http://blog.codalism.com/?p=1125#comments</comments>
		<pubDate>Wed, 06 Jan 2010 00:22:26 +0000</pubDate>
		<dc:creator>william</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.codalism.com/?p=1125</guid>
		<description><![CDATA[ I am in the habit of summarizing interesting articles as draft postings on my blog, in order to get the content of the article into my head.   Generally, I don&#8217;t go on to fully polish and publish the summarizing post.  But if I absent-mindedly hit the &#8220;publish&#8221; button when what I [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://hopeliesat24framespersecond.files.wordpress.com/2009/07/070808_sz_mononcle.jpg" width="100" align="left" style="margin-right: 10px; margin-bottom: 2px"> I am in the habit of summarizing interesting articles as draft postings on my blog, in order to get the content of the article into my head.   Generally, I don&#8217;t go on to fully polish and publish the summarizing post.  But if I absent-mindedly hit the &#8220;publish&#8221; button when what I want is &#8220;save draft&#8221;, then I do end up publishing the post, without any polishing.  This happened to my notes on a Wired article on the rise and rise of the <a href="http://www.wired.com/print/medtech/drugs/magazine/17-09/ff_placebo_effect">placebo effect</a>.  It&#8217;s gone from my blog now, but if you subscribe via a reader, the disjointed sentences likely ended up in your in tray.  Apologies.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.codalism.com/?feed=rss2&amp;p=1125</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
