Come to EVIA, see California for free!

February 8th, 2010

I’ve recently developed a morbid interest in the literature of financial disasters. In this vein, I’ve just finished reading Kindleberger et al., Manias, Panic, and Crashes: A History of Financial Crises — an interesting book, but remarkable for the confused way the writing is organized, with constant re-iterations, digressions, and non-sequiturs, as if someone had taken several working revisions, cut them up, dropped them on the floor, and then re-assembled the pieces without too much care for their coherence.
Read the rest of this entry »

Guilty pleasures

January 8th, 2010

Bertrand Russell’s practically-minded grandmother used to disapprovingly refer to his studies of higher mathematics as “this life you are leading”. I have my own bookish degeneracy, namely, books. When at work I think with guilty pleasure of getting home, getting into my armchair, and reading. Besides reading, my preferred pastime is — being at work. Staying home, on holiday, renders me liable to impressment into the sort of extended, aimless social activity that leaves me feeling vaguely psychopathic. My partner and I are in the happy equilibrium of being able to return home, me from work, her from the beach, and commiserating each the other on the rigours endured. My officemate Anh artfully combines the two activities; he feigns work so as to get time away from his family at the weekend, in the office, reading novels.

Accidental publication

January 6th, 2010

I am in the habit of summarizing interesting articles as draft postings on my blog, in order to get the content of the article into my head. Generally, I don’t go on to fully polish and publish the summarizing post. But if I absent-mindedly hit the “publish” button when what I want is “save draft”, then I do end up publishing the post, without any polishing. This happened to my notes on a Wired article on the rise and rise of the placebo effect. It’s gone from my blog now, but if you subscribe via a reader, the disjointed sentences likely ended up in your in tray. Apologies.

RIA Dataset

December 17th, 2009

Alistair passed me a copy of the latest issue of Information Retrieval, which is devoted to reports from the Reliable Information Access workshop. The workshop was run in 2003, and the reports are being published in 2009, so we are not discussing breaking news here. Still, the concept of the workshop was very interesting: invite a dozen leading information retrieval research groups to a six-week, on-site experiment employing seven different retrieval systems, to tackle (broadly speaking) the question of why information retrieval technology is not improving. There were two specific subtasks: an intensive failure analysis of why retrieval systems, individually and collectively, performed poorly on certain topics and not on others; and a multi-dimensional exploration of the effectiveness, limits, commonalities, and differences of pseudo-relevance feedback techniques, one of the few promising general-purpose retrieval techniques that go beyond keyword matching.

Read the rest of this entry »

Philosopher Kings

December 7th, 2009

Is the notion of establishing a neo-Spartiate state ruled by technocratic Philosopher Kings, after the fashion of Plato’s Republic, on everyone’s lips at the moment, or is it just the social circle I move in?

Switching envelopes for fun and profit

December 1st, 2009

I hold two envelopes, and tell you that one holds twice as much money as the other. I then let you choose and keep one. You are about to take the envelope in my left hand, but then decide to do a quick probability calculation. Let the amount of money in the left envelope be A. Then the amount in the right envelope is either 2A or A/2. Now, since you picked the left envelope at random, these two amounts are equally likely for the right envelope. Therefore, your expected gain in switching to choosing the right envelope is (0.5 * 2A) + (0.5 * A/2) = 1.25A, whereas your gain in stick with the left envelope is A. So, logically, you should switch. But then, if you’d chosen the right envelope first, then the same logic would dictate that you should switch to the left one. And that can’t be right. So what’s wrong in your reasoning?
Read the rest of this entry »

Malaysian latex exports

November 30th, 2009

OK, stick with me on this one. Latex is derived from the rubber plant. Rubber is one of Malaysia’s chief exports. There is a widely-used typesetting tool called LaTeX. Therefore (pay attention to the clever link here), it is only natural that the Malaysian LaTeX User Group blog should be one of the most fruitful and creative out there when it comes to clever LaTeX hints and tricks, particularly relating to creating rich layouts and presentations. Highly recommended.

Which of these terms is not like the topic?

November 20th, 2009

Via Jeff Dalton and the LingPipe blog, comes Reading Tea Leaves: How Humans Interpret Topic Models, an enjoyable paper on evaluating the meaningfulness to humans of the topics that topic models like Latent Dirichlet Allocation produce. The authors come up with a couple of interesting evaluation tasks, which they execute on Mechanical Turk. The one I particularly liked was to introduce an alien word into the word list of a topic, and then see whether the user can locate the intruded word — something like the old “which of these things is not like the other” song/game on Sesame Street. Their finding is that human evaluation and the automated or statistical evaluation techniques generally used do not agree on which topic model is the most effective; but it is the inventiveness of the method they developed to test the human interpretation of topic coherence that I particularly liked.

ACSSGIRIT

November 10th, 2009

According to Tetsuya Sakai, who runs information retrieval research throughout greater Asia via a number of front organizations, AIRS 2010 will be hosted by the National Taiwan University in Taipei, which is great, because it gives me an excuse to visit Taipei again. Tetsuya also announced that AIRS is changing its name from the Asian Information Retrieval Symposium to the Asian Information Retrieval Societies conference. The change is a valid one, since AIRS really is a conference, not a symposium (that is, it is an occasion for carrying things together, not a drinking party). However, the name change has had the interesting side effect of bringing into existence a whole new class of organization, namely an Asian Information Retrieval Society.

Read the rest of this entry »

Apologies

November 9th, 2009

Sorry:

A.) For not blogging frequently during CIKM. I was exhausted at the end of day one, drunk at the end of day two, and lazy at the end of day three. I’ll write up some notes later, but I realize that they’ll lack that tweet-like freshness.

B.) That my blog went down. My swap partition decided to go on holiday on someone else’s VPS. We seem to be back now.

C.) To Professor Hara. While your eyes do look like the moon reflected in a lake at night, I now realize that this was inappropriate to say to a married man over the banquet table.