Democracies and Mutual Funds

I was chatting with Rajiv today about politics and history and economics and came up with an interesting analogy.

We were talking about how monarchs can really do amazing things for a country if they’re good, but if they’re not good or just crazy they can really screw things up. In a democracy, by contrast, the people have the power but don’t necessarily know what they really want or how to get it done; things have to be voted on; nothing radical tends to happen.

So basically, democracy is like a mutual fund – low risk, medium return. Monarchy is more like an individual stock – more risky but with the potential for much higher returns. Democracies stick around while monarchies eventually get wiped out by a string of too many bad leaders.

Idea for a new thermostat design

A striking fact: most of my recent roommates — all smart enough to get into MIT — completely failed to understand our thermostat. At first, I attributed this to the notion that MIT students simply lack many everyday, non-academic skills. This may have some truth to it, but the wider conclusion is that the standard thermostat design is just not intuitive.

Here is how a thermostat works: The user sets an (unmarked) temperature with a dial or slider. From this, the thermostat extrapolates a low-temperature cutoff and a high-temperature cutoff. The low cutoff is lower than the user setting, and the high cutoff is higher than the user setting. The thermostat turns on the air conditioning if the temperature rises above the high cutoff. The air conditioning then REMAINS ON until the temperature is pushed all the way past the low cutoff. The temperature then rises naturally until it hits the high cutoff again, and again the air conditioner kicks in. (Switch “high” with “low” in the case of heating.) This makes sense technologically, because air conditioners and heaters are more efficient if they stay on for a while.
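The two-cutoff behavior described above can be sketched in a few lines of Python. This is only an illustration: the class name, the `band` parameter (how far each cutoff sits from the user setting), and the temperatures are all assumptions, not details of any real thermostat.

```python
from dataclasses import dataclass


@dataclass
class HysteresisThermostat:
    """Cooling-mode thermostat with a high and a low cutoff (illustrative only)."""
    setpoint: float            # the user's (unmarked) setting
    band: float = 1.0          # assumed distance from the setting to each cutoff
    cooling_on: bool = False   # whether the air conditioner is currently running

    def step(self, air_temp: float) -> bool:
        """Update the state for the current air temperature and return it."""
        high_cutoff = self.setpoint + self.band
        low_cutoff = self.setpoint - self.band
        if air_temp > high_cutoff:
            self.cooling_on = True     # too warm: switch the air conditioner on
        elif air_temp < low_cutoff:
            self.cooling_on = False    # pushed past the low cutoff: switch off
        # Between the two cutoffs the previous state is kept unchanged.
        return self.cooling_on
```

With a setting of 72 and a band of 1, the air conditioner switches on above 73, stays on while the room passes back through 72, and only switches off once the room falls below 71.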

The way you are supposed to refine your temperature setting with a thermostat is as follows: If you are too cold, you move the setting until the air conditioning clicks off. This is your way of saying “I don’t want it to get colder than this.” Alternatively, if you are too hot, you move the setting in the cold direction until the air conditioning kicks in. “I don’t want it hotter than this.” If your range of acceptable temperatures is less than the thermostat’s, then you will be changing your setting on every cycle – but you won’t be changing it very much.

Judging from the vast range of settings I have found my roommates leaving the thermostat in, here is how they seem to WANT to interact with it: “Right now I’m really really hot, so I’m going to turn the temperature way down. Five minutes later, I’m still pretty hot, so I’m going to turn the temperature down some more.” At this point, the temperature setting is way colder than the comfortable level, so eventually it will become very cold in the apartment and the roommate will be really really cold and thus turn the temperature way up. Cycle repeats.

So here is my new thermostat design. There is a round dial with no temperature markers — the only markings indicate the “colder” direction vs. the “hotter” direction. If the user is very very hot, they turn the dial strongly in the “cold” direction. This tells the thermostat that the user temperature setting should be set substantially below the current air temperature. Five minutes later, it has cooled down a bit, but the user is still hot. They go back to the thermostat and turn it in the “cold” direction, but not as far as before because they are less hot. The thermostat correspondingly sets a user temperature that is moderately colder than the current air temperature.

In other words, the “user temperature” is determined not by the absolute position of the dial but by the amount of turning in a given adjustment. This design allows people to indicate their level of discomfort, as they seem to want to do intuitively, and avoids the “escalation” problems that occur with the traditional thermostat design.
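Here is a rough sketch of that idea, under my own assumptions: the turn is a number between -1 (hard toward "colder") and +1 (hard toward "hotter"), and a hypothetical `degrees_per_turn` constant converts it into a temperature offset. None of these names or numbers come from a real device.

```python
class RelativeDialThermostat:
    """Sets the target relative to the current air temperature (illustrative only)."""

    def __init__(self, degrees_per_turn: float = 10.0):
        # Assumed scaling: a full turn asks for this many degrees of change.
        self.degrees_per_turn = degrees_per_turn
        self.setpoint = None  # no target until the user first turns the dial

    def adjust(self, turn: float, air_temp: float) -> float:
        """Record one dial adjustment; negative turn means 'colder'."""
        turn = max(-1.0, min(1.0, turn))  # clamp to the dial's physical range
        # The target depends on how far the dial was turned, measured from the
        # current air temperature, not on the dial's absolute position.
        self.setpoint = air_temp + turn * self.degrees_per_turn
        return self.setpoint
```

A very hot user in an 80-degree room who turns the dial 0.8 toward "colder" gets a target of about 72. If they come back when the room is at 76 and turn it only 0.3, the target becomes about 73, rather than stacking on top of the first adjustment.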

Thoughts on Pen Interfaces

I recently did a small usability study which demonstrated that a pen stroke recognition interface was not a good choice for my graph sketching application. The failure of this interface helps to explain why pen interfaces have not yet become widely used. Even for an application domain that blatantly lends itself to a sketch-like interface, and even with a fairly accurate stroke recognizer, the recognition approach was a clear loser. For one thing, users seemed disconcerted by the unpredictable nature of the stroke recognition; they were downright annoyed when the system failed to read their mind. Users also seemed stressed about having to perform more accurately in order for the system to correctly recognize their intentions.

Although improvements in software and hardware interfaces could lessen both of these problems, I think the deeper issue here is that of appropriate *constraints*. Pen interfaces tend to be highly unconstrained, which gives them flexibility and power but also makes them overwhelming, stressful, ambiguous, and often inefficient. The most obvious example is with text input: typing is faster, more satisfying, and more accurate than tablet PC handwriting precisely because typing is so much more constrained. Each button does precisely one thing: insert a particular character into the event stream. Even if I had a futuristic handwriting recognizer that recognized with human accuracy and felt as good as paper, I would still rather use a keyboard for the task of inputting characters.

A similar argument can be made for the graph sketching domain. The reason I think the arc interface turned out to be most efficient (and enjoyable) was that it provided the correct degree of constraint for the task at hand. Curves, even complex ones, are really just a series of segment endpoints and curve points (which specify the amount and direction of bulge). The arc interface in effect let users precisely and easily specify these three points to create each arc segment. If they knew what they wanted the first time around, there was no need to go back and adjust anything, and there were no surprises from the recognizer. Creating complex curves only required lifting the mouse button momentarily to indicate an upcoming change in curvature, so that the computer could display the precise desired line.
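To make the "three points per segment" idea concrete, here is one possible reading of it: treat each segment as a circular arc through the start point, the curve point, and the end point. This is an assumption made for illustration; the actual application may have used a different kind of curve, and the function name is hypothetical.

```python
import math


def arc_through(start, bulge, end):
    """Return (center, radius) of the circle through three 2D points.

    Returns None when the points are collinear, in which case the segment
    is just a straight line from start to end.
    """
    (ax, ay), (bx, by), (cx, cy) = start, bulge, end
    d = 2.0 * (ax * (by - cy) + bx * (cy - ay) + cx * (ay - by))
    if abs(d) < 1e-12:
        return None  # no bulge: the three points lie on one line
    a2 = ax * ax + ay * ay
    b2 = bx * bx + by * by
    c2 = cx * cx + cy * cy
    # Standard circumcenter formula for the triangle formed by the points.
    ux = (a2 * (by - cy) + b2 * (cy - ay) + c2 * (ay - by)) / d
    uy = (a2 * (cx - bx) + b2 * (ax - cx) + c2 * (bx - ax)) / d
    radius = math.hypot(ax - ux, ay - uy)
    return (ux, uy), radius
```

The point is that three clicks fully determine the arc, so there is nothing left for a recognizer to guess.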

The problem with true sketching is that there is ambiguity in every pen stroke, and even the most advanced stroke recognizer will not be able to read minds. The only clear way to resolve this ambiguity is to increase the number of constraints by letting the user point out exactly what they want. One approach to this specification is to display “n-best” lists of the potential options the recognizer thinks you might mean. But given the ease of simply specifying one’s intentions the first time around, and the fact that every segment is potentially ambiguous if left to a recognizer, I think there is a strong case to be made that the arc interface will be the best approach for this line-graphing task no matter the improvements in software and hardware.

More generally, I think that sketching is too under-constrained for many of the tasks that researchers have applied it to. For example, in any domain involving a small, fixed number of symbols, such as electronics or chemistry diagrams, the constrained approach would be to specify start and end points and press a button corresponding to the desired symbol. By contrast, sketching the diagram freehand takes substantial time and always has potential for recognition ambiguity. Freehand sketching may be more intuitive because that is what users are used to doing, but a more constrained interface may prove more efficient, in much the same way that typing proved more efficient than handwriting. Of course, usability studies would be required to test these hypotheses.

Conversely, the tasks that freehand sketching is good for are less constrained applications like art or solving certain math problems. These are applications with an open-ended set of symbols and diagrams that require the flexibility of stroke input. Another class of applications that merits stroke input is limited-capability mobile devices that do not have space for a lot of buttons (either “soft” or physical). But in my opinion, such devices are only a temporary solution; ultimately, we should not limit our input devices but instead figure out how to make full capabilities possible in mobile settings.

Moving Blog

I think I will move my blog here because – you have to admit – it’s more reliable than WSO.

Global warming needs a political breakthrough

What has really struck me recently about global warming is how politics is the limiting factor in solving the problem. It seems that the political system is ill-equipped for dealing with something so global and so long-term. As a scientist, I read article after article about how we have the technologies and the economic strategies to solve global warming today, if only the politicians would cooperate (by funding the technologies and implementing the regulatory strategies). As a scientist, I don’t have the tools to even begin to understand what needs to be done to actually get the politicians to cooperate. In other words, I feel helpless when it comes to addressing what seems to be *the* important factor. We need a political science breakthrough rather than a technological breakthrough.

Artificial Intelligence is about Computers making Decisions

All of intelligence – anything observable that intelligent creatures DO – that is, anything by which you can *tell* something is intelligent – is a *decision*. This could be conscious or subconscious, but a decision nonetheless. You could have done something else or said something else. You said “no” to every other option and chose whatever it is you did. I’ve always been interested in this process of decision-making.

Computer science is my methodology – in order to solve problems, I write computer programs. This is how I naturally approach research, this is what I’m good at, this is what I do.

So I think of *intelligence* as *decision making*; and thus I want *computers* to make *decisions*. In fact, AI is all about computers making decisions. I think that is a *very* deep statement. That is what makes AI both powerful and scary. Talking to a computer does not seem so scary; computers making the world’s decisions does seem scary. But a talking computer is one that *makes decisions* about what to say. I can see why using a statistical/probabilistic framework would have been disconcerting for AI pioneers trying to get computers to make the *right* decisions.

Indeed, natural language generation is part of the very *essence* of AI! The very thing that forms the basis of the Turing test! So in some sense it really is “AI-complete”. That’s exciting, but it also means I should be careful – pragmatically, I need to find research that is tractable. Perhaps a computer that makes decisions for generating natural language about something specific – like Regina’s football database. Or perhaps a computer that makes decisions for generating a specific type of natural language about something general. What might I mean by a “specific type of natural language”? I’ll have to think about that. Or go ask the linguists.

Academics is about intellectual pleasure

I just realized something: academics is pure pleasure. It’s hard to imagine that I’m saying this directly after working on my thesis all night, but it occurred to me while perusing a photography exhibit moments ago. Liberal arts academics is pure, intellectual pleasure. Or call it “quality” from the Zen and the Art of Motorcycle Maintenance perspective. We study ourselves and our world because it’s *interesting*. It feeds the mind.

A guy did a photography project on “body terrains”. He was looking at the similarity of landscapes to human bodily forms (it so happens that my girlfriend was one of these bodily forms, but that’s beside the point). He said there was a lot of “work left to do” in the field of representing landscapes as bodies. I thought, wow. This is just pure intellectual pleasure. At some point in the past, I would have disparaged it because it seems so useless from any “practical” perspective. But now I see that it is full of quality. Mind food.

Someone was saying the other day how Scott Lewis, director of the Williams Outing Club, had the best job in the world: he gets to spend his time doing what he loves, wilderness sports — and it’s even healthy! But Joe Cruz has said the same thing about Philosophy; it’s mind food; pure pleasure; intellectual delicacy. I guess it doesn’t contribute to physical health, though.

But it seems clear that this point about pure pleasure gets lost on students starting on day one. What they don’t realize is that all the assignments, the exams, the dissertations, are mostly about finding the people who are most able to intellectualize, and get the most pleasure out of doing so. This is true at least in the framework of applying to graduate school. Sure, your ability to succeed in school is related to your ability to succeed at an intellectual job. But I think I understand a little better now what that grad student at Penn meant about caring more about the research than the location. For him, the pleasure of the thoughts was more important than the pleasures of the friends or the city or whatever else is determined by the location.

I guess it’s just so ironic that students everywhere complain so much about school work, when really it’s all about pleasure.

Today I attended my very last class at Williams College. I’m going to miss this place, for the friends, the fun, and the intellectual pleasure of it all.

Networks vs. Hierarchies as representations of thought and language

My intuition says that creativity is network-based and rational thought is hierarchy-based. Note that a hierarchical tree is simply one form of network or graph.

Also, there is no fundamental difference between graphs and matrices. This could explain why natural language researchers have not been so excited about latent semantic analysis (LSA), which is simply one way of representing a graph in a format that the computer can operate on naturally. This graph represents certain relationships in “meaning” between words.
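As a small illustration of the graph/matrix equivalence, the sketch below converts a weighted, undirected word graph into an adjacency matrix and back. The words and weights are made up, the function names are hypothetical, and this shows only the representation, not LSA itself.

```python
def graph_to_matrix(edges, nodes):
    """Build a symmetric adjacency matrix from {(u, v): weight} edges."""
    index = {name: i for i, name in enumerate(nodes)}
    matrix = [[0.0] * len(nodes) for _ in nodes]
    for (u, v), weight in edges.items():
        matrix[index[u]][index[v]] = weight
        matrix[index[v]][index[u]] = weight  # undirected: mirror the entry
    return matrix


def matrix_to_graph(matrix, nodes):
    """Recover the {(u, v): weight} edges from a symmetric adjacency matrix."""
    edges = {}
    for i, u in enumerate(nodes):
        for j in range(i + 1, len(nodes)):  # upper triangle only
            if matrix[i][j] != 0:
                edges[(u, nodes[j])] = matrix[i][j]
    return edges


# Made-up example: relatedness in "meaning" between three words.
words = ["cat", "dog", "leash"]
relatedness = {("cat", "dog"): 0.8, ("dog", "leash"): 0.5}
as_matrix = graph_to_matrix(relatedness, words)
```

Converting `as_matrix` back with `matrix_to_graph` returns the original edges, which is the sense in which the two representations carry the same information.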

Finally, an LSA/graph approach may or may not be more appropriate than a hierarchy/grammar approach, depending on the specific problem to be solved. Graphs are less constrained and therefore seem more powerful, though they may also need far more computational resources. Also, human brains appear to be general networks, not constrained to hierarchies, but that may or may not be important to know.