Random Observations

Thursday, October 15, 2009

Literature not popular in the USA

I grew up in Canada, but now live in the USA. In the grand scheme of things, Canada and the USA are very, very similar. Yet even so there are odd differences. One of them being that there are books which are popular around most of the English-speaking world (and often farther) that are popular in Canada which nobody in the USA has heard of.

In some cases it is obvious why. For instance consider Yes, Minister which is one of the most brilliant descriptions ever of how bureaucracies work. However it assumes you understand the British parliamentary system. This political system is used in most places that were part of the British empire, including Great Britain, England, India and Australia. Therefore there is little surprise that while I've met plenty of fans of the series from all of those countries, virtually nobody in the USA has even heard of it.

In other cases it is less obvious to me why it is so. Consider, for instance, the comic series Asterix and Obelix. Originally written in French, the series has been translated into many, many different languages and is loved by children around the world. I've talked with fans from France who didn't know it was translated, fans from Spain who didn't realize it was in English, fans from India who didn't know it wasn't originally written in English, and so on. Yet I think I've talked with a grand total of one person from the USA who had heard of it - and I believe he learned about it while traveling through Europe!

Why would this be so? I suspect that some marketer looked at the series and said, "A comic series set in ancient France shortly after the Roman invasion? No American kid will ever go for that!" And so it was never marketed here.

I think this was just a bad decision. Certainly no American that I've lent it to (including several children) had trouble with the material. They all loved it. Besides, the books are meant to be appreciated on many levels. I can read it to my 4 year old son and he laughs at how Obelix accidentally breaks doors when he knocks at them. I loved it at 9 even though I missed most of the jokes embedded in the names. And my kids' babysitter wants me to buy more so she can read them. It really is a series that grows with you as you learn enough to understand more of the jokes.

In short you'd think that any series that is popular around the world in multiple languages is worth trying in the USA as well. But apparently publishers don't think that way. And so American audiences miss out on some really great works.

But this makes me wonder. I'm aware of these works because I grew up in Canada. But the USA and Canada are very similar. What popular works would I love that I don't know because they were never marketed in either country?

Wednesday, October 14, 2009

Why do we use checks?

Think about this scenario. I hand you a piece of paper. This piece of paper has all of the information you need to take any amount of money directly from my bank account. It has written on it the amount I wish to give you. You assume that I can indeed give you that amount, and I assume that you will not steal from me.

Does this seem like a sane thing to do? Just think of all that can go wrong here. You could steal from me. You might be honest, but someone else could take the information from you and steal from me. (This happened to Donald Knuth - people would get a check from him, scan them and post the proof, then scammers looking for pictures of checks online found them and forged checks. Knuth's reward checks are now no longer valid because of this.) I may not have money in my bank account. Perhaps I don't now, but I hope to. Banks actually give a little leeway called "float" to as a convenience to their customers. But this can backfire. If someone arranges things carefully they can bounce a series of payments between accounts, get the bank ready to say that all of the accounts have money, withdraw the money, and leave the bank holding the bag!

Obviously I've just describe a check. But, you ask, what is the alternative? Funny you should ask. Several months ago I went to the Netherlands. They don't have checks there. What do they do instead?

Well if you're my utility company and you want me to pay you, you send me the information I will need to deposit money in your bank account. I go to my bank and transfer money there. You get notified when it arrives. The technical name for this in English is giro transfer.

Now stop and think about how many problems this solves. I never hand out my bank details to anyone. You never have to deal with a bounced check. There is no possibility of anything like a kiting scheme. And the only practical change is that instead of my giving you information that can be used to draw from my account, you give me enough information to put money into your account.

The moral is that checking systems are fundamentally flawed. The design of a giro transfer system is fundamentally sound. Unfortunately tradition is set so that checks are here for a while to come. And people are honest enough that the problems don't generally rise to the point that would make people object. Sure, the security problems are obvious when you think about it. But as always when people aren't being bitten by the problems, people forget about the security implications.

Of course checks are losing popularity between credit cards, easy cash withdrawals, and automated payments. So there is hope that some day they will be seen to be superfluous and will eventually be abandoned. In the meantime checks serve as yet another example showing how little we care about security, even when it comes to our money.

Tuesday, October 13, 2009

Limitations of Capitalism

I've commented that Capitalism is the most effective way known of getting people to do what it gets them to do. However you have no control of what that is. I thought I would expand on that. In that light, here are some of the major flaws of capitalism:

There are big problems we want solved that have no associated revenue. For example consider the problem of taking care of the million or so untreatable schizophrenics in the USA. (I estimated that number by knowing that the USA has 300 million people, about 1% of people have schizophrenia at some point in their life, and of those about 1/3 spontaneously recover, 1/3 respond to treatment, and 1/3 are not treatable.) Left to their own devices these people are unable to function. No matter how loving their families, family resources get severely strained supporting them. And there is no realistic hope of integrating them into society. As a society we do not wish to kill them and do not want them starving to death, so we need to take care of them. The magnitude of the problem is more than charity can support, so this is a valid role for the government. (Total charitable giving in the USA is about $300 billion/year. If we assume that institutionalizing a person in a place with medical care costs $30,000/year, then that would suck up 10% of all charitable giving. On just one cause.)

Capitalism ignores external costs. A rational profit seeking individual who can acquire revenue and leave costs for others, will. A classic example is pollution. Pollution is a diffuse cost that is shared by an entire community and is mostly not experienced by the polluter. Therefore polluters have little incentive to reduce pollution. Government regulation can solve the problem by artificially providing the incentive.

Capitalism ignores external benefits. What do universal education, basic research, and sound policing have in common? The providers of the benefit cannot readily recoup the benefit they provide. Poor kids who will do better with an education are poor right now, their parents can afford that education. It is the nature of research that it proceeds best by sharing ideas, but when ideas are shared then there is generally little or no connection between the people who did the basic research and the people who commercialize it. Law and order is great, but the people who enforce the law aren't generally the people who build businesses that can prosper because they exist within a well-regulated society. Private markets therefore are poor at allocating any of these things.

Capitalism has perverse incentives with asymmetric information. As an individual you are in an extremely poor position to judge whether cooks at a local restaurant wash their hands, the security of a safe, or how effective a medical treatment is. So people judge on the basis of things they can see that they hope are a good proxy for what they want to know. Such as the quality of the decor, how sturdy the safe looks, and the presentation of the person selling you the treatment. But you can fake those without providing the attribute people really want. As a result private markets left to their own devices will provide unhealthy restaurants, insecure products, and unreliable treatments. (Government is good about public health, only rarely intervenes in security, and frequently doesn't pay attention to effectiveness of treatments. If you know where to look, this shows.)

None of this is to say that we want or need government intervention in everything. As I said, capitalism is incredibly effective at generating economic activity, much of which benefits us all. Besides, government has its own characteristic failure modes, which are at least as bad, if not worse, than the ways that capitalism fails.

However when there is a solid justification for government intervention, usually one of these failure modes in free markets is behind it. And if you can identify no specific reason why free markets should do a bad job in that area, then odds are good that the effects of the government intervention is somewhere between ineffective and bad.

Monday, October 5, 2009

Not done is nothing

I have a problem. I'll start personal projects, get to the point where I am satisfied that I could finish it, then don't. Sometimes it takes quite a bit of work to get there. For instance consider this explanation of the Kelly criterion for optimal betting patterns. I did quite a bit of work on it, created a calculator that could be useful. In addition I began a library to let me do basic differential calculus and linear algebra in JavaScript, and worked out exactly how to build numerical approximations of optimal betting patterns. Then I got distracted.

I began this blog. Then I got distracted. OK, it is the nature of blogs to never be done, and I certainly couldn't keep up the pace I started with, but I still could have done more.

The side project I now spend time thinking about is a design for a web development platform that I think would be really powerful. It could be a revolutionary way to work..if it ever gets completed.

I have no shortage of excuses. I have very little time for any side project once you subtract working full time, helping my ex-employer on the side, and taking care of my young children. I am interested in a large variety of things, and so have no shortage of distractions. Furthermore I really do constantly learn new stuff.

None of this is to say that I can't finish things. I am great at finishing things when I have an external reason to. I finish things for my employer all the time. I put a lot of work into my Effective A/B Testing tutorial, and a lot of people have been happy with the result.

However when I am doing something out of personal interest, I lose interest once I've completely learned how to do it. And as a result nobody else benefits from my efforts.

In short, I'm pretty much the opposite of being a member of The Cult of Done. :-(

Tuesday, September 29, 2009

What makes it science?

Many people draw a division between the hard sciences and mathematics on the one hand, and everything else on the other. The implication being that one side is "really" science and the other is not. Which claim to upset members of "soft sciences" like psychology.

This post explores the question of how justified this division is.

The starting point for my thinking is something I learned from the wonderful essay In Oldenburg's Long Shadow about the serial pricing crisis.

To understand the serial pricing crisis you must first understand the science citation index. This is nothing more or less than an index of how many times a given paper has been cited by other published papers. When you look at it, you find that papers that appear in some journals are consistently cited more often than others. This is a direct measurement of how influential the journal is, and leads to the impact factor. When you look at journals across math and the hard sciences you find that there is a hierarchy of journals. At the bottom you have low impact journals that publish only for a small niche. But key papers from that niche are published in more prominent journals that are looked at by people in a wider range of subjects. And this goes all of the way to the highest impact journal of all, Nature, which is where people try to publish the absolute best work across all of math and the hard sciences. It doesn't matter whether you're a physicist or a biologist, the absolute best research goes to Nature.

So from the science citation index we can measure the impact factor of a journal, which in turn tells us its value to researchers. The value to researchers told publishers what universities were willing to pay, and so publishers have been steadily increasing the price of the most important journals. This costs universities more than they are happy with, and so is called a crisis. Librarians call journals "serials" (because you get a series of copies of a journal), hence this crisis is called the serial pricing crisis.

Now let's look at this in reverse. Across all of math and the hard sciences it is possible to make a somewhat reasonable comparison of how important any given paper is, and how good any given journal is. Furthermore there are small groups of editors whose judgment is regularly trusted to compare the best papers from different areas of science and select which are worthy to be in their journals. In the extreme example, the editors of Nature are trusted to draw comparison across all of the hard sciences. And for the most part, scientists agree with these decisions.

When you think about it, it is truly remarkable. It implies that there is a relatively well shared concept of relative value across all of the sciences. Of course people in the hard sciences seldom remark on it, it is just how things are.

To see how remarkable it is, compare with the humanities and social sciences. They have no such hierarchy. Instead of one grand hierarchy you get independent clumps of researchers who talk to each other but not the other groups. And they find this so natural that I have seen social scientists express disbelief that, say, a physicist in fluid mechanics can hear a key result in particle physics and will know that it is important. But it is true. If you ask one physicist, "What are the 10 most important results in physics in the last 30 years" and take that list to another, the other physicist will agree that those are all important. If you ask a psychologist for a similar list and take it to another, the other is likely to not even recognize many of the items.

What is going on here is a confirmation of one of Thomas Kuhn's key claims in The Structure of Scientific Revolutions. Which is that in a mature science (his term) researchers have come to share a paradigm about what would be progress. When a paradigm has become so compelling that virtually all researchers in the area accept it, then people who are not in that field can see the agreement that progress is happening. When no paradigm can compel general acceptance, then from a distance all that is visible is confusion.

Kuhn is careful to point out that within the field there will be groups of researchers who are doing good work and making progress. This is certainly is true. For instance I've brought up psychology. Yet if you read books like Parenting From the Inside Out you will find that solid research is being done, that comes up with valuable information. (I highly recommend this book to anyone, parent or not, who is willing to work through it carefully.) But the problem is that the case for this line of research is not compelling enough to convince other psychologists that this is the right way to try to understand the mind. So from a distance there isn't a clear impression of solid progress being made.

Therefore the hard/soft science division boils down to shared paradigms. In the hard sciences certain lines of research have become so compelling that everyone agrees that they are the right way to go. Because of this agreement, people in nearby fields get a clear picture of what progress looks like in that field. With clear pictures of what progress looks like in multiple fields, the ground is set for making comparisons between fields, which has evolved into a reasonably well shared value system across the entirety of the hard sciences.

The soft sciences share none of this structure. As a result there is no shared agreement within the soft science about what is important, let alone a shared agreement on the relative importance of different areas of science.

To close I would like to illustrate how much shared agreement there is within the hard science about what progress looks like. I'll do this by giving my personal top 10 lists of scientific advances in each century since science began to take off in the 1600s. I haven't tried to put them in any particular order. (They often are somewhat chronological.) While people may quibble with some of my specific choices, people who are well versed in the hard sciences will generally agree on the importance of these items.

1600s
1. Objects of different mass fall at the same rate (Galileo, physics)
2. Telescope used for astronomy (Galileo, astronomy)
3. Kepler's laws for planetary orbits (Kepler, astronomy)
4. Circulatory system accurately described (Harvey, biology)
5. Microbes discovered (Leeuwenhoek, biology)
6. Hooke's law of elasticity (Hooke, physics)
7. Newton's laws of motion (Newton, physics)
8. Newton's law of gravity (Newton, physics)
9. Speed of light first measured (Ole Römer, astronomy/physics)
10. Calculus (Newton/Leibniz, mathematics)

1700s
1. Lightning explained as static electricity (Ben Franklin, physics)
2. Fluid mechanics began to be analyzed (Bernoulli, physics)
3. Linnaean taxonomy system created (Linnaeus, biology)
4. Halley's comet's orbit predicted (Halley, astronomy)
5. Coulomb's law for attraction of electric charges (Coulomb, physics)
6. Oxygen discovered (Priestly/Scheele, chemistry) leading to the rejection of pholostigon (Lavoisier)
7. Uranus discovered (William Herschel, astronomy)
8. Conservation of mass demonstrated (Lavoisier, chemistry)
9. Stability of solar system confirmed (Laplace, astronomy)
10. Gravitational constant measured (Cavendish, physics)

1800s
1. Fourier series discovered, used to analyze heat transport (Joseph Fourier, mathematics/physics)
2. Ice ages discovered, theory of The Flood rejected (Louis Agassiz, geology)
3. Central Limit Theorem aka The Bell Curve (de Moivre/Laplace/Galton/Lyapunov etc, statistics) different versions were proven at different times, and Galton was making good use of it years before it was finally proven in generality by Lyapunov
4. Thermodynamics (many people starting with Carnot, physics)
5. Conservation of Energy (Joule/Mayer, physics)
6. Descent with Modification aka Evolution (Darwin, biology)
7. Germ theory (Pasteur, biology)
8. Atomic theory (Avagadro/Loschmidt etc, chemistry)
9. Maxwell's equations of electromagnetism (James Maxwell, physics)
10. Periodic table (Mendeleev, chemistry)

1900s
1. Relativity (Einstein, physics)
2. Radioactive Dating (Ernest Rutherford/Bertrand Boltwood, physics)
3. Quantum Mechanics (Heisenberg/Schrödinger, physics)
4. Gödel's Incompleteness Theorem (Kurt Gödel, mathematics)
5. Hypothesis Testing (Ronald Fisher/Jerzy Neyman/Karl Pearson/Egon Pearson, statistics) - Egon was Karl's son
6. The Structure of DNA (Watson/Crick/Franklin, biology)
7. Continental Drift (proposed Wegener and confirmed by lots of people at once, geology)
8. The Big Bang (Georges Lemaître/Edwin Hubble, astronomy) general acceptance followed the discovery of the CMBR by Arno Penzias and Robert Wilson
9. Synthesis of the Elements in Stars aka B²FH (Geoffrey Burbidge/Margaret Burbidge/William Fowler/Fred Hoyle, astronomy/physics)
10. Standard Model (Sheldon Glashow/Steven Weinberg/Abdus Salam, physics) tens of billions of dollars have been spent verifying this theory!
2000s - There is likely more disagreement over these

Monday, September 28, 2009

Teaching linear algebra

In a recent Hacker News post I made reference to an interesting teaching experience I had in the mid-90s. This is a longer explanation of the same.

I was a graduate student in math at Dartmouth College. I wound up teaching an introduction to linear algebra course that was also the first course where students were asked to do proofs. The class was somewhere in the range of 15-20 students. If I remember correctly, this was in the fall of 1996.

In preparation for the class I set myself goals around how well the students would learn the material taught. After some thought I settled on four ideas that I would use:

Homework not present at the start of class would not be accepted. However students were only graded on the best 20 out of 27 possible homework sets.
All homework sets were cumulative. Generally 1/3 was the current day's material, 1/3 from the last week, and 1/3 from anywhere in the course. Those thirds were in increasing order of difficulty.
Every class would start with a question and answer session to last no less than 10 minutes.
Every student could expect to be asked at least one question every other class.

These ideas may seem odd, but there was a method to my madness. Here is each idea explained.

Homework not present at the start of class would not be accepted. However students were only graded on the best 20 out of 27 possible homework sets.

The point was to make sure that class started on time, with everyone ready to pay attention for question and answer time. I also didn't want to deal with people doing homework during lecture, evaluating sick excuses, etc. The leniency of not having to turn in 7 homework sets compensated for the rigidness of the policy. And cumulative homework sets meant that I didn't have to worry about students not practicing any given day's material.

This worked even better than I hoped. The downside was that I had an argument on the second day when someone came in 2 minutes late and was not allowed to turn in his homework. But the first complaint was the last, and the students liked the freedom to decide when something else took precedence over doing homework.
All homework sets were cumulative. Generally 1/3 was the current day's material, 1/3 from the last week, and 1/3 from anywhere in the course. Those thirds were in increasing order of difficulty.

This was the most important idea I wanted to try. I had long been aware that research on memory had demonstrated that when you're reminded of something as you're forgetting it, it goes into much longer term memory. As a result periodic review at lengthening intervals is very effective in increasing long term recall. A typical effective study schedule being to review after half an hour, the next day, the next week, then the next month.

Now of course you can tell students this until you're blue in the face - but they won't do it. However when the study schedule is disguised as homework, they don't have a choice.

This really seemed to work. What I noticed on tests is that students were noticeably shaky on material they had learned in the previous week, occasionally didn't remember stuff for a half-month before that, but absolutely nailed every concept that they'd first learned at least 3 weeks earlier. I credit the forced review schedule from cumulative homework sets for much of that.
Every class would start with a question and answer session to last no less than 10 minutes.

For me this was the most important part of the class. The questions that came up in this session were my opportunity to refresh people on what they were forgetting, and were how I kept track of what topics should come in for more review on future homework sessions. Given my knowledge of how critical review is to learning, I honestly felt that time spent answering questions was more valuable than lecture. As long as there were questions, there was no maximum on how much time I was willing to spend on this.

Of course the challenge is getting students to ask questions. My strategy was simple: I told them that someone will ask questions and someone will answer them, but they don't want me to be the one asking questions. On the second day nobody asked me any questions and I had to demonstrate. I picked a random person and asked her to explain a key point from the first day's lecture. She couldn't. I asked another student the same question. Again difficulty. I asked if everyone was sure that they had no questions. Someone asked me the question that I had been asking everyone else. I answered the question, answered the follow-up, and the point was made. I never again had to ask a question during question and answer period. :-)
Every student could expect to be asked at least one question every other class.

My goal here was to be sure that every student was awake and following the lecture. It was never my goal to embarrass anyone or put them on the spot. To that end I developed a rhythm. Every few minutes I'd stop, say, "Let's make that a question," ask the question, pause so everyone could think through the answer, then ask a random person the question. I made sure to rotate people around so that everyone got their turn fairly.

The questions I'd ask were always straightforward. They were things like, "What is the result of this calculation?" Or, "Why is this step OK?"

I treated failure to get the answer as my failures, not theirs. If they couldn't get the answers then they weren't following the lecture, and I needed to slow it down, figure out the rough spots, etc. It might seem that the constant interruptions were slow. But I found that having everyone pay attention more than made up for it. The class as a whole moved as fast as any other class - but with far greater comprehension. And the interactivity made the class become very open about asking questions.

As a bonus I managed to convince the entire class that taking notes was not worthwhile. I learned this lesson about math in first year undergrad. What you do is read ahead in the textbook. If you really want a set of notes, you can make them from the textbook before class. Then show up at class having read the day's material and ready to pay attention. Then if anything that the professor says doesn't make sense to you when you're paying attention and have already read the day's lesson, then ask the question then and there. If you don't understand it, then probably nobody else does either. Add to that periodic reviews, and you'll have a huge edge in any math courses.

Nobody ever believes that that works. But this class had no choice because there is simply no way to take notes and pay attention at the same time. Which meant that the note takers couldn't answer questions. But within a few days they learned to not take notes, and I believe did much better for it.

So how well did this package work? As far as my goals were concerned, much better than I had dreamed possible. What really brought this home was the final exam. Based on class performance I drew up a test that I though was a fair test of what I thought they understood. I showed it to some fellow graduate students. They thought I was crazy. They thought the class would bomb, and were willing to bet me on whether anyone would get the bonus question.

The class aced the test. That bonus question? 70% of the class got it. I don't remember what the bonus question was, but I do remember another one that I thought was cute. It went like this. Let V be the vector space of all polynomials of degree at most 2. a) Prove that d/dx is a linear operator on V. b) You can put a coordinate system on V by mapping p(x) to (p(0), p(1), p(2)). (Please imagine that flipped 90 degrees so it is a column.) Find the matrix that represents d/dx in this coordinate system. My fellow grad students got me worried that this might be too advanced for an introductory linear algebra courses. But I needn't have worried - the only significant errors were minor arithmetic mistakes in the calculation. And I think I dinged someone for not having enough detail in the proof.

Furthermore I was lucky enough to talk to some of my students about the experience a few months later. The general consensus was that the material really stuck. Furthermore nobody studied for the final. No joke. As one girl said, "I tried studying because I thought I should, but I gave up after a half-hour because I already knew it all." That is how I think it should be - if you study properly through the course, then you won't need to study for the final. Because you've already learned it. And you'll have a leg up on the next course because you still remember the material that everyone else has forgotten.

So were there any downsides? Unfortunately there were some big ones. I had set goals around learning. I failed to set any around happiness. Having to pay attention during class was hard on the class. Also it motivated them to work hard. Since everyone worked hard and they thought that I was going to grade them on a curve, there was a lot frustration that they wouldn't properly be recognized for their work. (In fact I gave half of them A's in the end.) This frustration showed up the teacher evaluations at the end of the course. :-(

Therefore if I had to do it over I'd ask somewhat fewer questions, hand out a lot more compliments, make it clear that I would not grade on a curve, and if they performed anything like that first class, I'd be even more liberal with good grades. Of course the point is moot since I've found myself profitably displaced from math to software development. But if anyone decides to replicate my experience, I'd recommend paying more attention than I did to those issues.