Origin of life video
I just did a reddit Science AMA today, and whenever you talk about evolution, invariably the question of the origin of life comes up. It's not my field, as I am not a chemist, but here is a great video explaining a model of how life could have formed spontaneously from chemical elements in the pre-biotic Earth. It is based on research from Jack Szostak's lab.
From now on I will be pointing to this whenever I'm asked how the first cells came about.
Evolutionary dynamics in holey fitness landscapes
What do real fitness landscapes look like? Do they look more like the image on the left, a nearly-neutral holey fitness landscape, or the one on the right, a rugged fitness landscape with many distinct peaks?
Those are only in two dimensions, so the question is also if depicting anything in two dimensions conveys intuitions that are at all correct.
Holey fitness landscapes (Gavrilets and Gravner, 1997, Gavrilets 1997) are approximations of real fitness landscapes where all genotypes are assigned a fitness value of either zero or one. After normalizing fitnesses to be between zero and one, those that are lower than one are assigned a fitness of zero1. Because real fitness landscapes are of extremely high dimensionality2, and assuming that genotypes have fitnesses that are randomly distributed3, it follows that there exist a nearly-neutral network of genotypes connected by single mutations that has fitness (effectively) equal to one.
The proposition is then that this holey landscape model is a good approximation of real fitness landscapes. It hypothesizes that the evolutionary dynamics on real fitness landscapes is similar to that on holey landscapes, and that distinct peaks like in the image on the right do not really exist. And this is a testable prediction.
Take a look at these videos. They depict populations evolving in two-dimensional fitness landscapes at a very high mutation rate. (You can also download the videos from my research website.)
In all three cases the population size is 2304 (that's (3*16)2, in case you're wondering), mutation rate is 0.5, the grid is 200x200 pixels (i.e. genotypes), and mutations cause organisms to move to a neighboring pixel. Ten percent of the population is killed every computational update (which gives an approximate generation time of 10 updates), and those dead individuals are replaced by offspring from the survivors selected with a probability proportional to fitness (asexual reproduction). Top: neutral landscape where all genotypes have the same fitness. Middle: Half-holey landscape with square holes of 10% lower fitness (size of holes is 14x14 pixels). Bottom: Holey landscape where the genotypes in the holes have fitness zero.
The proposition is that the dynamics of the populations should be the same no matter how deep the holes are. The populations in the half-holey and in the holey landscapes should evolve in comparable ways if the holey landscape is a good approximation.
So what do you think?
What I think is that the evolving population in the top (neutral) and middle (half-holey) landscapes resemble each other, whereas they look nothing like the bottom (holey) landscape. In the half-holey landscape the population takes advantage of the holes all the time, meaning that many individuals who are in them reproduce, even though they have a clear fitness disadvantage. The lesson is that being disadvantaged is just okay, and populations can easily cross quite deep valleys in the fitness landscape. But obviously not when the valleys consist of genotype with zero fitness; evolution in holey landscapes is much impeded compared to rugged landscapes, which is why I think they are not a good approximation.
Caveats: These populations are evolving at a very high mutation rate. When I redid it with a much lower mutation rate (0.05), the neutral and half-holey landscapes stop resembling each other, and the half-holey and holey landscapes look more alike. However, evolution happens so slowly in this case that it is difficult to distinguish the dynamics, so the matter is unresolved so far (however, I have other evidence that lower and more realistic mutation rates do not change this conclusion - some preliminary data in Østman and Adami (2013)). A second caveat is that the whole holey landscape idea relies on the fitness landscape being multidimensional, and so how can I even allow myself to compare evolution of populations in half-holey and holey landscapes in just two dimensions? That is valid question: the intuitions we get from these animations may lead us to think we know something about evolution in multi-dimensional landscapes, while the original premise of Gavrilets' idea was that we exactly cannot. Unfortunately, while this is an empirical question - meaning that it could be tested - the holey landscape model posits that the neutral network appears at very high dimensionality. What this dimensionality is is unclear, so even if I were to evolve populations in 2,000 dimensions (which is not computationally feasible - the limit is a little over 30 binary loci), one could always claim that not even that many are enough. Sighs.
1 Genotypes with fitness greater than 1 divided by the population size, N, are effectively the same, because selection cannot "see" differences smaller than 1/N.
2 High dimensionality means a large number of genes (loci) or number of nucleotides.
3 We already know that this is not a very good assumption, as there are indications that fitness landscapes are non-randomly structured with high fitness genotypes clustered with other fit genotypes (Østman et al, 2010), but we don't know if it is enough to render the holey landscape model useless.
References
Gavrilets S, and Gravner J (1997). Percolation on the fitness hypercube and the evolution of reproductive isolation. Journal of theoretical biology, 184 (1), 51-64 PMID: 9039400
Gavrilets S (1997). Evolution and speciation on holey adaptive landscapes. Trends in ecology & evolution, 12 (8), 307-12 PMID: 21238086
Østman B and Adami C (2013). Predicting evolution and visualizing high-dimensional fitness landscapes, in Recent Advances in the Theory and Application of Fitness Landscapes" (A. Engelbrecht and H. Richter, eds.). Springer Series in Emergence, Complexity, and Computation DOI: 10.1007/978-3-642-41888-4_18
Those are only in two dimensions, so the question is also if depicting anything in two dimensions conveys intuitions that are at all correct.
Holey fitness landscapes (Gavrilets and Gravner, 1997, Gavrilets 1997) are approximations of real fitness landscapes where all genotypes are assigned a fitness value of either zero or one. After normalizing fitnesses to be between zero and one, those that are lower than one are assigned a fitness of zero1. Because real fitness landscapes are of extremely high dimensionality2, and assuming that genotypes have fitnesses that are randomly distributed3, it follows that there exist a nearly-neutral network of genotypes connected by single mutations that has fitness (effectively) equal to one.
The proposition is then that this holey landscape model is a good approximation of real fitness landscapes. It hypothesizes that the evolutionary dynamics on real fitness landscapes is similar to that on holey landscapes, and that distinct peaks like in the image on the right do not really exist. And this is a testable prediction.
Take a look at these videos. They depict populations evolving in two-dimensional fitness landscapes at a very high mutation rate. (You can also download the videos from my research website.)
In all three cases the population size is 2304 (that's (3*16)2, in case you're wondering), mutation rate is 0.5, the grid is 200x200 pixels (i.e. genotypes), and mutations cause organisms to move to a neighboring pixel. Ten percent of the population is killed every computational update (which gives an approximate generation time of 10 updates), and those dead individuals are replaced by offspring from the survivors selected with a probability proportional to fitness (asexual reproduction). Top: neutral landscape where all genotypes have the same fitness. Middle: Half-holey landscape with square holes of 10% lower fitness (size of holes is 14x14 pixels). Bottom: Holey landscape where the genotypes in the holes have fitness zero.
The proposition is that the dynamics of the populations should be the same no matter how deep the holes are. The populations in the half-holey and in the holey landscapes should evolve in comparable ways if the holey landscape is a good approximation.
So what do you think?
What I think is that the evolving population in the top (neutral) and middle (half-holey) landscapes resemble each other, whereas they look nothing like the bottom (holey) landscape. In the half-holey landscape the population takes advantage of the holes all the time, meaning that many individuals who are in them reproduce, even though they have a clear fitness disadvantage. The lesson is that being disadvantaged is just okay, and populations can easily cross quite deep valleys in the fitness landscape. But obviously not when the valleys consist of genotype with zero fitness; evolution in holey landscapes is much impeded compared to rugged landscapes, which is why I think they are not a good approximation.
Caveats: These populations are evolving at a very high mutation rate. When I redid it with a much lower mutation rate (0.05), the neutral and half-holey landscapes stop resembling each other, and the half-holey and holey landscapes look more alike. However, evolution happens so slowly in this case that it is difficult to distinguish the dynamics, so the matter is unresolved so far (however, I have other evidence that lower and more realistic mutation rates do not change this conclusion - some preliminary data in Østman and Adami (2013)). A second caveat is that the whole holey landscape idea relies on the fitness landscape being multidimensional, and so how can I even allow myself to compare evolution of populations in half-holey and holey landscapes in just two dimensions? That is valid question: the intuitions we get from these animations may lead us to think we know something about evolution in multi-dimensional landscapes, while the original premise of Gavrilets' idea was that we exactly cannot. Unfortunately, while this is an empirical question - meaning that it could be tested - the holey landscape model posits that the neutral network appears at very high dimensionality. What this dimensionality is is unclear, so even if I were to evolve populations in 2,000 dimensions (which is not computationally feasible - the limit is a little over 30 binary loci), one could always claim that not even that many are enough. Sighs.
1 Genotypes with fitness greater than 1 divided by the population size, N, are effectively the same, because selection cannot "see" differences smaller than 1/N.
2 High dimensionality means a large number of genes (loci) or number of nucleotides.
3 We already know that this is not a very good assumption, as there are indications that fitness landscapes are non-randomly structured with high fitness genotypes clustered with other fit genotypes (Østman et al, 2010), but we don't know if it is enough to render the holey landscape model useless.
References
Gavrilets S, and Gravner J (1997). Percolation on the fitness hypercube and the evolution of reproductive isolation. Journal of theoretical biology, 184 (1), 51-64 PMID: 9039400
Gavrilets S (1997). Evolution and speciation on holey adaptive landscapes. Trends in ecology & evolution, 12 (8), 307-12 PMID: 21238086
Østman B and Adami C (2013). Predicting evolution and visualizing high-dimensional fitness landscapes, in Recent Advances in the Theory and Application of Fitness Landscapes" (A. Engelbrecht and H. Richter, eds.). Springer Series in Emergence, Complexity, and Computation DOI: 10.1007/978-3-642-41888-4_18
Why do you believe?
Important update April 22nd, 2014: I have been corrected on the usage of the phrases "believe in" and "believe that". In the first sentence below I wrote "believe in something", but what I really meant was "believe that something". Thus, saying "I believe in evolution" is wrong, because it is not a matter of faith. People often respond to this by saying "No, I don't believe in evolution. I accept that evolution is true based on the overwhelming evidence." I have objected to this on several occasions before, but now see the difference. I prefer to say "I don't believe in evolution, I believe that evolution is true (based on the evidence)".
If you say you believe in something, what is it that you mean by that? For example, if you say you believe you will find a hundred-dollar bill today, what is that belief built upon?
I posit that what you actually believe (as opposed to what you say you believe) is really based on probabilities. Perhaps not accurately so, but all future events are of course unknown, though some come very close to 100% certainty, and it is thus really the only way to predict anything with any kind of accuracy.
Trivial example: If you roll a die, you might say that you believe you will get a six. But if we assume this is a fair die, you must assess that the chance is about one in six, so your belief should really be that you do not get a six. In that case you really can't have a rational belief that any one side will come up, though you do of course know with almost 100% certainty (i.e., 100% probability) that one of the six numbers will come up (the die could land on an edge).
In science we make models. That is the essence of the scientific endeavor. A model is basically some explanation of something; hypotheses and theories at opposite ends of a spectrum of explanatory depth are models. So if you as a scientists say you cannot imagine how something would work, how something could have happened, etc., then you basically aren't doing your job. People who dismiss science as rigid and uncreative have not understood what it really entails. Coming up with explanations is among the most creative things people can do. If you're writing a work of fiction, it would surely be defeat if you cannot think of a way to make something click in the story. Same thing with science. If you observe something and you can't imagine how that could occur, then get to work!
Positing a hypothesis is emphatically not the same as "believing" it to be true. That I come up with one hypothesis to explain something doesn't mean that I think it is the most likely explanation, nor does it mean that I can't come up with anything else.
For example, why do I think John Travolta said Adele Dazim when referring to Idina Menzel at the Oscars?
Hyp1: He was high as a kite and just mixed up some slightly related names.
Hyp2: Someone played a joke on him and told him that was her name.
Hyp3: It was on purpose because he thinks that Adele Dazim is a prettier name for her.
Hyp4: He wanted to rock the establishment to secure a role in a new indie film.
Hyp5: He has an occasional speech impediment.
Hyp6: It's a special Scientology accent.
I can assign probabilities to each of these. It may not be accurate (they don't need to add to one, and can actually add to more than one since they overlap somewhat), but at least approximate values or perhaps just a relative ranking of them. Each of these hypotheses could be tested, and my personal belief doesn't really have much to do about anything. I don't yet have any evidence either way, and evidence is of course the only thing we can really base rational belief on, though oftentimes that evidence is reflected in theories about, say, human behavior, in which case I can say I find Hyp1 more likely than Hyp6, because I have seen evidence of only one of them happening before.
The answer is evidence. If you have none, they you have no reason to believe anything. Hypothesizing is not the same as believing, and can be done freely without repercussions. At least that is how things ought to be, everywhere and always.
If you say you believe in something, what is it that you mean by that? For example, if you say you believe you will find a hundred-dollar bill today, what is that belief built upon?
I posit that what you actually believe (as opposed to what you say you believe) is really based on probabilities. Perhaps not accurately so, but all future events are of course unknown, though some come very close to 100% certainty, and it is thus really the only way to predict anything with any kind of accuracy.
Trivial example: If you roll a die, you might say that you believe you will get a six. But if we assume this is a fair die, you must assess that the chance is about one in six, so your belief should really be that you do not get a six. In that case you really can't have a rational belief that any one side will come up, though you do of course know with almost 100% certainty (i.e., 100% probability) that one of the six numbers will come up (the die could land on an edge).
In science we make models. That is the essence of the scientific endeavor. A model is basically some explanation of something; hypotheses and theories at opposite ends of a spectrum of explanatory depth are models. So if you as a scientists say you cannot imagine how something would work, how something could have happened, etc., then you basically aren't doing your job. People who dismiss science as rigid and uncreative have not understood what it really entails. Coming up with explanations is among the most creative things people can do. If you're writing a work of fiction, it would surely be defeat if you cannot think of a way to make something click in the story. Same thing with science. If you observe something and you can't imagine how that could occur, then get to work!
Positing a hypothesis is emphatically not the same as "believing" it to be true. That I come up with one hypothesis to explain something doesn't mean that I think it is the most likely explanation, nor does it mean that I can't come up with anything else.
For example, why do I think John Travolta said Adele Dazim when referring to Idina Menzel at the Oscars?
Hyp1: He was high as a kite and just mixed up some slightly related names.
Hyp2: Someone played a joke on him and told him that was her name.
Hyp3: It was on purpose because he thinks that Adele Dazim is a prettier name for her.
Hyp4: He wanted to rock the establishment to secure a role in a new indie film.
Hyp5: He has an occasional speech impediment.
Hyp6: It's a special Scientology accent.
I can assign probabilities to each of these. It may not be accurate (they don't need to add to one, and can actually add to more than one since they overlap somewhat), but at least approximate values or perhaps just a relative ranking of them. Each of these hypotheses could be tested, and my personal belief doesn't really have much to do about anything. I don't yet have any evidence either way, and evidence is of course the only thing we can really base rational belief on, though oftentimes that evidence is reflected in theories about, say, human behavior, in which case I can say I find Hyp1 more likely than Hyp6, because I have seen evidence of only one of them happening before.
The answer is evidence. If you have none, they you have no reason to believe anything. Hypothesizing is not the same as believing, and can be done freely without repercussions. At least that is how things ought to be, everywhere and always.