Evolving the Bacterial Flagellum Through Mutation and Cooption: Part IV
I have previously discussed the various evolutionary mechanisms proposed to overcome the problem posed by Irreducible Complexity (IC). [1] Evolution through the coincidental cooption of an alternative function (CCAF) remains the best non-teleological explanation for the origin of IC. Thus, it is not surprising that the EFM hypothesis builds around this type of thinking. But there are two ways to envision CCAF: simultaneous cooption or gradual cooption. For the sake of simplicity, let us pretend the flagellum is a six-part IC system, where A,B,C,D,E, and G are essential proteins and F* is rotary motility. A simplistic model of simultaneous cooption is shown in figure 1.
Figure 1. Simultaneous CCAF
Here A, B, C, D, E, and G all pre-existed the flagellum and had alternative functions in the cell. Then, through some type of fortuitous event, they all came together and spawned flagellar function. Generating a novel function through the chance conglomeration of six independent proteins seems highly unlikely for two basic reasons:
Thus, it is not surprising that evolutionary biologist, H. Allen Orr, dismisses this type of solution to the problem of IC:
First it will do no good to suggest that all the required parts of some biochemical pathway popped up simultaneously by mutation. Although this "solution" yields a functioning system in one fell swoop, it's so hopelessly unlikely that no Darwinian takes it seriously. As Behe rightly says, we gain nothing by replacing a problem with a miracle.
[3]Perhaps we should then turn to the more "Darwinian" version of CCAF, one that merely envisions gradual addition to the IC system while sustaining the components by giving them each an alternative function along the way. The evolutionary origin of the same six-part system might come together gradually as in figure 2.

Figure 2. The Gradual Cooptional Origin of the Flagellum. A-E and G are gene products. F is an unspecified function. F1-F10 represent 10 different flagellum-independent functions. F* is rotary motility.
Here, we invoke 10 alternative non-flagellar functions to sustain individual parts and partial-IC systems. The advantage to this explanation is that it gets us as far away from simultaneous CCAF as possible. It captures the gradualism at the heart of Darwinism and helps to push each addition to the system closer to observable mutation effects, such as those seen in the generation of antibiotic resistance. That is, mutations in one gene at a time can add the component to the system. What's more, gene duplication can now more plausibly contribute to the account, as a duplication could have occurred prior to each cooption event.
Yet Orr is still skeptical of this type of account:
Second, we might think that some of the parts of an irreducibly complex system evolved step by step for some other purpose and were then recruited wholesale to a new function. But this is also unlikely. You may as well hope that half your car's transmission will suddenly help out in the airbag department. Such things might happen very, very rarely, but they surely do not offer a general solution to irreducible complexity.
[3]His solution is to invoke Original Helping Activities [1, 3], which would make the scenario more complex as many cooptional events would simply aid and not yet be part of a truly IC structure (as defined by Behe). Mutations would later alter them and confer an essential status upon them.
When viewed like this, IC ceases to pose any problem. Since IC is a function-dependent concept, and functions can disappear and emerge, IC states simply evolve. In fact, this viewpoint is so seductive that it has led many to believe a replay of life's tape (borrowing from Gould's metaphor) would simply generate something comparable to the bacterial flagellum. That is, while the actual flagellum we see may not exist in some replay, some comparable, complex motility structure would likely exist playing its role in filling the niche afforded by motility.
Yet despite its seductive appeal, we must ask a simple question : does this explanation really account for the origin of the bacterial flagellum? We are not merely engaged in speculative philosophy here. We are talking about something that actually exists and has an actual history. Possible worlds are fun to think about, but historical claims must have more than an intuitive appeal to the possible to support them. History is not about what could have possibly happened; it is about what did happen. Thus, does this gradual CCAF explanation really explain the origin of something that exists, namely, the bacterial flagellum? That we can vaguely envision it happening means only that we should seriously consider it as an explanation. It does not mean we should adopt it as the explanation. It the previous three essays, I highlighted many of the problems with the EFM hypothesis. Let us then step back from this example to see if further problems exist.
Reverse Engineering
Julie Thomas originally surveyed the bacterial flagellum from the perspectives of Ur-IC and thematic IC. [4] Ur-IC is a postulated IC state that existed in the last common ancestral flagellum. Thematic IC focuses on the various functional roles entailed by the components of a machine to determine if the roles themselves exist in an IC relationship. Thematic IC might also be a helpful concept as we try to begin reverse engineering the flagellum.
If we look to the standard E. coli flagellum, it is composed of eight subsystems with distinct functions:
We could theoretically eliminate many of the genes listed above if we reverse engineer our way to a simpler flagellar prototype. The base itself could possibly be composed of one protein. The motor could be composed of one membrane protein and use the base as the rotor. We can eliminate the switch, as this is more of luxury item. For example, when the motor is switched such than bacteria tumble, this merely amplifies the effects of Brownian motion to facilitate the reorientation of the cell. Simply turning off the motor could accomplish much of the same effect. In fact, this is how the flagella in Rhodobacter sphaeroides work. [5] The export machinery is hard to reverse engineer since it is still a black box. Nevertheless, let's be radical and assume it could function much the same with only three proteins. The drive-shaft could theoretically be composed of a polymer composed of only one protein. The L and P rings are not needed in gram-positive bacteria, since such bacteria lack outer cell membranes, thus we can eliminate this from the prototype (although it raises an interesting question about the origin of gram-positives and gram-negatives to be explored at a later date). Finally, it would seem you could eliminate the hook and filament proteins and simply have the drive shift polymer extend well past the outer membrane. This would leave us with the Base, the Motor, the Export Machinery, and a Filament, which is awfully similar to the assumption about original flagella inherent in the EFM hypothesis (where the filament and motor are added to the export machinery through CCAF). And this would leave us with a total of only six proteins. Of course, a six-protein IC system still poses an IC problem, but in light of the EFM hypothesis, it does not seem insurmountable.
Yet the problem with the above approach is two-fold. Our ability to reverse engineer, in a realistic manner, is greatly hindered by our lack of understanding about the mechanistic interactions between the various components of the flagellum. Furthermore, our experience with designing nanotechnology is practically non-existent. Both factors suggest this six-part system may very well be over-simplistic and non-workable. Recall, for example, that the flagellum must be assembled and the act of assembly may place constraints on the minimal complexity of a rotary nano-propeller. For example, we have already seen in part II that filament assembly appears to be IC, requiring both flagellin and the cap. That is, the cap is not an added luxury item that merely seals off the filament (as was once thought). It is an essential assembly factor, playing the role of a processive chaperone. Thus, we can predict that further understanding of the flagellar components is likely to expand the list of essential players in the minimal flagellar prototype.
The Ur-IC Flagellum
Because of these limitations, it would be helpful to consider the flagellar components last shared by an ancestral flagellum. Thomas [4] originally did this by comparing the flagellar gene content of various distantly related bacteria. Allow me to do likewise and provide a list of flagellar genes found in Aquifex aeolicus, Bacillus subtilis, Escherichia coli, and Treponema pallidum. (Table I)
Table I. Genes Likely Part of the Ur-IC Flagellum [6]
|
Functional Role |
Gene Products |
|
Motor |
MotA, MotB, FliG (C-term) |
|
Base |
FliF, FliG (N-term), FliM/N |
|
Export Machinery |
FlhB, FliQ, FliR, FliP, FliI, FlhA |
|
Drive-shaft |
FlgB, FlgC, FlgG, FliE |
|
Hook and Adapters |
FlgE, FlgL, FlgK, FlgD |
|
Filament |
FliC, FliD |
Thus, 21 genes are shared by these four distantly related bacteria. Aquifex aeolicus are the most thermophilic bacteria, growing just below the boiling point of water. They are also thought to represent the earliest lineages to branch off the eubacterial tree. Bacillus subtilis is a gram-positive soil bacterium that can use a wide variety of carbon sources. Very similar bacteria (Clostridium) used to be thought most primitive. Escherichia coli represent the gram-negative proteobacteria and live in the digestive tracts of many organisms. Their flagella are among the most studied. Treponema pallidum is a spirochete whose flagellum is part of a rather specialized motility organelle known as the axial filament. As I mentioned, these four species are very distantly related, as seen by the phylogenetic tree constructed from 16s rRNA sequence (Fig 3). Furthermore, all four bacteria have experienced very different environmental pressures over the last several billions years. This strongly implies that these 21 genes were present in the last common ancestor of all eubacteria, thus comprising the Ur-IC flagellum. To further test this notion, I surveyed the flagellar genes of Thermotoga maritima since it is also a very deeply branching bacterium. According to the TIGR list of flagellar genes, everything in the Ur-IC list is represented, thus confirming what IC would predict.[7] Furthermore, this Ur-IC state has persisted for billions of years since it appeared. That billions of years of microbial evolution, in each lineage, have not imposed significant permutations on this IC core speaks to its true IC state.

Figure 3. Eubacterial phylogenetic tree. Adapted from [8]
Since the last detectable flagellum most likely contained these 21 genes (22, if we split FliG; 23 if we separate FliM and FliN), we can finally turn to the hypothesis of gradual CCAF to understand why it is so unconvincing.
NEXT: How well do the EFM hypothesis and gradual CCAF explain the origin of the Ur-IC flagellum?
Citations
Note: On 3/07/02, I changed the Ur-IC scoring from 19 to 21 genes. The original scoring did not include the rod protein FlgG because it was not listed in TIGR's flagellar genes for B. subtilis. Further analysis uncovered that in Bacillus, FlgG is flhP. FlgG has clearly been identified in B. subtilis. See Gene 1991 May 15;101(1):23-31. FlgD is listed as a component in E. coli, Thermotoga, and Trepenoma. Although apparently lacking in B. subitilis, it was present in B. halodurans. Using flgD sequence from B. halodurans, I BLASTed the B. subtilis and Aquifex genome, finding significant homologs.
Evolving the Bacterial Flagellum Through Mutation and Cooption: Part V