Phase variation induced by insertions and deletions (INDELs) in genomic homopolymeric tracts (HTs) can silence and regulate genes in pathogenic bacteria, but its role in the adaptation of the Mycobacterium tuberculosis complex (MTBC) has not been characterized. We leverage 31,428 diverse clinical isolates to identify genomic regions, including phase-variants, under positive selection. Of 87,651 INDEL events that emerge repeatedly across the phylogeny, 12.4% are phase-variants within HTs (0.02% of the genome by length). We estimate the in vitro frameshift rate in a neutral HT at [Formula: see text] frameshifts/HT/year, 100× the neutral substitution rate. Using neutral evolution simulations, we identify 4,098 substitutions and 45 phase-variants as putatively adaptive in MTBC (P < 0.002). We experimentally confirm that a putatively adaptive phase-variant alters the expression of espA, a critical mediator of ESX-1-dependent virulence. Our evidence supports the hypothesis that phase variation in the ESX-1 system of MTBC can act as a toggle between antigenicity and survival in the host.
Keywords: genomics; microbiology; phase variation; tuberculosis.
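As an illustration of the two concepts the abstract relies on, the following minimal Python sketch locates homopolymeric tracts in a nucleotide sequence and flags whether an INDEL of a given length would shift the reading frame. It is not the authors' pipeline: the function names, the example sequence, and the minimum tract length of 7 bp are all assumptions chosen for illustration.

```python
import re


def find_homopolymeric_tracts(seq, min_len=7):
    """Return (start, base, length) for every run of a single base
    that is at least min_len long. min_len=7 is an assumed threshold."""
    # ([ACGT]) captures one base; \1{n,} requires n or more repeats of it.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_len - 1))
    return [
        (m.start(), m.group(1), len(m.group(0)))
        for m in pattern.finditer(seq.upper())
    ]


def is_frameshift(indel_len):
    """An INDEL inside a coding sequence shifts the reading frame
    unless its length is a multiple of three."""
    return indel_len % 3 != 0


# Hypothetical sequence: a 7-bp C tract and an 8-bp G tract; the 4-bp A run is too short.
example = "ATG" + "C" * 7 + "GT" + "A" * 4 + "G" * 8 + "TT"
tracts = find_homopolymeric_tracts(example)
```

A single-base insertion or deletion in such a tract is frameshifting, and a second compensating event at the same tract restores the frame, which is what allows a phase-variant to behave as a reversible switch.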