Intraspecies sequence comparisons for annotating genomes

Genome Res. 2004 Dec;14(12):2406-11. doi: 10.1101/gr.3199704. Epub 2004 Nov 15.

Abstract

Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intraspecies sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents, and a set of genomic intervals were amplified, resequenced, and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C. intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom. It also raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Ciona intestinalis / genetics*
  • DNA-Binding Proteins / genetics
  • Evolution, Molecular
  • Forkhead Transcription Factors
  • Genes, Regulator / genetics
  • Genetic Variation*
  • Genome*
  • Likelihood Functions
  • Models, Genetic
  • Molecular Sequence Data
  • Mutation / genetics*
  • Nuclear Proteins / genetics
  • Phylogeny*
  • Plasmids / genetics
  • Sequence Analysis, DNA
  • Snail Family Transcription Factors
  • Transcription Factors / genetics

Substances

  • DNA-Binding Proteins
  • Forkhead Transcription Factors
  • Nuclear Proteins
  • Snail Family Transcription Factors
  • Transcription Factors

Associated data

  • GENBANK/AY667278
  • GENBANK/AY667279
  • GENBANK/AY667280
  • GENBANK/AY667281
  • GENBANK/AY667282
  • GENBANK/AY667283
  • GENBANK/AY667284
  • GENBANK/AY667285
  • GENBANK/AY667286
  • GENBANK/AY667287
  • GENBANK/AY667288
  • GENBANK/AY667289
  • GENBANK/AY667290
  • GENBANK/AY667291
  • GENBANK/AY667292
  • GENBANK/AY667293
  • GENBANK/AY667294
  • GENBANK/AY667295
  • GENBANK/AY667296
  • GENBANK/AY667297
  • GENBANK/AY667298
  • GENBANK/AY667299
  • GENBANK/AY667300
  • GENBANK/AY667301
  • GENBANK/AY667302
  • GENBANK/AY667303
  • GENBANK/AY667304
  • GENBANK/AY667305
  • GENBANK/AY667306
  • GENBANK/AY667307
  • GENBANK/AY667308
  • GENBANK/AY667309
  • GENBANK/AY667310
  • GENBANK/AY667311
  • GENBANK/AY667312
  • GENBANK/AY667313
  • GENBANK/AY667314
  • GENBANK/AY667315
  • GENBANK/AY667316
  • GENBANK/AY667317
  • GENBANK/AY667318
  • GENBANK/AY667319
  • GENBANK/AY667320
  • GENBANK/AY667321
  • GENBANK/AY667322
  • GENBANK/AY667323
  • GENBANK/AY667324
  • GENBANK/AY667325
  • GENBANK/AY667326
  • GENBANK/AY667327
  • GENBANK/AY667328
  • GENBANK/AY667329
  • GENBANK/AY667330
  • GENBANK/AY667331
  • GENBANK/AY667332
  • GENBANK/AY667333
  • GENBANK/AY667334
  • GENBANK/AY667335
  • GENBANK/AY667336
  • GENBANK/AY667337
  • GENBANK/AY667338
  • GENBANK/AY667339
  • GENBANK/AY667340
  • GENBANK/AY667341
  • GENBANK/AY667342
  • GENBANK/AY667343
  • GENBANK/AY667344
  • GENBANK/AY667345
  • GENBANK/AY667346
  • GENBANK/AY667347
  • GENBANK/AY667348
  • GENBANK/AY667349
  • GENBANK/AY667350
  • GENBANK/AY667351
  • GENBANK/AY667352
  • GENBANK/AY667353
  • GENBANK/AY667354
  • GENBANK/AY667355
  • GENBANK/AY667356
  • GENBANK/AY667357
  • GENBANK/AY667358
  • GENBANK/AY667359
  • GENBANK/AY667360
  • GENBANK/AY667361
  • GENBANK/AY667362
  • GENBANK/AY667363
  • GENBANK/AY667364
  • GENBANK/AY667365
  • GENBANK/AY667366
  • GENBANK/AY667367
  • GENBANK/AY667368
  • GENBANK/AY667369
  • GENBANK/AY667370
  • GENBANK/AY667371
  • GENBANK/AY667372
  • GENBANK/AY667373
  • GENBANK/AY667374
  • GENBANK/AY667375
  • GENBANK/AY667376
  • GENBANK/AY667377
  • GENBANK/AY667378
  • GENBANK/AY667379
  • GENBANK/AY667380
  • GENBANK/AY667381
  • GENBANK/AY667382
  • GENBANK/AY667383
  • GENBANK/AY667384
  • GENBANK/AY667385
  • GENBANK/AY667386
  • GENBANK/AY667387
  • GENBANK/AY667388
  • GENBANK/AY667389
  • GENBANK/AY667390
  • GENBANK/AY667391
  • GENBANK/AY667392
  • GENBANK/AY667393
  • GENBANK/AY667394
  • GENBANK/AY667395
  • GENBANK/AY667396
  • GENBANK/AY667397
  • GENBANK/AY667398
  • GENBANK/AY667399
  • GENBANK/AY667400
  • GENBANK/AY667401
  • GENBANK/AY667402
  • GENBANK/AY667403
  • GENBANK/AY667404
  • GENBANK/AY667405
  • GENBANK/AY667406
  • GENBANK/AY667407