<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>2041-9414-1-2</ui><ji>2041-9414</ji><fm>
<dochead>Short Report</dochead>
<bibl>
<title>
<p>Eukaryotic gene invasion by a bacterial mobile insertion sequence element IS2 during cloning into a plasmid vector</p>
</title>
<aug>
<au id="A1"><snm>Senejani</snm><mi>G</mi><fnm>Alireza</fnm><insr iid="I1"/><email>Alireza.senejani@yale.edu</email></au>
<au ca="yes" id="A2"><snm>Sweasy</snm><mi>B</mi><fnm>Joann</fnm><insr iid="I1"/><email>Joann.sweasy@yale.edu</email></au>
</aug>
<insg>
<ins id="I1"><p>Department of Therapeutic Radiology and Human Genetics, Yale University School of Medicine, New Haven, CT 06520, USA</p></ins>
</insg>
<source>Genome Integrity</source>
<issn>2041-9414</issn>
<pubdate>2010</pubdate>
<volume>1</volume>
<issue>1</issue>
<fpage>2</fpage>
<url>http://www.genomeintegrity.com/content/1/1/2</url>
<xrefbib><pubidlist><pubid idtype="pmpid">20678256</pubid><pubid idtype="doi">10.1186/2041-9414-1-2</pubid></pubidlist></xrefbib>
</bibl>
<history><rec><date><day>28</day><month>11</month><year>2009</year></date></rec><acc><date><day>26</day><month>5</month><year>2010</year></date></acc><pub><date><day>26</day><month>5</month><year>2010</year></date></pub></history>
<cpyrt><year>2010</year><collab>Senejani and Sweasy; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<p>
<it>Escherichia coli (E. coli) </it>is commonly used as a host for DNA cloning and sequencing. Upon transformation of <it>E. coli </it>with a recombinant vector carrying a gene of interest, the bacteria are expected to amplify that gene while maintaining the integrity of its sequence. During the subcloning of a mouse genomic fragment into a plasmid vector, we noticed that the size of the insert increased significantly upon replication in <it>E. coli</it>. Sequencing of the insert revealed a novel DNA sequence within the mouse genomic insert. A BLAST search of GenBank identified the novel sequence as the Insertion Sequence 2 (IS2) element from <it>E. coli</it>, which was likely inserted during replication in that organism. Importantly, a detailed search of GenBank shows that IS2 is present within many eukaryotic nucleotide sequences and, in many cases, has been annotated as part of a protein. The results of this study suggest that sequence results should be analyzed carefully using BLAST comparisons, and gene annotations verified further, before submission to GenBank.</p>
</sec>
</abs>
</fm><meta>
<classifications>
<classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
</classifications>
</meta><bdy>
<sec>
<st>
<p>Findings</p>
</st>
<p>In October 2009, GenBank (the NIH database in Bethesda, Maryland, U.S.A.) reported that its genetic sequence database exceeded 106 billion nucleotide bases in more than 3,000,000 named organisms <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. GenBank, the European Molecular Biology Laboratory (EMBL-Bank in Hinxton, U.K.), and the DNA Data Bank of Japan (Mishima, Japan) are the three members of the International Nucleotide Sequence Database Collaboration, which exchange information daily to ensure a consistent and complete collection of nucleotide sequence information. This database is expected to keep growing exponentially as sequencing becomes more affordable and demand increases. Therefore, to ensure greater integrity of the data appearing in GenBank, further steps to verify the identity of sequences before submission are becoming increasingly important.</p>
<p>Much of the information in GenBank was obtained by first subcloning a fragment of DNA, followed by its amplification and sequencing. During this process, <it>Escherichia coli (E. coli) </it>is commonly used as a host to replicate the gene of interest faithfully. In this study, we report how unexpected gene invasions during cloning in <it>E. coli </it>have led to the incorrect annotation of a number of genes and proteins from diverse species.</p>
<p>We are using a gene targeting approach to generate various knock-in mice. A 4.5 kb fragment of mouse genomic DNA harboring the region of interest was amplified by PCR, as shown in Figure <figr fid="F1">1</figr>. After digestion with appropriate restriction enzymes, the fragment was inserted into a plasmid. During screening of the transformed host, <it>E. coli </it>XL1-Blue (Stratagene), one of the resulting recombinant plasmids appeared to carry the insert. However, results from multiple restriction digestions and PCR analysis suggested the presence of an extra fragment of DNA within the insert (Figure <figr fid="F1">1</figr>). Sequencing, followed by comparison of the extra DNA with the entire sequence of the cloned PCR product, indicated that the extra DNA element, 1.3 kb in length, was not a duplication of any part of the original PCR-amplified region. Furthermore, since the <it>E. coli </it>XL1-Blue host is recombination deficient, the extra element was not expected to have arisen from recombination between the insert and other plasmids or the host genomic DNA.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Schematic diagram illustrating the cloning process that led to detection of an extra DNA element integrated into the insert</p></caption><text>
   <p><b>Schematic diagram illustrating the cloning process that led to detection of an extra DNA element integrated into the insert</b>. A PCR fragment of about 4.5 kb, amplified from the gene of interest, was inserted into a plasmid. The recombinant plasmid was then transformed into <it>E. coli</it>. PCR was then performed with the same primer set, using the extracted recombinant plasmid as the template. The resulting PCR fragment appeared to be about 5.8 kb long, indicating the presence of extra DNA inside the insert. Multiple restriction digestion analyses and sequencing confirmed the presence of the extra 1.3 kb DNA fragment within the insert.</p>
</text><graphic file="2041-9414-1-2-1" hint_layout="single"/></fig>
<p>Using nucleotide-nucleotide BLAST with the extra 1.3 kb DNA as a query against the DNA database, we found, to our surprise, that this DNA has been reported in many diverse species, as shown in Figure <figr fid="F2">2</figr>. The organisms reported to carry the identical DNA fragment include members of both the Eukaryota and Bacteria domains. Among bacteria, <it>E. coli </it>received the highest number of hits (61), followed by <it>Shigella </it>with more than 10 hits. Among eukaryotes, <it>Oryza sativa </it>received the highest number of hits (11), followed by <it>Arabidopsis thaliana</it>, <it>Macaca mulatta</it>, and <it>Homo sapiens </it>with 9, 9, and 5 hits, respectively (see Figure <figr fid="F2">2</figr>).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>Taxonomy BLAST report of species with sequences submitted to GenBank that contain the bacterial insertion element IS2 <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B8">8</abbr></abbrgrp></p></caption><text>
   <p><b>Taxonomy BLAST report of species with sequences submitted to GenBank that contain the bacterial insertion element IS2 </b><abbrgrp><abbr bid="B1">1</abbr><abbr bid="B8">8</abbr></abbrgrp>. The IS2 elements in these organisms are nearly identical. The numbers indicate how often the insertion element IS2 was found in the BLAST hit list.</p>
</text><graphic file="2041-9414-1-2-2" hint_layout="double"/></fig>
<p>Further sequence analysis indicated that the extra DNA was the <it>E. coli </it>insertion sequence element IS2, which likely incorporated itself into the insert during the cloning process. IS2 is a short 1.3 kb DNA sequence <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp> that acts as a simple, self-mobile genetic element <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Although insertion sequence elements often act as genomic parasites, they can sometimes cause chromosome rearrangements and produce mutations leading to elimination or adaptation of their host organism <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. The presence of multiple copies of IS2 in <it>E. coli </it>was reported more than thirty years ago <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp>. During the integration of IS2 into the 4.5 kb fragment used in our studies, a six-nucleotide sequence (AGAAAG) was duplicated, so that one copy flanks each end of the inserted element. We noticed a similar duplication (5-8 nucleotides) in nearly all other GenBank entries in which IS2 was recognized (see Table <tblr tid="T1">1</tblr>). As shown in Table <tblr tid="T1">1</tblr>, 11 copies of IS2 were detected when the IS2 nucleotide sequence was used to search the entire genome of <it>E. coli </it>W3110 [GenBank: <ext-link ext-link-id="AP009048" ext-link-type="gen">AP009048</ext-link>]. With the exception of one (AACCC), which was also reported in an earlier study <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>, there is no detectable similarity between the duplicated regions flanking the 11 copies of IS2 in <it>E. coli</it>.</p>
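<p>The flanking duplication described above can be checked programmatically. The following is a minimal sketch, not the procedure used in this study: it assumes the host sequence immediately upstream and downstream of the inserted element has already been extracted, and it reports the longest direct repeat shared by the two flanks. The function name, the <monospace>max_len</monospace> cutoff and the example flanks are hypothetical; only the AGAAAG repeat is taken from the text.</p>

```python
def target_site_duplication(left_flank, right_flank, max_len=10):
    """Return the longest direct repeat that ends left_flank and starts right_flank.

    left_flank  : host sequence immediately upstream of the inserted element
    right_flank : host sequence immediately downstream of the inserted element
    max_len     : longest repeat considered (an assumed cutoff, not from the study)
    """
    left = left_flank.upper()
    right = right_flank.upper()
    limit = min(max_len, len(left), len(right))
    best = ""
    for k in range(1, limit + 1):
        # A duplication of length k means the last k bases before the element
        # reappear as the first k bases after it.
        if left[-k:] == right[:k]:
            best = right[:k]
    return best


# Hypothetical flanks: only the AGAAAG repeat itself comes from the text.
upstream = "CCTAGAAAG"
downstream = "AGAAAGTTC"
print(target_site_duplication(upstream, downstream))  # prints AGAAAG
```

<p>Very short matches of one or two bases can arise by chance, so in practice a minimum repeat length would probably be required before calling a duplication.</p>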
<tbl id="T1"><title><p>Table 1</p></title><caption><p>List of some selected genes that contain the IS2 element.</p></caption><tblbdy cols="3">
      <r>
         <c ca="left">
            <p>
               <b>Gene name and ID</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Location<sup>a</sup></b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Surrounding IS2 sequences<sup>b</sup></b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>dbj|<ext-link ext-link-id="AP009048.1" ext-link-type="gen">AP009048.1</ext-link>| <it>Escherichia coli </it>W3110, complete genome, Length = 4646332</p>
         </c>
         <c ca="left">
            <p>3742928-3744271</p>
            <p>2320789-2322130</p>
            <p>381819-380478</p>
            <p>2071073-2072413</p>
            <p>4504197-4502855</p>
            <p>1105615-1106957</p>
            <p>1302369-1301034</p>
            <p>1469618-1470959</p>
            <p>2995017-2996347</p>
            <p>3186087-3184747</p>
            <p>1653272-1652545</p>
         </c>
         <c ca="left">
            <p>
               <monospace>GAAAT<b><ul>TGG...TCT</ul></b>AGAAATTGG</monospace>
            </p>
            <p>
               <monospace>GATCG<b><ul>TGG...TCT</ul></b>AGATCGT</monospace>
            </p>
            <p>
               <monospace>GTAAT<b><ul>TGG...TCT</ul></b>AGTAATT</monospace>
            </p>
            <p>
               <monospace>GTGGC<b><ul>TGG...TCT</ul></b>AGTGGC</monospace>
            </p>
            <p>
               <monospace>ACAAGG<b><ul>TGG...TCT</ul></b>ACAAGG</monospace>
            </p>
            <p>
               <monospace>AACCCT<b><ul>TGT...TCT</ul></b>AACCCT</monospace>
            </p>
            <p>
               <monospace>TAATATC<b><ul>TGT...TCT</ul></b>AATATC</monospace>
            </p>
            <p>
               <monospace>GAACCC<b><ul>TGT...TCT</ul></b>AAACCC</monospace>
            </p>
            <p>
               <monospace>tttat<b><ul>TGG...TCT</ul></b>aaacttg</monospace>
            </p>
            <p>
               <monospace>ATAAC<b><ul>TGG...TCT</ul></b>AATAAC</monospace>
            </p>
            <p>
               <monospace>acctg<b><ul>TGG</ul>...<ul>T</ul></b>tcgtc</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>dbj|<ext-link ext-link-id="AP008210.1" ext-link-type="gen">AP008210.1</ext-link>| <it>Oryza sativa</it>, chromosome 4, Length = 35498469</p>
         </c>
         <c ca="left">
            <p>27366872-27368214</p>
            <p>11718374-11719715</p>
            <p>13525525-13524204</p>
         </c>
         <c ca="left">
            <p>
               <monospace>AGAAAG<b><ul>TGG...TCT</ul></b>AGAAAG</monospace>
            </p>
            <p>
               <monospace>GTTAG<b><ul>TGG...TCT</ul></b>AGTTAGT</monospace>
            </p>
            <p>
               <monospace>AGAGAT<b><ul>TGG...TCT</ul></b>AGAGATT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>emb|<ext-link ext-link-id="AL731613.5" ext-link-type="gen">AL731613.5</ext-link>|<ext-link ext-link-id="OSJN00257" ext-link-type="gen">OSJN00257</ext-link><it>Oryza sativa</it>, chromosome 4, Length = 133967</p>
         </c>
         <c ca="left">
            <p>18027-19368</p>
         </c>
         <c ca="left">
            <p>
               <monospace>AGAAAG<b><ul>TGG...TCT</ul></b>AGAAAG</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC004776.1" ext-link-type="gen">AC004776.1</ext-link>| <it>Homo sapiens</it>, chromosome 5, Length = 89626</p>
         </c>
         <c ca="left">
            <p>20138-18796</p>
         </c>
         <c ca="left">
            <p>
               <monospace>ATTTCC<b><ul>TGG...TCT</ul></b>ATTTCCT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC018501.9" ext-link-type="gen">AC018501.9</ext-link>| <it>Homo sapiens</it>, chromosome 3, Length = 202070</p>
         </c>
         <c ca="left">
            <p>72232-70890</p>
         </c>
         <c ca="left">
            <p>
               <monospace>TATCTGG<b><ul>TGG...TCT</ul></b>ATCTGG</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC170856.2" ext-link-type="gen">AC170856.2</ext-link>| <it>Medicago truncatula</it>, chromosome 2, Length = 101215</p>
         </c>
         <c ca="left">
            <p>69921-68580</p>
         </c>
         <c ca="left">
            <p>
               <monospace>TCAAG<b><ul>TGG...TCT</ul></b>ATCAAGT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC191971.14" ext-link-type="gen">AC191971.14</ext-link>| <it>Rhesus macaque</it>, genomic DNA, Length = 174949</p>
         </c>
         <c ca="left">
            <p>169869-171210</p>
         </c>
         <c ca="left">
            <p>
               <monospace>ACACAG<b><ul>TGG...TCT</ul></b>ACACAG</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC202613.6" ext-link-type="gen">AC202613.6</ext-link>| <it>Rhesus macaque</it>, genomic DNA, Length = 181783</p>
         </c>
         <c ca="left">
            <p>146766-145426</p>
         </c>
         <c ca="left">
            <p>
               <monospace>GTTCC<b><ul>TGG...TCT</ul></b>AGTTCC</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC200594.3" ext-link-type="gen">AC200594.3</ext-link>| <it>Rhesus macaque</it>, genomic DNA, Length = 171745</p>
         </c>
         <c ca="left">
            <p>99933-98589</p>
         </c>
         <c ca="left">
            <p>
               <monospace>TAGGTGTT<b><ul>TGG...TCT</ul></b>AGTGTTT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC198308.7" ext-link-type="gen">AC198308.7</ext-link>| <it>Rhesus macaque</it>, genomic DNA, Length = 143534</p>
         </c>
         <c ca="left">
            <p>76175-74834</p>
         </c>
         <c ca="left">
            <p>
               <monospace>GTTTG<b><ul>TGG...TCT</ul></b>AGTTTGT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC196863.3" ext-link-type="gen">AC196863.3</ext-link>| <it>Macaca mulatta</it>, chromosome 2, Length = 172779</p>
         </c>
         <c ca="left">
            <p>155658-154318</p>
         </c>
         <c ca="left">
            <p>
               <monospace>CAAAC<b><ul>TGG...TCT</ul></b>ACAAAC</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC193812.4" ext-link-type="gen">AC193812.4</ext-link>| <it>Canis familiaris</it>, chromosome 13, Length = 196541</p>
         </c>
         <c ca="left">
            <p>31412-30070</p>
         </c>
         <c ca="left">
            <p>
               <monospace>TAGATCT<b><ul>TGG...TCT</ul></b>AGATCT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC187015.8" ext-link-type="gen">AC187015.8</ext-link>| <it>Canis familiaris</it>, chromosome 33, Length = 215278</p>
         </c>
         <c ca="left">
            <p>178002-176660</p>
         </c>
         <c ca="left">
            <p>
               <monospace>TAGTTGG<b><ul>TGG...TCT</ul></b>AGTTGG</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>dbj|<ext-link ext-link-id="AK229126.1" ext-link-type="gen">AK229126.1</ext-link>| <it>Arabidopsis thaliana</it>, cDNA, Length = 4564</p>
         </c>
         <c ca="left">
            <p>2374-3716</p>
         </c>
         <c ca="left">
            <p>
               <monospace>AAAGAG<b><ul>TGG...TCT</ul></b>AAAGAGT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>dbj|<ext-link ext-link-id="AK229400.1" ext-link-type="gen">AK229400.1</ext-link>| <it>Arabidopsis thaliana</it>, cDNA, Length = 3239</p>
         </c>
         <c ca="left">
            <p>869-2210</p>
         </c>
         <c ca="left">
            <p>
               <monospace>AGAAGG<b><ul>TGG...TCT</ul></b>AGAAGG</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AY064986.1" ext-link-type="gen">AY064986.1</ext-link>| <it>Arabidopsis thaliana</it>, cDNA, Length = 3309</p>
         </c>
         <c ca="left">
            <p>1180-2523</p>
         </c>
         <c ca="left">
            <p>
               <monospace>TAGAAGG<b><ul>TGG...TCT</ul></b>AGAAGGT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>dbj|<ext-link ext-link-id="AK226701.1" ext-link-type="gen">AK226701.1</ext-link>| <it>Arabidopsis thaliana</it>, cDNA, Length = 5499</p>
         </c>
         <c ca="left">
            <p>3369-2028</p>
         </c>
         <c ca="left">
            <p>
               <monospace>AGAATT<b><ul>TGG...TCT</ul></b>AGAATT</monospace>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>gb|<ext-link ext-link-id="AC193907.3" ext-link-type="gen">AC193907.3</ext-link>| <it>Pan troglodytes</it>, chromosome X, Length = 161221</p>
         </c>
         <c ca="left">
            <p>90423-91765</p>
         </c>
         <c ca="left">
            <p>
               <monospace>AGCAGG<b><ul>TGG...TCT</ul></b>AGCAGGT</monospace>
            </p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p><sup>a</sup>Location of the IS2 element within the entry. <sup>b</sup>Duplicated nucleotides flanking the IS2 element; the underlined sequences are the first and last three nucleotides of the IS2 element.</p>
   </tblfn></tbl>
<p>We have observed a phenomenon in the laboratory whereby bacterial IS2 elements are incorporated into eukaryotic genes during amplification of a recombinant plasmid vector in <it>E. coli </it>cells. The bacterial IS2 element is present in many complete or partial genomic DNA and cDNA sequences submitted to GenBank from diverse eukaryotic species, including <it>Homo sapiens, Mus musculus, Macaca mulatta, Oryza sativa, Arabidopsis thaliana</it>, and many others. As in our study, the insertion of the IS2 element most likely occurred during replication in <it>E. coli</it>. For example, when the IS2 element from <it>E. coli </it>K-12 is used as a BLAST query, one of the top hits is GenBank: <ext-link ext-link-id="AK227066.1" ext-link-type="gen">AK227066.1</ext-link>
<it>Arabidopsis thaliana </it>mRNA for calcium-dependent protein kinase 19 (CDPK19). When this <it>Arabidopsis </it>sequence is in turn used as a blastn query, the top three hits are <it>Arabidopsis </it>CDPK19 sequences that do not contain IS2 [GI: 145361922, 30687319, and 836941], whereas the next hits belong to IS2 elements in a variety of organisms including <it>E. coli</it>, plants and animals. On some occasions the IS2 DNA sequence is unintentionally annotated as a host protein or as part of a host protein-coding region, e.g. [GenBank: <ext-link ext-link-id="BAE99182" ext-link-type="gen">BAE99182</ext-link>] from <it>Arabidopsis thaliana </it>or [GenBank: <ext-link ext-link-id="CAI64485" ext-link-type="gen">CAI64485</ext-link>] from <it>Oryza sativa </it>
<abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>. Similarly, when the nucleotide sequence of IS1 was used as a query, the results indicated the presence of this mobile genetic element in many genes from numerous members of Eukaryota (data not shown). The results of this study suggest that BLAST results from cloned sequences should be analyzed with additional care, and gene annotations verified further, before submission to GenBank.</p>
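<p>One way to apply the check recommended above is to compare each cloned sequence with the IS2 element and flag long, near-identical matches before submission. The sketch below is an illustration under stated assumptions, not the analysis performed in this study. It assumes BLAST results saved in the standard 12-column tabular format (query, subject, percent identity, alignment length, mismatches, gap opens, query start, query end, subject start, subject end, E-value, bit score). The function name, the thresholds (95% identity, 1200 aligned bases) and the two example rows are hypothetical.</p>

```python
def flag_is_hits(tabular_text, min_identity=95.0, min_length=1200):
    """Return (subject_id, start, end) for long, near-identical hits.

    tabular_text : BLAST results in 12-column tab-separated format
    min_identity : assumed percent-identity cutoff (not from the study)
    min_length   : assumed alignment-length cutoff, close to the 1.3 kb of IS2
    """
    flagged = []
    for line in tabular_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment headers
        fields = line.split("\t")
        subject_id = fields[1]
        identity = float(fields[2])
        length = int(fields[3])
        s_start, s_end = int(fields[8]), int(fields[9])
        if identity >= min_identity and length >= min_length:
            # Report coordinates in ascending order regardless of strand.
            flagged.append((subject_id, min(s_start, s_end), max(s_start, s_end)))
    return flagged


# Two made-up rows: a full-length match on the minus strand and a short partial hit.
example_rows = "\n".join([
    "\t".join(["IS2_query", "subject_A", "99.8", "1331", "2", "0",
               "1", "1331", "3704", "2374", "0.0", "2400"]),
    "\t".join(["IS2_query", "subject_B", "88.0", "150", "18", "0",
               "1", "150", "10", "159", "1e-40", "170"]),
])
print(flag_is_hits(example_rows))  # prints [('subject_A', 2374, 3704)]
```

<p>A flagged interval only marks a candidate insertion; the flanking sequence and the annotation of the entry would still need to be inspected by hand.</p>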
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>AS carried out the molecular genetic studies, participated in the sequence alignment and drafted the manuscript. JS coordinated the study, helped to design it and improved the manuscript. All authors read and approved the final manuscript.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>The research was supported by CA 016038-33 (to J.S.) and by an Anna Fuller Fellowship (to A.S.).</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>GenBank</p></title><aug><au><snm>Benson</snm><fnm>DA</fnm></au><au><snm>Karsch-Mizrachi</snm><fnm>I</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au><au><snm>Ostell</snm><fnm>J</fnm></au><au><snm>Sayers</snm><fnm>EW</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>D26</fpage><lpage>31</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn723</pubid><pubid idtype="pmcid">2686462</pubid><pubid idtype="pmpid">18940867</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Nucleotide sequence of the transposable DNA-element IS2</p></title><aug><au><snm>Ghosal</snm><fnm>D</fnm></au><au><snm>Sommer</snm><fnm>H</fnm></au><au><snm>Saedler</snm><fnm>H</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>1979</pubdate><volume>6</volume><fpage>1111</fpage><lpage>1122</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/6.3.1111</pubid><pubid idtype="pmcid">327757</pubid><pubid idtype="pmpid">375194</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>The basis of asymmetry in IS2 transposition</p></title><aug><au><snm>Lewis</snm><fnm>LA</fnm></au><au><snm>Gadura</snm><fnm>N</fnm></au><au><snm>Greene</snm><fnm>M</fnm></au><au><snm>Saby</snm><fnm>R</fnm></au><au><snm>Grindley</snm><fnm>ND</fnm></au></aug><source>Mol Microbiol</source><pubdate>2001</pubdate><volume>42</volume><fpage>887</fpage><lpage>901</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1046/j.1365-2958.2001.02662.x</pubid><pubid idtype="pmpid" link="fulltext">11737634</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Dynamics of insertion sequence elements during experimental evolution of bacteria</p></title><aug><au><snm>Schneider</snm><fnm>D</fnm></au><au><snm>Lenski</snm><fnm>RE</fnm></au></aug><source>Res Microbiol</source><pubdate>2004</pubdate><volume>155</volume><fpage>319</fpage><lpage>327</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.resmic.2003.12.008</pubid><pubid 
idtype="pmpid" link="fulltext">15207863</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Multiple copies of the insertion-DNA sequences IS1 and IS2 in the chromosome of E. coli K-12</p></title><aug><au><snm>Saedler</snm><fnm>H</fnm></au><au><snm>Heiss</snm><fnm>B</fnm></au></aug><source>Mol Gen Genet</source><pubdate>1973</pubdate><volume>122</volume><fpage>267</fpage><lpage>277</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/BF00278602</pubid><pubid idtype="pmpid">4577900</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Multiple IS insertion sequences near the replication terminus in Escherichia coli K-12</p></title><aug><au><snm>Moszer</snm><fnm>I</fnm></au><au><snm>Glaser</snm><fnm>P</fnm></au><au><snm>Danchin</snm><fnm>A</fnm></au></aug><source>Biochimie</source><pubdate>1991</pubdate><volume>73</volume><fpage>1361</fpage><lpage>1374</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/0300-9084(91)90166-X</pubid><pubid idtype="pmpid" link="fulltext">1665988</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Sequence and analysis of rice chromosome 
4</p></title><aug><au><snm>Feng</snm><fnm>Q</fnm></au><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Hao</snm><fnm>P</fnm></au><au><snm>Wang</snm><fnm>S</fnm></au><au><snm>Fu</snm><fnm>G</fnm></au><au><snm>Huang</snm><fnm>Y</fnm></au><au><snm>Li</snm><fnm>Y</fnm></au><au><snm>Zhu</snm><fnm>J</fnm></au><au><snm>Liu</snm><fnm>Y</fnm></au><au><snm>Hu</snm><fnm>X</fnm></au><au><snm>Jia</snm><fnm>P</fnm></au><au><snm>Zhang</snm><fnm>Y</fnm></au><au><snm>Zhao</snm><fnm>Q</fnm></au><au><snm>Ying</snm><fnm>K</fnm></au><au><snm>Yu</snm><fnm>S</fnm></au><au><snm>Tang</snm><fnm>Y</fnm></au><au><snm>Weng</snm><fnm>Q</fnm></au><au><snm>Zhang</snm><fnm>L</fnm></au><au><snm>Lu</snm><fnm>Y</fnm></au><au><snm>Mu</snm><fnm>J</fnm></au><au><snm>Lu</snm><fnm>Y</fnm></au><au><snm>Zhang</snm><fnm>LS</fnm></au><au><snm>Yu</snm><fnm>Z</fnm></au><au><snm>Fan</snm><fnm>D</fnm></au><au><snm>Liu</snm><fnm>X</fnm></au><au><snm>Lu</snm><fnm>T</fnm></au><au><snm>Li</snm><fnm>C</fnm></au><au><snm>Wu</snm><fnm>Y</fnm></au><au><snm>Sun</snm><fnm>T</fnm></au><au><snm>Lei</snm><fnm>H</fnm></au><etal/></aug><source>Nature</source><pubdate>2002</pubdate><volume>420</volume><fpage>316</fpage><lpage>320</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature01183</pubid><pubid idtype="pmpid" link="fulltext">12447439</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Database resources of the National Center for Biotechnology 
Information</p></title><aug><au><snm>Sayers</snm><fnm>EW</fnm></au><au><snm>Barrett</snm><fnm>T</fnm></au><au><snm>Benson</snm><fnm>DA</fnm></au><au><snm>Bryant</snm><fnm>SH</fnm></au><au><snm>Canese</snm><fnm>K</fnm></au><au><snm>Chetvernin</snm><fnm>V</fnm></au><au><snm>Church</snm><fnm>DM</fnm></au><au><snm>DiCuccio</snm><fnm>M</fnm></au><au><snm>Edgar</snm><fnm>R</fnm></au><au><snm>Federhen</snm><fnm>S</fnm></au><au><snm>Feolo</snm><fnm>M</fnm></au><au><snm>Geer</snm><fnm>LY</fnm></au><au><snm>Helmberg</snm><fnm>W</fnm></au><au><snm>Kapustin</snm><fnm>Y</fnm></au><au><snm>Landsman</snm><fnm>D</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au><au><snm>Madden</snm><fnm>TL</fnm></au><au><snm>Maglott</snm><fnm>DR</fnm></au><au><snm>Miller</snm><fnm>V</fnm></au><au><snm>Mizrachi</snm><fnm>I</fnm></au><au><snm>Ostell</snm><fnm>J</fnm></au><au><snm>Pruitt</snm><fnm>KD</fnm></au><au><snm>Schuler</snm><fnm>GD</fnm></au><au><snm>Sequeira</snm><fnm>E</fnm></au><au><snm>Sherry</snm><fnm>ST</fnm></au><au><snm>Shumway</snm><fnm>M</fnm></au><au><snm>Sirotkin</snm><fnm>K</fnm></au><au><snm>Souvorov</snm><fnm>A</fnm></au><au><snm>Starchenko</snm><fnm>G</fnm></au><au><snm>Tatusova</snm><fnm>TA</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>D5</fpage><lpage>15</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn741</pubid><pubid idtype="pmcid">2686545</pubid><pubid idtype="pmpid">18940862</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>
