LOCUS       AE000222               11519 bp    DNA     linear   BCT 28-AUG-2002
DEFINITION  Escherichia coli K12 MG1655 section 112 of 400 of the complete
            genome.
ACCESSION   AE000222 U00096
VERSION     AE000222.1  GI:1787486
KEYWORDS    .
SOURCE      Escherichia coli K12.
  ORGANISM  Escherichia coli K12
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 11519)
  AUTHORS   Blattner,F.R., Plunkett,G. III, Bloch,C.A., Perna,N.T., Burland,V.,
            Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G.F.,
            Gregor,J., Davis,N.W., Kirkpatrick,H.A., Goeden,M.A., Rose,D.J.,
            Mau,B. and Shao,Y.
  TITLE     The complete genome sequence of Escherichia coli K-12
  JOURNAL   Science 277 (5331), 1453-1474 (1997)
  MEDLINE   97426617
   PUBMED   9278503
REFERENCE   2  (bases 1 to 11519)
  AUTHORS   Petersen,C., Moller,L.B. and Valentin-Hansen,P.
  TITLE     The cryptic adenine deaminase gene of Escherichia coli. Silencing
            by the nucleoid-associated DNA-binding protein, H-NS, and
            activation by insertion elements
  JOURNAL   J. Biol. Chem. 277 (35), 31373-31380 (2002)
   PUBMED   12077137
REFERENCE   3  (bases 1 to 11519)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JAN-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   4  (bases 1 to 11519)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-SEP-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   5  (bases 1 to 11519)
  AUTHORS   Plunkett,G. III.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-OCT-1998) Laboratory of Genetics, University of
            Wisconsin, 445 Henry Mall, Madison, WI 53706, USA
COMMENT     This sequence was determined by the E. coli Genome Project at the
            University of Wisconsin-Madison (Frederick R. Blattner, director). 
            Supported by NIH grants HG00301 and HG01428 (from the Human Genome
            Project and NCHGR). The entire sequence was independently
            determined from E. coli K12 strain MG1655. Predicted open reading
            frames were determined using GeneMark software, kindly supplied by
            Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA,
            30332 [e-mail: mark@amber.gatech.edu].  Open reading frames that
            have been correlated with genetic loci are being annotated with CG
            Site Nos., unique ID nos. for the genes in the E. coli Genetic
            Stock Center (CGSC) database at Yale University, kindly supplied by
            Mary Berlyn. A public version of the database is accessible
            (http://cgsc.biology.yale.edu). Annotation of the genome is an
            ongoing task whose goal is to make the genome sequence more useful
            by correlating it with other data.  Comments to the authors are
            appreciated. Updated information will be available at the E. coli
            Genome Project's World Wide Web site
            (http://www.genetics.wisc.edu). *** The E. coli K12 sequence and
            its annotations are periodically updated; this is version M54. No
            sequence changes. Annotation updates: updated gene identifications
            and products; all new functional assignments courtesy of Monica
            Riley; added promoters, protein binding sites, and repeated
            sequences described in reference 1. The unique numeric identifiers
            beginning with a lowercase 'b' assigned to each gene (protein- or
            RNA-encoding) are now designated as gene synonyms instead of
            labels. This should allow them to be searched for in Entrez as gene
            names.
FEATURES             Location/Qualifiers
     source          1..11519
                     /organism="Escherichia coli K12"
                     /strain="K12"
                     /sub_strain="MG1655"
                     /db_xref="taxon:83333"
     promoter        <1..9
                     /note="factor Sigma70; predicted +1 start at 1289405"
     gene            76..1089
                     /gene="hnr"
                     /note="b1235"
     CDS             76..1089
                     /gene="hnr"
                     /function="regulator; Basic proteins - synthesis,
                     modification"
                     /note="o337; 100 pct identical to HNR_ECOLI SW: P37055"
                     /codon_start=1
                     /transl_table=11
                     /product="Hnr protein"
                     /protein_id="AAC74317.1"
                     /db_xref="GI:1787487"
                     /translation="MTQPLVGKQILIVEDEQVFRSLLDSWFSSLGATTVLAADGVDAL
                     ELLGGFTPDLMICDIAMPRMNGLKLLEHIRNRGDQTPVLVISATENMADIAKALRLGV
                     EDVLLKPVKDLNRLREMVFACLYPSMFNSRVEEEERLFRDWDAMVDNPAAAAKLLQEL
                     QPPVQQVISHCRVNYRQLVAADKPGLVLDIAALSENDLAFYCLDVTRAGHNGVLAALL
                     LRALFNGLLQEQLAHQNQRLPELGALLKQVNHLLRQANLPGQFPLLVGYYHRELKNLI
                     LVSAGLNATLNTGEHQVQISNGVPLGTLGNAYLNQLSQRCDAWQCQIWGTGGRLRLML
                     SAE"
     promoter        1122..1150
                     /note="factor Sigma70; predicted +1 start at 1290546"
     gene            1291..2199
                     /gene="galU"
                     /note="b1236"
     CDS             1291..2199
                     /gene="galU"
                     /EC_number="2.7.7.9"
                     /function="enzyme; Degradation of small molecules: Carbon
                     compounds"
                     /note="o302; 100 pct identical to GALU_ECOLI SW: P25520"
                     /codon_start=1
                     /transl_table=11
                     /product="glucose-1-phosphate uridylyltransferase"
                     /protein_id="AAC74318.1"
                     /db_xref="GI:1787488"
                     /translation="MAAINTKVKKAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQY
                     VVNECIAAGITEIVLVTHSSKNSIENHFDTSFELEAMLEKRVKRQLLDEVQSICPPHV
                     TIMQVRQGLAKGLGHAVLCAHPVVGDEPVAVILPDVILDEYESDLSQDNLAEMIRRFD
                     ETGHSQIMVEPVADVTAYGVVDCKGVELAPGESVPMVGVVEKPKADVAPSNLAIVGRY
                     VLSADIWPLLAKTPPGAGDEIQLTDAIDMLIEKETVEAYHMKGKSHDCGNKLGYMQAF
                     VEYGIRHNTLGTEFKAWLEEEMGIKK"
     gene            complement(2343..2756)
                     /gene="hns"
                     /note="b1237"
     CDS             complement(2343..2756)
                     /gene="hns"
                     /function="regulator; Basic proteins - synthesis,
                     modification"
                     /note="f137; 99 pct identical to HNS_ECOLI SW: P08936"
                     /codon_start=1
                     /transl_table=11
                     /product="DNA-binding protein HLP-II (HU, BH2, HD, NS);
                     pleiotropic regulator"
                     /protein_id="AAC74319.1"
                     /db_xref="GI:1787489"
                     /translation="MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEE
                     SAAAAEVEERTRKLQQYREMLIADGIDPNELLNSLAAVKSGTKAKRAQRPAKYSYVDE
                     NGETKTWTGQGRTPAVIKKAMDEQGKSLDDFLIKQ"
     promoter        complement(2799..2827)
                     /note="factor Sigma70; predicted +1 start at 1292181"
     promoter        3292..3319
                     /note="factor Sigma70; predicted +1 start at 1292715"
     promoter        3310..3338
                     /note="factor Sigma70; predicted +1 start at 1292734"
     gene            3361..3978
                     /gene="tdk"
                     /note="b1238"
     CDS             3361..3978
                     /gene="tdk"
                     /EC_number="2.7.1.21"
                     /function="enzyme; Salvage of nucleosides and nucleotides"
                     /note="o205; 100 pct identical to KITH_ECOLI SW: P23331"
                     /codon_start=1
                     /transl_table=11
                     /product="thymidine kinase"
                     /protein_id="AAC74320.1"
                     /db_xref="GI:1787490"
                     /translation="MAQLYFYYSAMNAGKSTALLQSSYNYQERGMRTVVYTAEIDDRF
                     GAGKVSSRIGLSSPAKLFNQNSSLFDEIRAEHEQQAIHCVLVDECQFLTRQQVYELSE
                     VVDQLDIPVLCYGLRTDFRGELFIGSQYLLAWSDKLVELKTICFCGRKASMVLRLDQA
                     GRPYNEGEQVVIGGNERYVSVCRKHYKEALQVDSLTAIQERHRHD"
     gene            complement(4260..4850)
                     /gene="ychG"
                     /note="b1239"
     CDS             complement(4260..4850)
                     /gene="ychG"
                     /function="orf; Unknown"
                     /note="f196; 100 pct identical to YCHG_ECOLI SW: P30192"
                     /codon_start=1
                     /transl_table=11
                     /product="orf, hypothetical protein"
                     /protein_id="AAC74321.1"
                     /db_xref="GI:1787491"
                     /translation="MALPPDRTDRGAERYLKDDWHGLQLFAIDGAQFRTPDKPELREY
                     YGSANTSTKRQNAYPVMRLVALMNLGSHILLNAVTAPYRQSETVLAHSMLATIPDNSI
                     TLFDKLFYSEDLLLTLNQKGCNRHWLLPAWKNIASEMIELGNTASPGTIPKRLEHLRG
                     ALEVVFITKRPRPSRPRSVKISKTRYPVKHSAAPLK"
     gene            complement(4802..5032)
                     /gene="b1240"
     CDS             complement(4802..5032)
                     /gene="b1240"
                     /function="orf; Unknown"
                     /note="f76; f76; broken; This 76 aa ORF is 42 pct
                     identical (2 gaps) to 49 residues of an approx. 448 aa
                     protein YI41_ECOLI SW: P03835"
                     /codon_start=1
                     /transl_table=11
                     /product="orf, hypothetical protein"
                     /protein_id="AAC74322.1"
                     /db_xref="GI:1787492"
                     /translation="MRPFAAVVYRGTWLSGWWCNEPITDVVRRLNLSADGEAGMNLLA
                     RSAVTQARQRVGAAPVEWLFRQTAQTGARNVT"
     promoter        complement(4981..5010)
                     /gene="b1240"
                     /note="factor Sigma70; predicted +1 start at 1294363"
     gene            complement(5280..7955)
                     /gene="adhE"
                     /note="b1241"
     CDS             complement(5280..7955)
                     /gene="adhE"
                     /EC_number="1.1.1.1"
                     /function="enzyme; Energy metabolism, carbon:
                     Fermentation"
                     /note="f891; 99 pct identical to ADHE_ECOLI SW: P17547"
                     /codon_start=1
                     /transl_table=11
                     /product="CoA-linked acetaldehyde dehydrogenase and
                     iron-dependent alcohol dehydrogenase;
                     pyruvate-formate-lyase deactivase"
                     /protein_id="AAC74323.1"
                     /db_xref="GI:1787493"
                     /translation="MAVTNVAELNALVERVKKAQREYASFTQEQVDKIFRAAALAAAD
                     ARIPLAKMAVAESGMGIVEDKVIKNHFASEYIYNAYKDEKTCGVLSEDDTFGTITIAE
                     PIGIICGIVPTTNPTSTAIFKSLISLKTRNAIIFSPHPRAKDATNKAADIVLQAAIAA
                     GAPKDLIGWIDQPSVELSNALMHHPDINLILATGGPGMVKAAYSSGKPAIGVGAGNTP
                     VVIDETADIKRAVASVLMSKTFDNGVICASEQSVVVVDSVYDAVRERFATHGGYLLQG
                     KELKAVQDVILKNGALNAAIVGQPAYKIAELAGFSVPENTKILIGEVTVVDESEPFAH
                     EKLSPTLAMYRAKDFEDAVEKAEKLVAMGGIGHTSCLYTDQDNQPARVSYFGQKMKTA
                     RILINTPASQGGIGDLYNFKLAPSLTLGCGSWGGNSISENVGPKHLINKKTVAKRAEN
                     MLWHKLPKSIYFRRGSLPIALDEVITDGHKRALIVTDRFLFNNGYADQITSVLKAAGV
                     ETEVFFEVEADPTLSIVRKGAELANSFKPDVIIALGGGSPMDAAKIMWVMYEHPETHF
                     EELALRFMDIRKRIYKFPKMGVKAKMIAVTTTSGTGSEVTPFAVVTDDATGQKYPLAD
                     YALTPDMAIVDANLVMDMPKSLCAFGGLDAVTHAMEAYVSVLASEFSDGQALQALKLL
                     KEYLPASYHEGSKNPVARERVHSAATIAGIAFANAFLGVCHSMAHKLGSQFHIPHGLA
                     NALLICNVIRYNANDNPTKQTAFSQYDRPQARRRYAEIADHLGLSAPGDRTAAKIEKL
                     LAWLETLKAELGIPKSIREAGVQEADFLANVDKLSEDAFDDQCTGANPRYPLISELKQ
                     ILLDTYYGRDYVEGETAAKKEAAPAKAEKKAKKSA"
     promoter        complement(7968..7995)
                     /note="factor Sigma70; predicted +1 start at 1297350"
     promoter        8340..8368
                     /note="factor Sigma70; predicted +1 start at 1297764"
     gene            8432..9079
                     /gene="ychE"
                     /note="b1242"
     CDS             8432..9079
                     /gene="ychE"
                     /function="putative transport; Not classified"
                     /note="o215; 100 pct identical to YCHE_ECOLI SW: P25743
                     but has 33 additional N-terminal residues"
                     /codon_start=1
                     /transl_table=11
                     /product="putative channel protein"
                     /protein_id="AAC74324.1"
                     /db_xref="GI:1787494"
                     /translation="MIQTFFDFPVYFKFFIGLFALVNPVGIIPVFISMTSYQTAAARN
                     KTNLTANLSVAIILWISLFLGDTILQLFGISIDSFRIAGGILVVTIAMSMISGKLGED
                     KQNKQEKSETAVRESIGVVPLALPLMAGPGAISSTIVWGTRYHSISYLFGFFVAIALF
                     ALCCWGLFRMAPWLVRVLRQTGINVITRIMGLLLMALGIEFIVTGIKGIFPGLLN"
     promoter        9269..9297
                     /note="factor Sigma70; promoter oppA; documented +1
                     at1298695"
     protein_bind    9674..9691
                     /note="central position to opp promoter:377"
                     /bound_moiety="PhoB documented site"
     gene            9817..11448
                     /gene="oppA"
                     /note="b1243"
     CDS             9817..11448
                     /gene="oppA"
                     /function="transport; Protein, peptide secretion"
                     /note="o543; 100 pct identical to OPPA_ECOLI SW: P23843"
                     /codon_start=1
                     /transl_table=11
                     /product="oligopeptide transport; periplasmic binding
                     protein"
                     /protein_id="AAC74325.1"
                     /db_xref="GI:1787495"
                     /translation="MTNITKRSLVAAGVLAALMAGNVALAADVPAGVTLAEKQTLVRN
                     NGSEVQSLDPHKIEGVPESNISRDLFEGLLVSDLDGHPAPGVAESWDNKDAKVWTFHL
                     RKDAKWSDGTPVTAQDFVYSWQRSVDPNTASPYASYLQYGHIAGIDEILEGKKPITDL
                     GVKAIDDHTLEVTLSEPVPYFYKLLVHPSTSPVPKAAIEKFGEKWTQPGNIVTNGAYT
                     LKDWVVNERIVLERSPTYWNNAKTVINQVTYLPIASEVTDVNRYRSGEIDMTNNSMPI
                     ELFQKLKKEIPDEVHVDPYLCTYYYEINNQKPPFNDVRVRTALKLGMDRDIIVNKVKA
                     QGNMPAYGYTPPYTDGAKLTQPEWFGWSQEKRNEEAKKLLAEAGYTADKPLTINLLYN
                     TSDLHKKLAIAASSLWKKNIGVNVKLVNQEWKTFLDTRHQGTFDVARAGWCADYNEPT
                     SFLNTMLSNSSMNTAHYKSPAFDSIMAETLKVTDEAQRTALYTKAEQQLDKDSAIVPV
                     YYYVNARLVKPWVGGYTGKDPLDNTYTRNMYIVKH"
BASE COUNT     3016 a   2576 c   2837 g   3090 t
ORIGIN      
        1 cacttaagtt aattctgaca ggcgcaggtg gcaatagcat gccactattg agtaaagcca
       61 gtcaggggag agaacatgac gcagccattg gtcggaaaac agattctcat tgttgaagat
      121 gagcaggtat ttcgctcgct tctggattca tggttttcct cattgggagc gacaacggta
      181 ctggcggctg atggggtgga tgcccttgag ttgctgggag gtttcactcc agacctgatg
      241 atatgtgata tcgcgatgcc acgaatgaac gggcttaaac tgctggagca tatacgtaac
      301 agaggcgacc agaccccagt tctggtgata tctgccactg aaaatatggc agatattgcc
      361 aaagcgttac gtctgggcgt tgaagatgtt ttgctgaaac cagttaaaga tctgaatcgc
      421 ttgcgcgaga tggtttttgc ctgtctctat cccagcatgt ttaattcgcg cgttgaggaa
      481 gaggaaaggc tttttcgcga ctgggatgca atggttgata accctgccgc agcggcgaaa
      541 ttattacagg aactacaacc gccggttcag caggtgattt cccattgccg ggttaattat
      601 cgtcaattgg ttgccgcgga caaacccggc ctggtgcttg atattgccgc actttcggaa
      661 aacgatctgg cattttattg ccttgatgtc acccgagctg gacataatgg cgtacttgct
      721 gccttgttat tacgcgcatt gtttaacgga ttattacagg aacagcttgc acaccaaaat
      781 caacggttgc cagagttggg cgcgttattg aagcaggtaa accatttact tcgtcaggcc
      841 aatctgccgg ggcagtttcc gctattagtt ggctattatc atcgcgaact gaaaaatctc
      901 attctggttt ctgcgggtct gaatgcgacg ttaaataccg gcgaacacca ggtgcaaatc
      961 agtaatggtg ttccgttagg cactttaggt aacgcttatt tgaatcaatt gagccagcga
     1021 tgcgatgcct ggcaatgcca aatatgggga accggtggtc gactgcgctt gatgttgtct
     1081 gcagaatgag caaacgataa cgcgggctaa atttgcatta cctgctaatg tcggctggtg
     1141 gtactatcgt cgccattcgt ataagtaatt gtcttaatta tgctaactcg cctccttttc
     1201 agaacttagc cccttcgggg tgctgatata ctgggatgcg atacagaaat atgaacacgt
     1261 tcaaaacacg aacagtccag gagaatttaa atggctgcca ttaatacgaa agtcaaaaaa
     1321 gccgttatcc ccgttgcggg attaggaacc aggatgttgc cggcgacgaa agccatcccg
     1381 aaagagatgc tgccacttgt cgataagcca ttaattcaat acgtcgtgaa tgaatgtatt
     1441 gcggctggca ttactgaaat tgtgctggtt acacactcat ctaaaaactc tattgaaaac
     1501 cactttgata ccagttttga actggaagca atgctggaaa aacgtgtaaa acgtcaactg
     1561 cttgatgaag tgcagtctat ttgtccaccg cacgtgacta ttatgcaagt tcgtcagggt
     1621 ctggcgaaag gcctgggaca cgcggtattg tgtgctcacc cggtagtggg tgatgaaccg
     1681 gtagctgtta ttttgcctga tgttattctg gatgaatatg aatccgattt gtcacaggat
     1741 aacctggcag agatgatccg ccgctttgat gaaacgggtc atagccagat catggttgaa
     1801 ccggttgctg atgtgaccgc atatggcgtt gtggattgca aaggcgttga attagcgccg
     1861 ggtgaaagcg taccgatggt tggtgtggta gaaaaaccga aagcggatgt tgcgccgtct
     1921 aatctcgcta ttgtgggtcg ttacgtactt agcgcggata tttggccgtt gctggcaaaa
     1981 acccctccgg gagctggtga tgaaattcag ctcaccgacg caattgatat gctgatcgaa
     2041 aaagaaacgg tggaagccta tcatatgaaa gggaagagcc atgactgcgg taataaatta
     2101 ggttacatgc aggccttcgt tgaatacggt attcgtcata acacccttgg cacggaattt
     2161 aaagcctggc ttgaagaaga gatgggcatt aagaagtaac atccgtatcg gtgttatcca
     2221 cgaaacggcg ttgagcaatc gacgccgttt ttttatagct tattcttatt aaattgtctt
     2281 aaaccggaca ataaaaaatc ccgccgctgg cgggatttta agcaagtgca atctacaaaa
     2341 gattattgct tgatcaggaa atcgtcgagg gatttacctt gctcatccat tgcttttttg
     2401 attacagctg gagtacggcc ttggccagtc caggttttag tttcgccgtt ttcgtcaacg
     2461 tagctatatt ttgccggacg ctgagcacgt ttagctttgg tgccagattt aacggcagca
     2521 aggctattca gcagttcgtt cgggtcaata ccgtcagcga tcagcatttc gcgatattgc
     2581 tgcagtttac gagtgcgctc ttcaacttca gcagcagccg cgctttcttc ttcgcgacgt
     2641 tcgttaacaa caacttctaa tttttccagc atttcttcca gcgtttcaag tgtacattct
     2701 cttgcctgcg cacgaagagt acggatgttg ttcagaattt taagtgcttc gctcattgta
     2761 gtaatctcaa acttatattg gggtggtttg ttgaggtaat aatagagcct taaattcagt
     2821 tgtgcaatag ccaggaatgt aaggaattca aaattgttct ttattttgtg ccgccaataa
     2881 atatcttttc ataaaattag ccagaaaaga cgcggcatat agccctattt acaccgatga
     2941 tttcgcagca cgtgaggtta aaacttcctg attcatgtca cattttatgg ggagattatc
     3001 gtaggctgac gacctttcag tcttctgtat tagttgtgtt tacgagaatt ccctattaag
     3061 cgaatgatga aaagtagaac agtcgcaata agagcatgga cttagtattg cactatctcc
     3121 tggaggtcaa cagagggcta ttacttgcgc aacaggttaa agattgtgaa tagttaccag
     3181 cagtcattta cccgcttata acaagcgagg cagttgtaat gatagctcag aaggattatg
     3241 caaggcttcg taagggagaa cgcatatacc cacttctgtg catactgttg agctgaaaaa
     3301 ctgacgaatt atgataaact ccagccaact ttatttcata tcattgaggg cctgtggctg
     3361 atggcacagc tatatttcta ctattccgca atgaatgcgg gtaagtctac agcattgttg
     3421 caatcttcat acaattacca ggaacgcggc atgcgcactg tcgtatatac ggcagaaatt
     3481 gatgatcgct ttggtgccgg gaaagtcagt tcgcgtatag gtttgtcatc gcctgcaaaa
     3541 ttatttaacc aaaattcatc attatttgat gagattcgtg cggaacatga acagcaggca
     3601 attcattgcg tactggttga tgaatgccag tttttaacca gacaacaagt atatgaatta
     3661 tcggaggttg tcgatcaact cgatataccc gtactttgtt atggtttacg taccgatttt
     3721 cgaggtgaat tatttattgg cagccaatac ttactggcat ggtccgacaa actggttgaa
     3781 ttaaaaacca tctgtttttg tggccgtaaa gcaagcatgg tgctgcgtct tgatcaagca
     3841 ggcagacctt ataacgaagg tgagcaggtg gtaattggtg gtaatgaacg atacgtttct
     3901 gtatgccgta aacactataa agaggcgtta caagtcgact cattaacggc tattcaggaa
     3961 aggcatcgcc acgattaata agaatttctt tactgacagg gtgagcaggg cacttttatc
     4021 ctgtcagttc gttttacgca cttcttccgg gctatatacc cttctcggca gttttttaac
     4081 gccgctatac gcctcacagg gctcttaagc accgacgttg acttgtgacc tgtaaagtac
     4141 aatatccctg tgtttaggcg ttatacatcg tcgcaaatat gatgaaggct aatgctgtcg
     4201 gtttatggaa aagttgcttt gggtaaacaa aaaatacggc cccagaaggg caatgccgtt
     4261 cacttaagag gagcggcact atgtttcaca ggataacggg tttttgatat cttaaccgac
     4321 ctcggccttg atggtcgggg gcgttttgtt atgaacacca cttccagagc accccgaaga
     4381 tgctccagtc gtttcgggat ggtccctggt gacgccgtgt ttcccagctc tatcatttct
     4441 gatgcgatat tcttccacgc aggcagtagc cagtggcggt tacaaccctt ctggttcagc
     4501 gtcagcagca ggtcttcgct gtaaaaaagt ttatcaaaca acgtaataga gttatccggg
     4561 atggtggcga gcatggagtg ggccagcaca gtttcgctct gccggtaagg tgcggtcacg
     4621 gcattcagca gaatgtgact tcccaggttc attaaggcca ccagacgcat aaccgggtag
     4681 gcgttctgcc gcttagtgga tgtgttggca gacccataat attcacgcag ctcgggttta
     4741 tcaggtgtcc tgaactgtgc gccatcaatg gcaaaaagtt gcaggccgtg ccagtcatcc
     4801 ttcaggtaac gttccgcgcc cctgtctgtg cggtctggcg gaagagccat tccactgggg
     4861 cggcccccac gcgctgacgc gcctgggtga cagcgctgcg ggccagcagg ttcatccccg
     4921 cttcgccatc cgcgctcagg ttcagacggc gaacaacatc ggtaattggc tcattgcacc
     4981 accatccaga taaccatgtc ccccggtaaa cgacggcggc gaacggtcgc atgagcagaa
     5041 agcgtcaggc agtgttgtat ccactcggtg ggaaggtgtt ctgcaaatag ttgtgcagag
     5101 ggcggaggca taagcggatg gtcactgaaa tcgagcagat cattgagaag tggcataaga
     5161 aaacggctcc ctgttgtgga agccgttata gtgcctcagt ttaaggatcg gtcaactaat
     5221 ccttaactga tcggcattgc ccagaagggg ccgtttatgt tgccagacag cgctactgat
     5281 taagcggatt ttttcgcttt tttctcagct ttagccggag cagcttcttt cttcgctgca
     5341 gtttcacctt ctacataatc acgaccgtag taggtatcca gcagaatctg tttcagctcg
     5401 gagatcagcg ggtaacgcgg gttagcgccg gtgcactggt catcgaatgc atcttcagac
     5461 agtttatcca cgttcgccag gaagtctgct tcctgaacgc cagcttcacg gatagatttc
     5521 ggaataccca gttcagcttt cagcgtttcc agccatgcca gcagtttctc gatcttagca
     5581 gcagtacggt cgcccggtgc gctcagaccc aagtggtcgg caatttcagc ataacgacgg
     5641 cgagcctgcg gacggtcata ctggctgaat gcagtctgct tggtcgggtt gtcgttcgca
     5701 ttgtagcgaa taacgttaca aatcagcagg gcgtttgcca gaccgtgcgg aatatggaac
     5761 tgggaaccca gtttgtgcgc cattgagtga catacaccca ggaaggcgtt cgcaaacgcg
     5821 atacccgcga tagtcgctgc actgtgaaca cgttcacgcg ctaccggatt tttagaccct
     5881 tcgtggtagg acgctggcag atattctttc agcagtttca gtgcctgcag agcctgacca
     5941 tcagagaact cagatgccag tacagaaaca taagcttcca tggcgtgagt tactgcgtcc
     6001 agaccaccga aagcacacag ggacttcggc atgtccataa ccaggttggc gtcgacaatc
     6061 gccatatccg gagtcagcgc atagtctgcc agcggatatt tctgaccagt agcgtcgtca
     6121 gttacaaccg caaacggagt gacttcagaa cctgtaccag aagtggtggt gacagcgatc
     6181 attttcgctt tcacgcccat tttcgggaac ttgtagatac gtttacggat atccataaag
     6241 cgcagcgcca gctcttcgaa gtgagtttcc ggatgttcgt acataaccca catgatcttc
     6301 gcggcgtcca tcggggaacc accacccagc gcgataatca cgtctggttt gaaggagttt
     6361 gccagttctg cacctttacg aacgatgctc agggtcgggt ccgcttctac ttcgaagaag
     6421 acttcagttt caacgcctgc tgctttcagt acggaagtga tctgatcagc ataaccattg
     6481 ttgaacagga agcggtcagt cacgatgagc gcacgtttgt ggccatcagt aatcacttca
     6541 tccagcgcga ttggcaggga gccacggcgg aagtagatag atttcggaag tttgtgccac
     6601 aacatgtttt cagctcgctt agcaacggtt ttcttgttga tcaggtgttt cggaccaacg
     6661 ttttcagaga tggagttacc accccaagaa ccacaaccca gagtcaggga aggtgcgagt
     6721 ttgaagttat acaggtcacc gataccaccc tgagacgctg gggtgttaat caggatacgc
     6781 gccgttttca ttttctgacc gaagtaagaa acgcgagccg gttggttatc ctggtcagtg
     6841 tacaggcaag aggtatgacc gataccgccc atagcaacca gtttctctgc tttttctacc
     6901 gcgtcttcga aatctttagc gcggtacatt gccagagtcg gggacagttt ttcatgtgcg
     6961 aacggttcgc tttcatcaac aacggtcact tcaccgatca gaatcttggt gttttctggt
     7021 acagagaagc ctgccagttc agcaatttta taggctggct gaccaacgat agccgcgttc
     7081 agcgcaccgt ttttcaggat aacatcctga acagctttca gctctttacc ctgcaacaga
     7141 tagccgccgt gggttgcaaa acgttcacgt acagcgtcat aaacagagtc aacaacaaca
     7201 acagactgtt cagaagcaca gattacgccg ttgtcgaagg ttttggacat cagtacagat
     7261 gcaactgcac gtttgatatc agcagtttca tcgataacaa ctggagtgtt gcccgcgcct
     7321 acaccgatag ctggtttacc ggagctgtat gcggctttaa ccatgcccgg accaccagtc
     7381 gcgaggatca ggttgatgtc tgggtggtgc atcagtgcgt tagacagttc aacagaaggt
     7441 tgatcgatcc agccgatcag atctttcgga gcaccggcag cgatagcagc ctgcagaacg
     7501 atatcagccg ctttgttggt ggcatctttt gcacgcgggt gcggggagaa gataatggcg
     7561 ttacgggtct tcagactgat cagcgatttg aagatagcag ttgaagtcgg gttagtggtc
     7621 ggaacgatac cgcaaataat accgattggt tcagcgatag tgatggtacc aaaagtgtcg
     7681 tcttcagaca gaacaccaca ggttttttca tctttatagg cgttgtagat atattcagaa
     7741 gcaaagtggt ttttgatcac tttatcttcg acgataccca tgccggattc ggcaacggcc
     7801 attttcgcga gtgggattcg agcatctgca gcagccagag cggcggcgcg gaagattttg
     7861 tctacttgct cttgagtgaa actggcatat tcacgctggg ctttttttac acgctctacg
     7921 agtgcgttaa gttcagcgac attagtaaca gccataatgc tctcctgata atgttaaact
     7981 tttttagtaa atcatctgct cgaatacgag agtatagtca gtgcggtgat gatttgctta
     8041 acctatgaaa atcaaaagct tactcgcgct cacactcact gtgatttact aaaagagttt
     8101 aaacattaga gttattatct ctaatgcgtc acttccaggt ggcgtaagca agattactca
     8161 cttctgggta ctgattacgt gatccaaatc aaatttttgc aaagctgaca cctttcagca
     8221 tcgcttttcg ccattatagc taacagttaa taaattgtag tatgatttgg tggctacatt
     8281 agcatgtttt gcacaactag ataacaataa cgaatgatag caattttaag tagttaggag
     8341 gtgaaaaatg ctgtcaaaag gcgtattgtc agcgcgtctt ttcaacctta tttatggcta
     8401 acattatccg gcttttgctt cggagctaac cgtgattcag accttttttg attttcccgt
     8461 ttacttcaaa tttttcatcg ggttatttgc gctggtcaac ccggtaggga ttattcccgt
     8521 ctttatcagc atgaccagtt atcagacagc ggcagcgcga aacaaaacta accttacagc
     8581 caacctgtct gtggccatta tcttgtggat ctcgcttttt ctcggcgaca cgattctaca
     8641 actttttggt atatcaattg attcgttccg tatcgccggg ggtatcctgg tggtgacaat
     8701 agcgatgtcg atgatcagcg gcaagcttgg cgaggataaa cagaacaagc aagaaaaatc
     8761 agaaaccgcg gtacgtgaaa gcattggtgt ggtgccactg gcgttgccgt tgatggcggg
     8821 gccaggggcg atcagttcta ccatcgtctg gggtacgcgt tatcacagca ttagctatct
     8881 gtttggtttc tttgtggcta ttgcattgtt cgctttatgt tgttggggat tgttccgcat
     8941 ggcaccgtgg ctggtacggg ttttacgcca gaccggcatc aacgtgatta cgcgtattat
     9001 ggggctattg ctgatggcat tggggattga atttatcgtt actggtatta aggggatttt
     9061 ccccggcctg cttaattaat tcctttcaaa tgaaacggag ctgccatgct ccgtttactt
     9121 cgtcattatt tttactttgt tcccgcgcag ttatcaaaag caaaaggaat aggtaaaaat
     9181 attcttctca aattacagtt agttataagg atttccttaa ctgcttctcc tcaccatcat
     9241 gttattttcg ccacatcata atcctgggct tgctgaagaa taattgaaat gatattatta
     9301 attccactgc ctttggtaga ggaaagtgct aaataataat caattgttaa attattgtgc
     9361 atttcactac tggaactgta atcagaaaag atagacatgc ttagccaatc tctatttgat
     9421 tgaattgaaa gatgtttgtt aaggcatgga tgcaagctat agattctgat acggtcaata
     9481 aaagagaatt gcttaacaat tttgcaaaat gtattggcga gtaagaaccg catttggtac
     9541 tttccgggca accgccagac gattctttat tggtaatgag aataattaac aattaaagag
     9601 cgtcgcgaaa gaataatgtg tctcgacagg ggagacacag tacgaatcga cataaggtga
     9661 tcgtctgaat caccagaata aataaagtcg gtgatagtaa tacgtaacga taaagtaacc
     9721 tgacagcaga aagtctccga gcctgtgcag ggtcccaatc cgggattaca catgctggtt
     9781 aataccagta attataatga gggagtccaa aaaacaatga ccaacatcac caagagaagt
     9841 ttagtagcag ctggcgttct ggctgcgcta atggcaggga atgtcgcgct ggcagctgat
     9901 gtacccgcag gcgtcacact ggcggaaaaa caaacactgg tacgtaacaa tggttcagaa
     9961 gttcagtcat tagatccgca caaaattgaa ggtgttccgg agtctaatat cagccgagac
    10021 ctgtttgaag gcttactggt cagcgatctt gacggtcatc cagcacctgg cgtcgctgaa
    10081 tcctgggata ataaagacgc gaaagtctgg accttccatt tgcgtaaaga tgcgaaatgg
    10141 tctgatggca cgccagtcac agcacaagac tttgtgtata gctggcaacg ttctgttgat
    10201 ccgaacactg cttctccgta tgccagttat ctgcaatatg ggcatatcgc cggtattgat
    10261 gaaattcttg aagggaaaaa accgattacc gatctcggcg tgaaagctat tgatgatcac
    10321 acattagaag tcaccttaag tgaacccgtt ccgtacttct ataaattact tgttcaccca
    10381 tcaacttcac cggtgccaaa agccgctatc gagaaattcg gcgaaaaatg gacccagcct
    10441 ggtaatatcg tcaccaacgg tgcctatacc ttaaaagatt gggtcgtaaa cgaacgaatc
    10501 gttcttgaac gcagcccgac ctactggaac aacgcgaaaa ccgttattaa ccaggtaacc
    10561 tatttgccta ttgcttctga agttaccgat gtcaaccgct accgtagtgg tgaaatcgac
    10621 atgacgaata acagcatgcc gatcgaattg ttccagaagc tgaaaaaaga gatcccggac
    10681 gaagttcacg ttgatccata cctgtgcact tactattacg aaattaacaa ccagaaaccg
    10741 ccattcaacg atgtgcgtgt gcgtaccgca ctgaaactag gtatggaccg cgatatcatt
    10801 gttaataaag tgaaagcgca gggcaacatg cccgcctatg gttacactcc accgtatact
    10861 gatggcgcaa aattgactca gccagaatgg tttggctgga gccaggaaaa acgtaacgaa
    10921 gaagcgaaaa aactgctggc tgaagcgggt tataccgcag acaaaccgtt gaccatcaac
    10981 ctgttgtata acacctccga tctgcataaa aagctggcga ttgctgcctc ttcattgtgg
    11041 aagaaaaaca ttggtgtaaa cgtcaaactg gttaaccagg agtggaaaac gttcctcgac
    11101 acccgtcacc agggtacttt tgatgtggcc cgtgcaggct ggtgtgctga ctacaacgaa
    11161 ccaacttcct tcctgaacac catgctttcg aacagctcga tgaataccgc gcattataag
    11221 agcccggcct ttgacagcat tatggcggaa acgctgaaag tgactgacga ggcgcagcgc
    11281 acagctctgt acactaaagc agaacaacag ctggataagg attcggccat tgttcctgtt
    11341 tattactacg tgaatgcgcg tctggtgaaa ccgtgggttg gtggctatac cggcaaagat
    11401 ccgctggata atacctatac ccggaatatg tacattgtga agcactaatg gcaatacgtg
    11461 gggcaggagt gtcctgctcc acggtgtctg atttttatcg cattacagaa ggcacaggc
//