LOCUS pyrFEKO-PUR 5753 bp DNA circular 16-MAR-2010 The two PvuII sites that flank the PYR-FE targeting sequences are not shown on the map. FEATURES Location/Qualifiers misc_feature 2397..2893 /function="PYR-FE Downstream" /note="Sequence updated to agree with results of sequence done by Mike 21 Dec 06. Some minor disagreements with Sanger TbDB sequence. " CDS 3898..4755 /function="LACTAMASE" misc_feature 4..369 /note="PYR-FE upstream" /note="ends 136 bp before start of CDS." protein_bind 376..409 /function="loxP" 5'UTR 437..522 /function="Procyclin SAS and UTR" /note="The various procyclin SASs differ by onl 1 nt, really. This sequence taken from pNS10-54, with hindIII site removed (AAGCTT > AAGACT, leaving splice acceptor AG intact)" 5'UTR 523..580 /function="Residual UTR" /note="With minor changes to delete the infamous 'upstream ATG' CDS and about 4 deletions/insertions/changes to remove or create restriction sites, compared to the version in pNS10-54, from which Mike got the sequence." 3'UTR 2902..2960 /note="ALD 3' UTR (part)" /note="starts 96 bp after aldolase stop codon. Note that the SbfI (CCTGCAGG) site occurs in the aldolase 3' utr sequence at this point, so will be in lots of our plasmids." CDS 581..1177 /function="PUR CDS" CDS 581..2350 /function="PUR-TK" CDS 1220..2350 /function="HSVTK CDS" CDS 1184..1213 /function="Ty-1 epitope" protein_bind 2357..2390 /function="loxP" ORIGIN 1 CTGTGGCGAA TAATTGTAGC GGCAGTTTCT AACGGTTATA TGTTTAACTG CATTATGCCC 61 CTCTCTCACT TGGCGCCATC TCTGTGTTAC AACATGCGTT TGATCTATCA GTTTATTATA 121 TTGTTTTCCC TCAAACACAT GCGAAGCGAG TAAGAGTTGA AAATGTTACG GCAAAGGCCA 181 ATGGGCACTG GTAGAAGGCA ACGGATCGCG GTTATGAAAG TTCTTGTGTG TATGTTTATG 241 TGGGGGTGGG GGTGGAAGTT GTGGTAGTGT CATCTCGTGT ACATTGTTTA AGTCGCTTCA 301 ACCTTGTTTC TTTTTTTTTT CCTTTTCTTC TTCTTCGCTA TCTGATTGAG TTTTCATACT 361 AAGTTGGATA AGCTTATAAC TTCGTATAGC ATACATTATA CGAAGTTATa ccggtACAcG 421 TTCTCGTCgg ccggccGCTG CACGCGCCTT CGAGTTTTTT TTCCTTTTCC CCATTTTTTT 481 CAACTTGAAG aCTTCAATTA CACCAAAAAA TAAAATTCAC AAACTTGGtA TTCCTTTGTG 541 TTACATTCTT GATCGCTCGC ACTGACATTA CaGATCTACC ATGACCGAGT ACAAGCCCAC 601 GGTGCGCCTC GCCACCCGCG ACGACGTCCC CAGGGCCGTA CGCACCCTCG CCGCCGCGTT 661 CGCCGACTAC CCCGCCACGC GCCACACCGT CGATCCAGAC CGCCACATCG AGCGGGTCAC 721 CGAGCTGCAA GAACTCTTCC TCACGCGCGT CGGGCTCGAC ATCGGCAAGG TGTGGGTCGC 781 GGACGACGGC GCAGCAGTGG CGGTCTGGAC CACGCCGGAG AGCGTCGAAG CGGGGGCGGT 841 GTTCGCCGAG ATCGGCCCGC GCATGGCCGA GTTGAGCGGT TCCCGGCTGG CCGCGCAGCA 901 ACAGATGGAA GGCCTCCTGG CGCCGCACCG GCCCAAGGAG CCCGCGTGGT TCCTGGCCAC 961 CGTCGGCGTC TCGCCCGACC ACCAGGGCAA GGGTCTGGGC AGCGCCGTCG TGCTCCCCGG 1021 AGTGGAGGCG GCCGAGCGCG CCGGGGTGCC CGCCTTCCTG GAGACCTCCG CGCCCCGCAA 1081 CCTCCCCTTC TACGAGCGGC TCGGCTTCAC CGTCACCGCC GACGTCGAGG TGCCCGAAGG 1141 ACCGCGCACC TGGTGCATGA CCCGCAAGCC CGGTGCCTCT AGAGAAGTCC ATACTAACCA 1201 GGACCCACTT GACGCCACCA TGGCCTCGTA CCCCGGCCAT CAACACGCGT CTGCGTTCGA 1261 CCAGGCTGCG CGTTCTCGCG GCCATAGCAA CCGACGTACG GCGTTGCGCC CTCGCCGGCA 1321 GCAAGAAGCC ACGGAAGTCC GCCCGGAGCA GAAAATGCCC ACGCTACTGC GGGTTTATAT 1381 AGACGGTCCC CACGGGATGG GGAAAACCAC CACCACGCAA CTGCTGGTGG CCCTGGGTTC 1441 GCGCGACGAT ATCGTCTACG TACCCGAGCC GATGACTTAC TGGCGGGTGC TGGGGGCTTC 1501 CGAGACAATC GCGAACATCT ACACCACACA ACACCGCCTC GACCAGGGTG AGATATCGGC 1561 CGGGGACGCG GCGGTGGTAA TGACAAGCGC CCAGATAACA ATGGGCATGC CTTATGCCGT 1621 GACCGACGCC GTTCTGGCTC CTCATATCGG GGGGGAGGCT GGGAGCTCAC ATGCCCCGCC 1681 CCCGGCCCTC ACCCTCATCT TCGACCGCCA TCCCATCGCC GCCCTCCTGT GCTACCCGGC 1741 CGCGCGGTAC CTTATGGGCA GCATGACCCC CCAGGCCGTG CTGGCGTTCG TGGCCCTCAT 1801 CCCGCCGACC TTGCCCGGCA CCAACATCGT GCTTGGGGCC CTTCCGGAGG ACAGACACAT 1861 CGACCGCCTG GCCAAACGCC AGCGCCCCGG CGAGCGGCTG GACCTGGCTA TGCTGGCTGC 1921 GATTCGCCGC GTTTACGGGC TACTTGCCAA TACGGTGCGG TATCTGCAGT GCGGCGGGTC 1981 GTGGCGGGAG GACTGGGGAC AGCTTTCGGG GACGGCCGTG CCGCCCCAGG GTGCCGAGCC 2041 CCAGAGCAAC GCGGGCCCAC GACCCCATAT CGGGGACACG TTATTTACCC TGTTTCGGGC 2101 CCCCGAGTTG CTGGCCCCCA ACGGCGACCT GTATAACGTG TTTGCCTGGG CCTTGGACGT 2161 CTTGGCCAAA CGCCTCCGTT CCATGCACGT CTTTATCCTG GATTACGACC AATCGCCCGC 2221 CGGCTGCCGG GACGCCCTGC TGCAACTTAC CTCCGGGATG GTCCAGACCC ACGTCACCAC 2281 CCCCGGCTCC ATACCGACGA TATGCGACCT GGCGCGCACG TTTGCCCGGG AGATGGGGGA 2341 GGCTAACTGA GAATTCATAA CTTCGTATAG CATACATTAT ACGAAGTTAT GGATCCTAAG 2401 TGGGTCGGAG CTCTATCTTT AAGCTATTGC GGACGTACAC ACATGTTTTC GTTGCAAATT 2461 ATTTTCGtAC CTACTTCAGA TCGTAAGCGT GGGGAATAAT AATAATGCTC CCTACCATGA 2521 ATTTAAaCAG TTTGGTGAAT GAACAATTCa CTTTAAATTG ATGGACCATG AAATGCACTT 2581 TTACGGTCGC AGCGTTTAAC TAAGTGGCGA aGGCAAGTTT TTAATAATAA TAATAAAGCA 2641 AgTAGTAAAC TATATGAGAC AGCAATGGGG TTTGGGAGGG AAGGTTTAAT CGCTTCAAAG 2701 GTATTTGTGT GTGGTTGAGG AGGTGATAGC GAAGTGAGGG TTTCTAATAA CTGTAGAGCA 2761 GCAATAAAAA AAAGGTAGCA GTTGATCAAT TTGCTGTGGT GCCCTCTGCA TTGAAGGGTA 2821 TCTGGGATGT ATGAAGTTCC ATCACTCGGA GCCATCACCT TTCCCTCATT TCTCGTTTAC 2881 ATCCTTTACA TGTCAGCTGC CTGCAGGTTG GTTAGGAAGG GGGGATGATG TAAAAGAAGA 2941 AAATGGGGGG ATTCGAGCCC CCTTGCAGGC ATGCAAGCTA GCTTGTATTC TATAGTGTCA 3001 CCTAAATCGT ATGTGTATGA TACATAAGGT TATGTATTAA TTGTAGCCGC GTTCTAACGA 3061 CAATATGTAC AAGCCTAATT GTGTAGCATC TGGCTTACTG AAGCAGACCC TATCATCTCT 3121 CTCGTAAACT GCCGTCAGAG TCGGTTTGGT TGGACGAACC TTCTGAGTTT CTGGTAACGC 3181 CGTTCCGCAC CCCGGAAATG GTCAGCGAAC CAATCAGCAG GGTCATCGCT AGCCAGATCC 3241 TCTACGCCGG ACGCATCGTG GCCGGCATCA CCGGCGCCAC AGGTGCGGTT GCTGGCGCCT 3301 ATATCGCCGA CATCACCGAT GGGGAAGATC GGGCTCGCCA CTTCGGGCTC ATGAGCGCTT 3361 GTTTCGGCGT GGGTATGGTG GCAGGCCCCG TGGCCGGGGG ACTGTTGGGC GCCATCTCCT 3421 TGCACCATTC CTTGCGGCGG CGGTGCTCAA CGGCCTCAAC CTACTACTGG GCTGCTTCCT 3481 AATGCAGGAG TCGCATAAGG GAGAGCGTCG ATATGGTGCA CTCTCAGTAC AATCTGCTCT 3541 GATGCCGCAT AGTTAAGCCA GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG 3601 GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG 3661 TGTCAGAGGT TTTCACCGTC ATCACCGAAA CGCGCGAGAC GAAAGGGCCT CGTGATACGC 3721 CTATTTTTAT AGGTTAATGT CATGATAATA ATGGTTTCTT AGACGTCAGG TGGCACTTTT 3781 CGGGGAAATG TGCGCGGAAC CCCTATTTGT TTATTTTTCT AAATACATTC AAATATGTAT 3841 CCGCTCATGA GACAATAACC CTGATAAATG CTTCAATAAT ATTGAAAAAG GAAGAGTATG 3901 AGTATTCAAC ATTTCCGTGT CGCCCTTATT CCCTTTTTTG CGGCATTTTG CCTTCCTGTT 3961 TTTGCTCACC CAGAAACGCT GGTGAAAGTA AAAGATGCTG AAGATCAGTT GGGTGCACGA 4021 GTGGGTTACA TCGAACTGGA TCTCAACAGC GGTAAGATCC TTGAGAGTTT TCGCCCCGAA 4081 GAACGTTTTC CAATGATGAG CACTTTTAAA GTTCTGCTAT GTGGCGCGGT ATTATCCCGT 4141 ATTGACGCCG GGCAAGAGCA ACTCGGTCGC CGCATACACT ATTCTCAGAA TGACTTGGTT 4201 GAGTACTCAC CAGTCACAGA AAAGCATCTT ACGGATGGCA TGACAGTAAG AGAATTATGC 4261 AGTGCTGCCA TAACCATGAG TGATAACACT GCGGCCAACT TACTTCTGAC AACGATCGGA 4321 GGACCGAAGG AGCTAACCGC TTTTTTGCAC AACATGGGGG ATCATGTAAC TCGCCTTGAT 4381 CGTTGGGAAC CGGAGCTGAA TGAAGCCATA CCAAACGACG AGCGTGACAC CACGATGCCT 4441 GTAGCAATGG CAACAACGTT GCGCAAACTA TTAACTGGCG AACTACTTAC TCTAGCTTCC 4501 CGGCAACAAT TAATAGACTG GATGGAGGCG GATAAAGTTG CAGGACCACT TCTGCGCTCG 4561 GCCCTTCCGG CTGGCTGGTT TATTGCTGAT AAATCTGGAG CCGGTGAGCG TGGGTCTCGC 4621 GGTATCATTG CAGCACTGGG GCCAGATGGT AAGCCCTCCC GTATCGTAGT TATCTACACG 4681 ACGGGGAGTC AGGCAACTAT GGATGAACGA AATAGACAGA TCGCTGAGAT AGGTGCCTCA 4741 CTGATTAAGC ATTGGTAACT GTCAGACCAA GTTTACTCAT ATATACTTTA GATTGATTTA 4801 AAACTTCATT TTTAATTTAA AAGGATCTAG GTGAAGATCC TTTTTGATAA TCTCATGACC 4861 AAAATCCCTT AACGTGAGTT TTCGTTCCAC TGAGCGTCAG ACCCCGTAGA AAAGATCAAA 4921 GGATCTTCTT GAGATCCTTT TTTTCTGCGC GTAATCTGCT GCTTGCAAAC AAAAAAACCA 4981 CCGCTACCAG CGGTGGTTTG TTTGCCGGAT CAAGAGCTAC CAACTCTTTT TCCGAAGGTA 5041 ACTGGCTTCA GCAGAGCGCA GATACCAAAT ACTGTCCTTC TAGTGTAGCC GTAGTTAGGC 5101 CACCACTTCA AGAACTCTGT AGCACCGCCT ACATACCTCG CTCTGCTAAT CCTGTTACCA 5161 GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT CTTACCGGGT TGGACTCAAG ACGATAGTTA 5221 CCGGATAAGG CGCAGCGGTC GGGCTGAACG GGGGGTTCGT GCACACAGCC CAGCTTGGAG 5281 CGAACGACCT ACACCGAACT GAGATACCTA CAGCGTGAGC TATGAGAAAG CGCCACGCTT 5341 CCCGAAGGGA GAAAGGCGGA CAGGTATCCG GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC 5401 ACGAGGGAGC TTCCAGGGGG AAACGCCTGG TATCTTTATA GTCCTGTCGG GTTTCGCCAC 5461 CTCTGACTTG AGCGTCGATT TTTGTGATGC TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC 5521 GCCAGCAACG CGGCCTTTTT ACGGTTCCTG GCCTTTTGCT GGCCTTTTGC TCACATGTTC 5581 TTTCCTGCGT TATCCCCTGA TTCTGTGGAT AACCGTATTA CCGCCTTTGA GTGAGCTGAT 5641 ACCGCTCGCC GCAGCCGAAC GACCGAGCGC AGCGAGTCAG TGAGCGAGGA AGCGGAAGAG 5701 CGCCCAATAC GCAAACCGCC TCTCCCCGCG CGTTGGCCGA TTCATTAATG CAG //