LOCUS       pHD328        8399 bp    DNA    circular    19-OCT-2009
This file incorporates various sequence corrections made by Cross Lab since original version from Christine Clayton. It is used by Liz Wirtz to create the 'single marker' line of bloodstream-form Trypanosoma brucei Lister 427.
FEATURES             Location/Qualifiers
     misc_feature    1..906
                     /function="B-TUBULIN target"
     5'UTR           910..1106
                     /function="ALD SAS"
     5'UTR           1107..1198
                     /function="PROCYCLIN SAS"
                     /note="parp sas region derived from pHD108"
     CDS             4265..5290
                     /note="HYG"
     3'UTR           5339..5619
                     /function="ACT 3' UTR"
     3'UTR           3985..4141
                     /function="ALD 3' UTR"
     5'UTR           4142..4262
                     /function="ACT 5' UTR"
     misc_feature    1269..1305
                     /note="NLS"
                     /note="36-nt insertion of SV40 T antigen NLS (JJ Dunn et al, 1988, Gene 68, 259-266)"
     CDS             1236..3923
                     /note="T7RNAP"
                     /note="Sequence corrected October 2009 based on actual sequencing of (unpublished) ZM construct (formally 'pyrFEKO-HYG tetr-t7rnap actual correct'), which matches an updated 2009 sequence in GenBank (FJ881694)."
BASE COUNT     2021 a   2135 c   2202 g   2041 t
ORIGIN
1 CTGACCGTAT CATGATGACT TTCTCCATCA TCCCATCCCC CAAGGTGTCC GACACTGTCG 61 TCGAGCCGTA CAATACGACT CTCTCCGTGC ACCAACTTGT GGAAAACTCC GATGAGTCGA 121 TGTGCATTGA CAACGAGGCA CTGTACGATA TTTGCTTCCG CACCCTGAAA CTGACAACAC 181 CAACGTTCGG TGACCTGAAC CACTTGGTGT CTGCTGTTGT GTCCGGCGTC ACCTGCTGCC 241 TGCGCTTCCC TGGTCAGTTG AACTCTGACC TCCGTAAGTT GGCTGTGAAC CTTGTCCCAT 301 TCCCGCGTCT GCACTTCTTC ATGATGGGCT TCGCCCCGCT GACCAGCCGC GGCTCGCAGC 361 AGTACCGCGG TCTCTCCGTG CCCGAGCTAA CGCAGCAGAT GTTCGATGCG AAAAACATGA 421 TGCAAGCTGC AGATCCTCGT CACGGCCGCT ACCTGACAGC GTCTGCACTC TTCCGCGGCC 481 GCATGTCGAC GAAGGAGGTT GATGAGCAGA TGCTGAACGT GCAGAACAAG AACTCGTCCT 541 ACTTCATTGA GTGGATCGAT CCCGAACAAC ATCAAGTCCT CTGTTTGCGA TATCCCACCC 601 AAGGGACTCA AGATGGCTGT CACCTTCATT GGCAACAACA CCTGCATCCA GGAGATGTTC 661 CGCCGTGTGG GAGAGCAGTT CACCCTCATG TTCCGTCGCA AGGCGTTCTT GCACTGGTAC 721 ACTGGCGAGG GTATGGACGA GATGGAATTC ACGGAGGCAG AGTCCAACAT GAACGATCTC 781 GTGTCTGAGT ACCAGCAGTA CCAGGATGCC ACGATTGAGG AGGAGGGCGA
GTTCGACGAG 841 GAGGAGCAAT ACTAGACGCG GACGGGGCAT TTCCCGTTCG TCATTAGCAG TAGGTAATGA 901 AGATGCTCGA GGGTGCTCAA GCTGTGTAGC GCACGCGTTT CCTTACATAT TTCTCTAACA 961 GGCACGGAAG CCTAACAAAT ACACTTGGCT TATTTTTTTG CCCCCTCATG TCTTGTACAA 1021 ATATTTGCGA TAGCTTAGCT ATCAGCCACA TTAATCAAAC AAGTATACCA ACAAGCCCGA 1081 AAACATAAAC TCAACTGCAA CGAAGCTGGG CTGCACGCGC CTTCGAGTTT TTTTTCCTTT 1141 TCCCCATTTT TTTCAACTTG AAGACTTCAA TTACACCAAA AAGTAAAATT CACAAGCTGA 1201 TCTGATCCGG ATTTACTAAC TGGAAGAGGC ACTAAATGAA CACGATTAAC ATCGCTAAGA 1261 ACGAATTCCT CGAGCCTCCA AAAAAGAAGA GAAAGGTCGA ATTCTCTGAC ATCGAACTGG 1321 CTGCTATCCC GTTCAACACT CTGGCTGACC ATTACGGTGA GCGTTTAGCT CGCGAACAGT 1381 TGGCCCTTGA GCATGAGTCT TACGAGATGG GTGAAGCACG CTTCCGCAAG ATGTTTGAGC 1441 GTCAACTTAA AGCTGGTGAG GTTGCGGATA ACGCTGCCGC CAAGCCTCTC ATCACTACCC 1501 TACTCCCTAA GATGATTGCA CGCATCAACG ACTGGTTTGA GGAAGTGAAA GCTAAGCGCG 1561 GCAAGCGCCC GACAGCCTTC CAGTTCCTGC AAGAAATCAA GCCGGAAGCC GTAGCGTACA 1621 TCACCATTAA GACCACTCTG GCTTGCCTAA CCAGTGCTGA CAATACAACC GTTCAGGCTG 1681 TAGCAAGCGC AATCGGTCGG GCCATTGAGG ACGAGGCTCG CTTCGGTCGT ATCCGTGACC 1741 TTGAAGCTAA GCACTTCAAG AAAAACGTTG AGGAACAACT CAACAAGCGC GTAGGGCACG 1801 TCTACAAGAA AGCATTTATG CAAGTTGTCG AGGCTGACAT GCTCTCTAAG GGTCTACTCG 1861 GTGGCGAGGC GTGGTCTTCG TGGCATAAGG AAGACTCTAT TCATGTAGGA GTACGCTGCA 1921 TCGAGATGCT CATTGAGTCA ACCGGAATGG TTAGCTTACA CCGCCAAAAT GCTGGCGTAG 1981 TAGGTCAAGA CTCTGAGACT ATCGAACTCG CACCTGAATA CGCTGAGGCT ATCGCAACCC 2041 GTGCAGGTGC GCTGGCTGGC ATCTCTCCGA TGTTCCAACC TTGCGTAGTT CCTCCTAAGC 2101 CGTGGACTGG CATTACTGGT GGTGGCTATT GGGCTAACGG TCGTCGTCCT CTGGCGCTGG 2161 TGCGTACTCA CAGTAAGAAA GCACTGATGC GCTACGAAGA CGTTTACATG CCTGAGGTGT 2221 ACAAAGCGAT TAACATTGCG CAAAACACCG CATGGAAAAT CAACAAGAAA GTCCTAGCGG 2281 TCGCCAACGT AATCACCAAG TGGAAGCATT GTCCGGTCGA GGACATCCCT GCGATTGAGC 2341 GTGAAGAACT CCCGATGAAA CCGGAAGACA TCGACATGAA TCCTGAGGCT CTCACCGCGT 2401 GGAAACGTGC TGCCGCTGCT GTGTACCGCA AGGACAAGGC TCGCAAGTCT CGCCGTATCA 2461 GCCTTGAGTT CATGCTTGAG CAAGCCAATA AGTTTGCTAA CCATAAGGCC ATCTGGTTCC 
2521 CTTACAACAT GGACTGGCGC GGTCGTGTTT ACGCTGTGTC AATGTTCAAC CCGCAAGGTA 2581 ACGATATGAC CAAAGGACTG CTTACGCTGG CGAAAGGTAA ACCAATCGGT AAGGAAGGTT 2641 ACTACTGGCT GAAAATCCAC GGTGCAAACT GTGCGGGTGT CGATAAGGTT CCGTTCCCTG 2701 AGCGCATCAA GTTCATTGAG GAAAACCACG AGAACATCAT GGCTTGCGCT AAGTCTCCAC 2761 TGGAGAACAC TTGGTGGGCT GAGCAAGATT CTCCGTTCTG CTTCCTTGCG TTCTGCTTTG 2821 AGTACGCTGG GGTACAGCAC CACGGCCTGA GCTATAACTG CTCCCTTCCG CTGGCGTTTG 2881 ACGGGTCTTG CTCTGGCATC CAGCACTTCT CCGCGATGCT CCGAGATGAG GTAGGTGGTC 2941 GCGCGGTTAA CTTGCTTCCT AGTGAAACCG TTCAGGACAT CTACGGGATT GTTGCTAAGA 3001 AAGTCAACGA GATTCTACAA GCAGACGCAA TCAATGGGAC CGATAACGAA GTAGTTACCG 3061 TGACCGATGA GAACACTGGT GAAATCTCTG AGAAAGTCAA GCTGGGCACT AAGGCACTGG 3121 CTGGTCAATG GCTGGCTTAC GGTGTTACTC GCAGTGTGAC TAAGCGTTCA GTCATGACGC 3181 TGGCTTACGG GTCCAAAGAG TTCGGCTTCC GTCAACAAGT GCTGGAAGAT ACCATTCAGC 3241 CAGCTATTGA TTCCGGCAAG GGTCTGATGT TCACTCAGCC GAATCAGGCT GCTGGATACA 3301 TGGCTAAGCT GATTTGGGAA TCTGTGAGCG TGACGGTGGT AGCTGCGGTT GAAGCAATGA 3361 ACTGGCTTAA GTCTGCTGCT AAGCTGCTGG CTGCTGAGGT CAAAGATAAG AAGACTGGAG 3421 AGATTCTTCG CAAGCGTTGC GCTGTGCATT GGGTAACTCC TGATGGTTTC CCTGTGTGGC 3481 AGGAATACAA GAAGCCTATT CAGACGCGCT TGAACCTGAT GTTCCTCGGT CAGTTCCGCT 3541 TACAGCCTAC CATTAACACC AACAAAGATA GCGAGATTGA TGCACACAAA CAGGAGTCTG 3601 GTATCGCTCC TAACTTTGTA CACAGCCAAG ACGGTAGCCA CCTTCGTAAG ACTGTAGTGT 3661 GGGCACACGA GAAGTACGGA ATCGAATCTT TTGCACTGAT TCACGACTCC TTCGGTACCA 3721 TTCCGGCTGA CGCTGCGAAC CTGTTCAAAG CAGTGCGCGA AACTATGGTT GACACATATG 3781 AGTCTTGTGA TGTACTGGCT GATTTCTACG ACCAGTTCGC TGACCAGTTG CACGAGTCTC 3841 AATTGGACAA AATGCCAGCA CTTCCGGCTA AAGGTAACTT GAACCTCCGT GACATCTTAG 3901 AGTCGGACTT CGCGTTCGCG TAACGCCAAA TCAATACGAC TCACTATAGA GGGACAAACT 3961 CAAGGTCATT CGCAAGAGTG GCCGGATCCT GCCCATTTAG TTAGTTGGCT TTTCCCTTGT 4021 CTCGTGTCTT TTCCGTGGAA AGGTTCCCGG AGTAATCTGA TGGCACAGCA GGGAGGTGCG 4081 CCTGCAGGTT GGTTAGGAAG GGGGGATGAT GTAAAAGAAG AAAATGGGGG GATTCGAGCC 4141 CGGGCACAGC AAGGTCTTCT GAAATTCATG TTTTTTTTTT TTTTACTCTG CATTGCAGTC 4201 
TCCGCTCTTA TTTAGTTTTG CTTTACGTAA GGTCTCGTTG CTGCCATAAA ATAAGCTACT 4261 AGTGATGAAA AAGCCTGAAC TCACCGCGAC GTCTGTCGAG AAGTTTCTGA TCGAAAAGTT 4321 CGACAGCGTC TCCGACCTGA TGCAGCTCTC GGAGGGCGAA GAATCTCGTG CTTTCAGCTT 4381 CGATGTAGGA GGGCGTGGAT ATGTCCTGCG GGTAAATAGC TGCGCCGATG GTTTCTACAA 4441 AGATCGTTAT GTTTATCGGC ACTTTGCATC GGCCGCGCTC CCGATTCCGG AAGTGCTTGA 4501 CATTGGGGAA TTCAGCGAGA GCCTGACCTA TTGCATCTCC CGCCGTGCAC AGGGTGTCAC 4561 GTTGCAAGAC CTGCCTGAAA CCGAACTGCC CGCTGTTCTG CAGCCGGTCG CGGAGGCCAT 4621 GGATGCGATC GCTGCGGCCG ATCTTAGCCA GACGAGCGGG TTCGGCCCAT TCGGACCGCA 4681 AGGAATCGGT CAATACACTA CATGGCGTGA TTTCATATGC GCGATTGCTG ATCCCCATGT 4741 GTATCACTGG CAAACTGTGA TGGACGACAC CGTCAGTGCG TCCGTCGCGC AGGCTCTCGA 4801 TGAGCTGATG CTTTGGGCCG AGGACTGCCC CGAAGTCCGG CACCTCGTGC ACGCGGATTT 4861 CGGCTCCAAC AATGTCCTGA CGGACAATGG CCGCATAACA GCGGTCATTG ACTGGAGCGA 4921 GGCGATGTTC GGGGATTCCC AATACGAGGT CGCCAACATC TTCTTCTGGA GGCCGTGGTT 4981 GGCTTGTATG GAGCAGCAGA CGCGCTACTT CGAGCGGAGG CATCCGGAGC TTGCAGGATC 5041 GCCGCGGCTC CGGGCGTATA TGCTCCGCAT TGGTCTTGAC CAACTCTATC AGAGCTTGGT 5101 TGACGGCAAT TTCGATGATG CAGCTTGGGC GCAGGGTCGA TGCGACGCAA TCGTCCGATC 5161 CGGAGCCGGG ACTGTCGGGC GTACACAAAT CGCCCGCAGA AGCGCGGCCG TCTGGACCGA 5221 TGGCTGTGTA GAAGTACTCG CCGATAGTGG AAACCGACGC CCCAGCACTC GTCCGAGGGC 5281 AAAGGAATAG AGTAGATGCC GACCGAACAA GGATCGATCC TAACACCGGG TTGTGTGGCC 5341 AAAATTGTTC TGTAGTCGCT GTGAGTTGAC ACGGCTAGTG CTTATGATTT TCCTCGCGTG 5401 TGGTGCCTGT ACTCAGCCCT ATGCCTTATT TGCAACACAT TTACGTACAG CGCACAAGAG 5461 AAGAGAAGAT CACTTGAAGA TAATAAATAT AGGGTTGTAG GCATCTTGTT TAACTCAAAT 5521 TTTCTCGTCT TGGTGTGTCG ACATGATTGA AATAGTGCCA CCAGTTGTGT TTGATGCGTT 5581 TGTTATCTAT GCAGTATTCT GCAAGGCCTT GCAGGCATGC AAGCTAGCTT GTATTCTATA 5641 GTGTCACCTA AATCGTATGT GTATGATACA TAAGGTTATG TATTAATTGT AGCCGCGTTC 5701 TAACGACAAT ATGTACAAGC CTAATTGTGT AGCATCTGGC TTACTGAAGC AGACCCTATC 5761 ATCTCTCTCG TAAACTGCCG TCAGAGTCGG TTTGGTTGGA CGAACCTTCT GAGTTTCTGG 5821 TAACGCCGTT CCGCACCCCG GAAATGGTCA GCGAACCAAT CAGCAGGGTC ATCGCTAGCC 5881 AGATCCTCTA 
CGCCGGACGC ATCGTGGCCG GCATCACCGG CGCCACAGGT GCGGTTGCTG 5941 GCGCCTATAT CGCCGACATC ACCGATGGGG AAGATCGGGC TCGCCACTTC GGGCTCATGA 6001 GCGCTTGTTT CGGCGTGGGT ATGGTGGCAG GCCCCGTGGC CGGGGGACTG TTGGGCGCCA 6061 TCTCCTTGCA CCATTCCTTG CGGCGGCGGT GCTCAACGGC CTCAACCTAC TACTGGGCTG 6121 CTTCCTAATG CAGGAGTCGC ATAAGGGAGA GCGTCGATAT GGTGCACTCT CAGTACAATC 6181 TGCTCTGATG CCGCATAGTT AAGCCAGCCC CGACACCCGC CAACACCCGC TGACGCGCCC 6241 TGACGGGCTT GTCTGCTCCC GGCATCCGCT TACAGACAAG CTGTGACCGT CTCCGGGAGC 6301 TGCATGTGTC AGAGGTTTTC ACCGTCATCA CCGAAACGCG CGAGACGAAA GGGCCTCGTG 6361 ATACGCCTAT TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC 6421 ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT 6481 ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG 6541 AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT 6601 CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT 6661 GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC 6721 CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA 6781 TCCCGTATTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC 6841 TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA 6901 TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG 6961 ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC 7021 CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG 7081 ATGCCTGTAG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA 7141 GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG 7201 CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG 7261 TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC 7321 TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT 7381 GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT 7441 GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC 7501 ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG 7561 ATCAAAGGAT CTTCTTGAGA 
TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA 7621 AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG 7681 AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG 7741 TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG 7801 TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA 7861 TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC 7921 TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC 7981 ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA 8041 GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT 8101 CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG 8161 AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC 8221 ATGTTCTTTC CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA 8281 GCTGATACCG CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG 8341 GAAGAGCGCC CAATACGCAA ACCGCCTCTC CCCGCGCGTT GGCCGATTCA TTAATGCAG //