LOCUS pHD309-HYG-PUR 6137 bp DNA circular 16-MAR-2010 FEATURES Location/Qualifiers CDS 3..785 /number=bTUB 3' CDS 5'UTR 913..1103 /function="ALD 5' UTR & upstream" /note="corresponds to 2151-2345 and 5926-6120 in X52586. ALD ATG is at 2146" 3'UTR 1713..1856 /function="ALD short 3' UTR" /note="truncated aldolase 3'UTR originally derived from pHD330 (about 6x lower BF expression than the full-length ALD UTR in pLew100, but no difference in expression levels in PF, between long and short UTRs) " 5'UTR 1862..1977 /function="ACT 5' UTR & upstream" /note="Small sequence corrections made in January 2004" CDS 1990..3012 /function="HYG" 3'UTR 3045..3332 /function="ACT 3' UTR" /note="When the entire downstream region was BLASTed, these are the nt that matched the 427 Bn-2 ES. The match was almost the same against the LiTat1.6 ACT, with 3 internal point differences" CDS 4282..5139 /function="LACTAMASE" misc_feature 1..905 /function="TUBULIN array" CDS 1112..1711 /function="PUR" misc_feature 1727..1734 /note="Corrections after sequencing by Addgene 8 bp (1727-1735) were deleted and T at position 1681 changed to C (no AA change)." ORIGIN 1 CTGACCGTAT CATGATGACT TTCTCCATCA TCCCATCCCC CAAGGTGTCC GACACTGTCG 61 TCGAGCCGTA CAATACGACT CTCTCCGTGC ACCAACTTGT GGAAAACTCC GATGAGTCGA 121 TGTGCATTGA CAACGAGGCA CTGTACGATA TTTGCTTCCG CACCCTGAAA CTGACAACAC 181 CAACGTTCGG TGACCTGAAC CACTTGGTGT CTGCTGTTGT GTCCGGCGTC ACCTGCTGCC 241 TGCGCTTCCC TGGTCAGTTG AACTCTGACC TCCGTAAGTT GGCTGTGAAC CTTGTCCCAT 301 TCCCGCGTCT GCACTTCTTC ATGATGGGCT TCGCCCCGCT GACCAGCCGC GGCTCGCAGC 361 AGTACCGCGG TCTCTCCGTG CCCGAGCTAA CGCAGCAGAT GTTCGATGCG AAAAACATGA 421 TGCAAGCTGC AGATCCTCGT CACGGCCGCT ACCTGACAGC GTCTGCACTC TTCCGCGGCC 481 GCATGTCGAC GAAGGAGGTT GATGAGCAGA TGCTGAACGT GCAGAACAAG AACTCGTCCT 541 ACTTCATTGA GTGGATCGAT CCCGAACAAC ATCAAGTCCT CTGTTTGCGA TATCCCACCC 601 AAGGGACTCA AGATGGCTGT CACCTTCATT GGCAACAACA CCTGCATCCA GGAGATGTTC 661 CGCCGTGTGG GAGAGCAGTT CACCCTCATG TTCCGTCGCA AGGCGTTCTT GCACTGGTAC 721 ACTGGCGAGG GTATGGACGA GATGGAATTC ACGGAGGCAG AGTCCAACAT GAACGATCTC 781 GTGTCTGAGT ACCAGCAGTA CCAGGATGCC ACGATTGAGG AGGAGGGCGA GTTCGACGAG 841 GAGGAGCAAT ACTAGACGCG GACGGGGCAT TTCCCGTTCG TCATTAGCAG TAGGTAATGA 901 AGATGCTCGA GGGTGCTCAA GCTGTGTAGC GCACGCGTTT CCTTACATAT TTCTCTAACA 961 GGCACGGAAG CCTAACAAAT ACACTTGGCT TATTTTTTTG CCCCCTCATG TCTTGTACAA 1021 ATATTTGCGA TAGCTTAGCT ATCAGCCACA TTAATCAAAC AAGTATACCA ACAAGCCCGA 1081 AAACATAAAC TCAACTGCAA CGAAGCTTAC CATGACCGAG TACAAGCCCA CGGTGCGCCT 1141 CGCCACCCGC GACGACGTCC CCAGGGCCGT ACGCACCCTC GCCGCCGCGT TCGCCGACTA 1201 CCCCGCCACG CGCCACACCG TCGATCCAGA CCGCCACATC GAGCGGGTCA CCGAGCTGCA 1261 AGAACTCTTC CTCACGCGCG TCGGGCTCGA CATCGGCAAG GTGTGGGTCG CGGACGACGG 1321 CGCAGCAGTG GCGGTCTGGA CCACGCCGGA GAGCGTCGAA GCGGGGGCGG TGTTCGCCGA 1381 GATCGGCCCG CGCATGGCCG AGTTGAGCGG TTCCCGGCTG GCCGCGCAGC AACAGATGGA 1441 AGGCCTCCTG GCGCCGCACC GGCCCAAGGA GCCCGCGTGG TTCCTGGCCA CCGTCGGCGT 1501 CTCGCCCGAC CACCAGGGCA AGGGTCTGGG CAGCGCCGTC GTGCTCCCCG GAGTGGAGGC 1561 GGCCGAGCGC GCCGGGGTGC CCGCCTTCCT GGAGACCTCC GCGCCCCGCA ACCTCCCCTT 1621 CTACGAGCGG CTCGGCTTCA CCGTCACCGC CGACGTCGAG GTGCCCGAAG GACCGCGCAC 1681 CTGGTGCATG ACCCGCAAGC CCGGTGCCTG AGGATCCTGC CCATTTGGCT TTTCCCTTGT 1741 CTCGTGTCTT TTCCGTGGAA AGGTTCCCGG AGTAATCTGA TGGCACAGCA GGGAGGTGCG 1801 CCTGCAGGTT GGTTAGGAAG GGGGGATGAT GTAAAAGAAG AAAATGGGGG GATTCGAGCC 1861 CGGGCACAGC AAGGTCTTCT GAAATTCATG TTTTTTTTTT TTTTACTCTG CATTGCAGTC 1921 TCCGCTCTTA TTTAGTTTTG CTTTACGTAA GGTCTCGTTG CTGCCATAAA ATAAGCTCTA 1981 GAACTAGTGA TGAAAAAGCC TGAACTCACC GCGACGTCTG TCGAGAAGTT TCTGATCGAA 2041 AAGTTCGACA GCGTCTCCGA CCTGATGCAG CTCTCGGAGG GCGAAGAATC TCGTGCTTTC 2101 AGCTTCGATG TAGGAGGGCG TGGATATGTC CTGCGGGTAA ATAGCTGCGC CGATGGTTTC 2161 TACAAAGATC GTTATGTTTA TCGGCACTTT GCATCGGCCG CGCTCCCGAT TCCGGAAGTG 2221 CTTGACATTG GGGAATTCAG CGAGAGCCTG ACCTATTGCA TCTCCCGCCG TGCACAGGGT 2281 GTCACGTTGC AAGACCTGCC TGAAACCGAA CTGCCCGCTG TTCTGCAGCC GGTCGCGGAG 2341 GCCATGGATG CGATCGCTGC GGCCGATCTT AGCCAGACGA GCGGGTTCGG CCCATTCGGA 2401 CCGCAAGGAA TCGGTCAATA CACTACATGG CGTGATTTCA TATGCGCGAT TGCTGATCCC 2461 CATGTGTATC ACTGGCAAAC TGTGATGGAC GACACCGTCA GTGCGTCCGT CGCGCAGGCT 2521 CTCGATGAGC TGATGCTTTG GGCCGAGGAC TGCCCCGAAG TCCGGCACCT CGTGCACGCG 2581 GATTTCGGCT CCAACAATGT CCTGACGGAC AATGGCCGCA TAACAGCGGT CATTGACTGG 2641 AGCGAGGCGA TGTTCGGGGA TTCCCAATAC GAGGTCGCCA ACATCTTCTT CTGGAGGCCG 2701 TGGTTGGCTT GTATGGAGCA GCAGACGCGC TACTTCGAGC GGAGGCATCC GGAGCTTGCA 2761 GGATCGCCGC GGCTCCGGGC GTATATGCTC CGCATTGGTC TTGACCAACT CTATCAGAGC 2821 TTGGTTGACG GCAATTTCGA TGATGCAGCT TGGGCGCAGG GTCGATGCGA CGCAATCGTC 2881 CGATCCGGAG CCGGGACTGT CGGGCGTACA CAAATCGCCC GCAGAAGCGC GGCCGTCTGG 2941 ACCGATGGCT GTGTAGAAGT ACTCGCCGAT AGTGGAAACC GACGCCCCAG CACTCGTCCG 3001 AGGGCAAAGG AATAGAGTAG ATGCCGACCG GGATCGATCC CCCGATCCTA ACACCGGGTT 3061 GTGTGGCCAA AATTGTTCTG TAGTCGCTGT GAGTTGACAC GGCTAGTGCT TATGATTTTC 3121 CTCGCGTGTG GTGCCTGTAC TCAGCCCTAT GCCTTATTTG CAACACATTT ACGTACAGCG 3181 CACAAGAGAA GAGAAGATCA CTTGAAGATA ATAAATATAG GGTTGTAGGC ATCTTGTTTA 3241 ACTCAAATTT TCTCGTCTTG GTGTGTCGAC ATGATTGAAA TAGTGCCACC AGTTGTGTTT 3301 GATGCGTTTG TTATCTATGC AGTATTCTGC AAGGCCTTGC AAGGCCTTGC AGGCATGCAA 3361 GCTAGCTTGT ATTCTATAGT GTCACCTAAA TCGTATGTGT ATGATACATA AGGTTATGTA 3421 TTAATTGTAG CCGCGTTCTA ACGACAATAT GTACAAGCCT AATTGTGTAG CATCTGGCTT 3481 ACTGAAGCAG ACCCTATCAT CTCTCTCGTA AACTGCCGTC AGAGTCGGTT TGGTTGGACG 3541 AACCTTCTGA GTTTCTGGTA ACGCCGTTCC GCACCCCGGA AATGGTCAGC GAACCAATCA 3601 GCAGGGTCAT CGCTAGCCAG ATCCTCTACG CCGGACGCAT CGTGGCCGGC ATCACCGGCG 3661 CCACAGGTGC GGTTGCTGGC GCCTATATCG CCGACATCAC CGATGGGGAA GATCGGGCTC 3721 GCCACTTCGG GCTCATGAGC GCTTGTTTCG GCGTGGGTAT GGTGGCAGGC CCCGTGGCCG 3781 GGGGACTGTT GGGCGCCATC TCCTTGCACC ATTCCTTGCG GCGGCGGTGC TCAACGGCCT 3841 CAACCTACTA CTGGGCTGCT TCCTAATGCA GGAGTCGCAT AAGGGAGAGC GTCGATATGG 3901 TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGCCCCG ACACCCGCCA 3961 ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG CATCCGCTTA CAGACAAGCT 4021 GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC CGTCATCACC GAAACGCGCG 4081 AGACGAAAGG GCCTCGTGAT ACGCCTATTT TTATAGGTTA ATGTCATGAT AATAATGGTT 4141 TCTTAGACGT CAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT TTGTTTATTT 4201 TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGAGACAAT AACCCTGATA AATGCTTCAA 4261 TAATATTGAA AAAGGAAGAG TATGAGTATT CAACATTTCC GTGTCGCCCT TATTCCCTTT 4321 TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA CGCTGGTGAA AGTAAAAGAT 4381 GCTGAAGATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC TGGATCTCAA CAGCGGTAAG 4441 ATCCTTGAGA GTTTTCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT TAAAGTTCTG 4501 CTATGTGGCG CGGTATTATC CCGTATTGAC GCCGGGCAAG AGCAACTCGG TCGCCGCATA 4561 CACTATTCTC AGAATGACTT GGTTGAGTAC TCACCAGTCA CAGAAAAGCA TCTTACGGAT 4621 GGCATGACAG TAAGAGAATT ATGCAGTGCT GCCATAACCA TGAGTGATAA CACTGCGGCC 4681 AACTTACTTC TGACAACGAT CGGAGGACCG AAGGAGCTAA CCGCTTTTTT GCACAACATG 4741 GGGGATCATG TAACTCGCCT TGATCGTTGG GAACCGGAGC TGAATGAAGC CATACCAAAC 4801 GACGAGCGTG ACACCACGAT GCCTGTAGCA ATGGCAACAA CGTTGCGCAA ACTATTAACT 4861 GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA GGCGGATAAA 4921 GTTGCAGGAC CACTTCTGCG CTCGGCCCTT CCGGCTGGCT GGTTTATTGC TGATAAATCT 4981 GGAGCCGGTG AGCGTGGGTC TCGCGGTATC ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC 5041 TCCCGTATCG TAGTTATCTA CACGACGGGG AGTCAGGCAA CTATGGATGA ACGAAATAGA 5101 CAGATCGCTG AGATAGGTGC CTCACTGATT AAGCATTGGT AACTGTCAGA CCAAGTTTAC 5161 TCATATATAC TTTAGATTGA TTTAAAACTT CATTTTTAAT TTAAAAGGAT CTAGGTGAAG 5221 ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG AGTTTTCGTT CCACTGAGCG 5281 TCAGACCCCG TAGAAAAGAT CAAAGGATCT TCTTGAGATC CTTTTTTTCT GCGCGTAATC 5341 TGCTGCTTGC AAACAAAAAA ACCACCGCTA CCAGCGGTGG TTTGTTTGCC GGATCAAGAG 5401 CTACCAACTC TTTTTCCGAA GGTAACTGGC TTCAGCAGAG CGCAGATACC AAATACTGTC 5461 CTTCTAGTGT AGCCGTAGTT AGGCCACCAC TTCAAGAACT CTGTAGCACC GCCTACATAC 5521 CTCGCTCTGC TAATCCTGTT ACCAGTGGCT GCTGCCAGTG GCGATAAGTC GTGTCTTACC 5581 GGGTTGGACT CAAGACGATA GTTACCGGAT AAGGCGCAGC GGTCGGGCTG AACGGGGGGT 5641 TCGTGCACAC AGCCCAGCTT GGAGCGAACG ACCTACACCG AACTGAGATA CCTACAGCGT 5701 GAGCTATGAG AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG CGGACAGGTA TCCGGTAAGC 5761 GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG GGGGAAACGC CTGGTATCTT 5821 TATAGTCCTG TCGGGTTTCG CCACCTCTGA CTTGAGCGTC GATTTTTGTG ATGCTCGTCA 5881 GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT 5941 TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC CTGATTCTGT GGATAACCGT 6001 ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA GCGCAGCGAG 6061 TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA ATACGCAAAC CGCCTCTCCC CGCGCGTTGG 6121 CCGATTCATT AATGCAG //