Sequence Resources

A BLAST server is no longer available here. It was mainly used for running massive alignments that could not be done on other public servers. I don't expect to reinstall it unless someone really needs it (I'm retired!). VSG data files are STILL AVAILABLE TO DOWNLOAD, which can be more convenient than downloading from NCBI (GenBank). Also provided here are my unpublished contigs assembled from PacBio sequencing of the EATRO 1125 strain of Trypanosoma brucei.

Right click to download files: all files are in plain text with informative fasta headers (including the annotation 'MC' for sequences derived from purified minichromosomes of Lister 427, TREU 927 & EATRO 1125; annotation as minichromosome was dis-allowed by GenBank). Files were up-to-date in May 2017. The assembly and annotation of Lister 427 complete and partial VSGs are described in the 2014 paper of Cross, Kim & Wickstead entitled Capturing the Variant Surface Glycoprotein repertoire (the VSGnome) of Trypanosoma brucei Lister 427. All my otherwise unpublished additions to the repertoire of TREU 927 VSGs, which include those from purified minichromosomes, are now included in GenBank, together with all EATRO 1125 (AKA, incorrectly, as 'the AnTat strain') complete and partial VSGs that are at least 250 AAs long. The Lister 427 and TREU 927 data come from Illumina short-read assemblies; the EATRO 1125 data are derived from Illumina and PacBio sequencing. The Lister 427 genome and many of its incomplete VSG genes have been superseded by the 2018 de-novo phased genome assembly from Nicolai Siegel's laboratory, which includes haplotype-specific assembly of long VSG arrays, so I am no longer including my 2010 Illumina contigs on this site. All files are in FASTA format. I am no longer providing concatenated VSG files to use with IGV or Bowtie.

  • PROTEIN

  • All 13,171 VSGs of all species from all sources

  • VSGs Lister 427. All 4,211 >149 AAs (2010 Illumina Assembly)

  • VSGs Lister 427. Unique 2,470 >249 AAs (2010 Illumina Assembly)

  • VSGs EATRO 1125. All 5,349 >149 AAs (2015 PacBio & Illumina Assemblies)

  • VSGs EATRO 1125. Unique 3,250 >249 AAs (2015 PacBio & Illumina Assemblies)

  • VSGs. 2,990 of all species not provided by me parsed from GenBank & TriTrypDB May 2017

  • VSGs TREU 927. 644 New >149 AAs in 2012 Illumina assembly by me

  • DNA

  • EATRO 1125 PacBio Unpublished Contigs (957 QC-trimmed >2,000bp)

  • All 13,069 VSG CDSs of all species from all sources

  • VSG CDSs Lister 427. All 4,211 >149 AAs (2010 Illumina Assembly)

  • VSG CDSs Lister 427. Unique 2,470 >249 AAs (2010 Illumina Assembly)

  • VSG CDSs & Flanks Lister 427. Unique 2,470 >249 AAs (2010 Illumina Assembly)

  • VSG CDSs EATRO 1125. All 5,349 >149 AAs (2015 PacBio & Illumina Assemblies)

  • VSG CDSs & Flanks EATRO 1125. All 5,349 >149 AAs (2015 PacBio & Illumina Assemblies)

  • VSG CDSs EATRO 1125. Unique 3,250 >249 AAs (2015 PacBio & Illumina Assemblies)

  • VSG CDSs. 2,889 of all species not provided by me parsed from GenBank & TriTrypDB May 2017

  • VSG CDSs TREU 927. 644 New >149 AAs in 2012 Illumina assembly by me

  • Click here to return to tryps.rockefeller.edu home page