<?xml version="1.0" encoding="UTF-8"?>
<STUDY_SET xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <STUDY alias="ena-STUDY-KAUST-27-11-2014-10:24:28:422-2110" center_name="KAUST" accession="ERP008859">
    <IDENTIFIERS>
      <PRIMARY_ID>ERP008859</PRIMARY_ID>
      <EXTERNAL_ID namespace="BioProject">PRJEB7873</EXTERNAL_ID>
      <SUBMITTER_ID namespace="KAUST">ena-STUDY-KAUST-27-11-2014-10:24:28:422-2110</SUBMITTER_ID>
    </IDENTIFIERS>
    <DESCRIPTOR>
      <STUDY_TITLE>This is a genome assembly third party annotation of Toxoplasma gondii VEG strain based on strand-specific RNA-sequencing and manual re-annotation identifying novel features of UTRs and non-coding transcripts.</STUDY_TITLE>
      <STUDY_TYPE existing_study_type="Other"/>
      <STUDY_ABSTRACT>Toxoplasma gondii is an important protozoan parasite that infects all warm-blooded animals and causes opportunistic infections in immuno-compromised humans. Its closest relative, Neospora caninum, is an important veterinary pathogen that causes spontaneous abortion in livestock. Comparative genomics of these two closely related coccidians has been of particular interest to identify genes that contribute to varied host cell specificity and disease. Automated gene prediction tools that were used for gene annotation can lead to inaccurate gene models and lack information on untranslated regions and non-coding transcripts. Here, we describe a manual re-annotation of these genomes based on strand-specific RNA sequencing and shotgun proteomics. We have corrected the structures of over one third of the gene models and have annotated the complete set of untranslated regions (UTRs). We observe distinctly long UTRs in both the ?organisms??, almost four times longer than other eukaryotes?. We have also identified a putative set of cis-natural antisense transcripts (cis-NATs) and long intergenic non-coding RNAs (lincRNAs). With these, we have significantly improved the quality of annotation in the genomes to serve as a manually curated base for future research on these organisms.</STUDY_ABSTRACT>
      <CENTER_PROJECT_NAME>Strand-specific RNA-sequencing and manual re-annotation of Toxoplasma gondii VEG and Neospora caninum LIV genomes</CENTER_PROJECT_NAME>
      <STUDY_DESCRIPTION>Toxoplasma gondii is an important protozoan parasite that infects all warm-blooded animals and causes opportunistic infections in immuno-compromised humans. Its closest relative, Neospora caninum, is an important veterinary pathogen that causes spontaneous abortion in livestock. Comparative genomics of these two closely related coccidians has been of particular interest to identify genes that contribute to varied host cell specificity and disease. Automated gene prediction tools that were used for gene annotation can lead to inaccurate gene models and lack information on untranslated regions and non-coding transcripts. Here, we describe a manual re-annotation of these genomes based on strand-specific RNA sequencing and shotgun proteomics. We have corrected the structures of over one third of the gene models and have annotated the complete set of untranslated regions (UTRs). We observe distinctly long UTRs in both the ?organisms??, almost four times longer than other eukaryotes?. We have also identified a putative set of cis-natural antisense transcripts (cis-NATs) and long intergenic non-coding RNAs (lincRNAs). With these, we have significantly improved the quality of annotation in the genomes to serve as a manually curated base for future research on these organisms.</STUDY_DESCRIPTION>
    </DESCRIPTOR>
  </STUDY>
</STUDY_SET>
