<?xml version="1.0" encoding="UTF-8"?>
<EXPERIMENT_SET xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <EXPERIMENT accession="SRX702458" center_name="University of Oxford" alias="Filamoeba">
    <IDENTIFIERS>
      <PRIMARY_ID>SRX702458</PRIMARY_ID>
      <SUBMITTER_ID namespace="University of Oxford">Filamoeba</SUBMITTER_ID>
    </IDENTIFIERS>
    <TITLE>Filamoeba sinensis</TITLE>
    <STUDY_REF accession="SRP047214">
      <IDENTIFIERS>
        <PRIMARY_ID>SRP047214</PRIMARY_ID>
      </IDENTIFIERS>
    </STUDY_REF>
    <DESIGN>
      <DESIGN_DESCRIPTION>cDNA library was created by Vertis AG (Germany http://www.vertis-biotech.com/). Poly(A)+ RNA was purified from 10-20 µg of total RNA. Eight amoebozoan libraries including an unnormalised one for Filamoeba sinensis were created as part of a larger set of 22 cDNA libraries constructed by Vertis AG. cDNA synthesis was according to Vertis' FLE (full-length enriched) protocol. First strand cDNA was synthesized using an oligo-d(T) adapter primer (5'-CCTTGGCTGTCACTCACTGCGA-d(T)25-3') and Maloney Murine Leukemia virusRNAse H reverse-transcriptase was used to extend the primer towards the 5'-end of the mRNA. After RNA hydrolysis, an adapter primer (5'-CTCTGGACCTTGGCTGTCACTCAGTT-3') was attached to the 3'-ends of the first strand cDNA, to allow directional cloning/sequencing of subsequent samples. The adapter sequences on both ends of the cDNA were then used to synthesize second strand cDNA and to amplify the cDNA with long and accurate (LA) PCR (Barnes 1994). To limit PCR induced bias only 18-21 PCR cycles were performed to amplify the cDNA. Between 3.3-5.3 µg of cDNA was thus created for each library, which had a size range of ~250-3000 nucleotides determined by gel electrophoresis. 454 pyrosequencing by the University of Liverpool’s Advanced Genomics Facility of these 22 libraries and two genomic sequence surveys used a multiplex sequence strategy on two separate half plates after tagging each of the 12 samples per run with a separate multiplex ID (MID) tag. For tagging, each cDNA from Vertis AG was blunt-ended and ligated by the Liverpool facility to one of the unique-decamers (MID tags). They computationally sorted the resulting sequences to assign each to the appropriate sample using Roche software. Fuller details of techniques are in the supplementary material of Cavalier-Smith, T., Chao, E.E., Snell, E.A., Berney, C., Fiore-Donno, A.M., Lewis, R., 2014. Multigene eukaryote phylogeny reveals the likely protozoan ancestors of opisthokonts (animals, fungi, choanozoans) and Amoebozoa. Mol. Phylogenet. Evol. 81, 71-85. DOI:  10.1016/j.ympev.2014.08.012. As explained there, some cross-contamination must have occurred between samples in a single run prior to or during tagging, so some sequences are not from the expected taxa. Therefore, users of these data are strongly advised to run single-gene phylogenetic trees to check the taxonomic origin of any sequence of interest, as we did for the 192 genes used for our published multigene trees, which were the primary purpose of the sequencing. To enable interpretation of possible cross-contamination the taxa and MID tags run on each plate were:  Plate 1:  MID1  - Thaumatomonas oxoniensis (Cercozoa)  MID2  - Fabomonas tropica (Sulcozoa: Planomonadida)  MID3  - Microheliella maris (Heliozoa)  MID4  - Vannella simplex (Amoebozoa)  MID5  - Ovalopodium desertum (Amoebozoa)  MID6  - Nudifila producta (Cercozoa)  MID7  - Thecamonas oxoniensis (Sulcozoa: Apusomonadida)  MID8  - Mantamonas plastica (Sulcozoa: Mantamonadida)  MID9  - Nutamonas howeae (Sulcozoa: Planomonadia)  MID10 - Cercomonas clavideferens (Cercozoa)  MID11 - Helkesimastix marina (Cercozoa)  MID12 - Nolandella  abertawensis (Amoebozoa)  Plate 2:  MID1  - Stenamoeba stenopodia (Amoebozoa)  MID2  - Filamoeba sinensis (Amoebozoa)  MID3  - Sandona ubiquita (Cercozoa)  MID4  - Vexillifera bacillipedes (Amoebozoa)  MID5  - Oxnerella micra (Heliozoa) + Minimassisteria diva (Cercozoa)  MID6  - Flamella fluviatilis (Amoebozoa)  MID7  - Cochliopodium minutoideum (Amoebozoa)  MID8  - Multimonas media (Sulcozoa: Apusomonadida)  MID9  - Ancyromonas sigmoides (=Planomonas mylnikovi) (Sulcozoa: Planomonadida)  MID10 - Micrometopion nutans (Cercozoa) + Procryptobia sorokini (Euglenozoa)  MID11 - Micronuclearia podoventralis (Sulcozoa: Varisulca: Rigifilida) genomic DNA  MID12 - Rhogostoma minus (Cercozoa) genomic DNA.  Individual EST reads from each library were assembled using the 454 proprietary software, Newbler; contig construction parameters were chosen so primer sequences had no effect on contig assembly. Users should therefore remove primer sequences used in library construction from the raw submitted sequences.</DESIGN_DESCRIPTION>
      <SAMPLE_DESCRIPTOR accession="SRS702299">
        <IDENTIFIERS>
          <PRIMARY_ID>SRS702299</PRIMARY_ID>
        </IDENTIFIERS>
      </SAMPLE_DESCRIPTOR>
      <LIBRARY_DESCRIPTOR>
        <LIBRARY_NAME>Filamoeba sinensis</LIBRARY_NAME>
        <LIBRARY_STRATEGY>EST</LIBRARY_STRATEGY>
        <LIBRARY_SOURCE>TRANSCRIPTOMIC</LIBRARY_SOURCE>
        <LIBRARY_SELECTION>PolyA</LIBRARY_SELECTION>
        <LIBRARY_LAYOUT>
          <SINGLE/>
        </LIBRARY_LAYOUT>
      </LIBRARY_DESCRIPTOR>
      <SPOT_DESCRIPTOR>
        <SPOT_DECODE_SPEC>
          <SPOT_LENGTH>0</SPOT_LENGTH>
          <READ_SPEC>
            <READ_INDEX>0</READ_INDEX>
            <READ_CLASS>Technical Read</READ_CLASS>
            <READ_TYPE>Adapter</READ_TYPE>
            <BASE_COORD>1</BASE_COORD>
          </READ_SPEC>
          <READ_SPEC>
            <READ_INDEX>1</READ_INDEX>
            <READ_CLASS>Application Read</READ_CLASS>
            <READ_TYPE>Forward</READ_TYPE>
            <BASE_COORD>5</BASE_COORD>
          </READ_SPEC>
        </SPOT_DECODE_SPEC>
      </SPOT_DESCRIPTOR>
    </DESIGN>
    <PLATFORM>
      <LS454>
        <INSTRUMENT_MODEL>454 GS FLX Titanium</INSTRUMENT_MODEL>
      </LS454>
    </PLATFORM>
    <PROCESSING>
      <PIPELINE>
        <PIPE_SECTION>
          <STEP_INDEX/>
          <PREV_STEP_INDEX/>
          <PROGRAM>Roch454 proprietary software for MID tag sequence removal and taxon assignment to each sequence</PROGRAM>
          <VERSION/>
        </PIPE_SECTION>
        <PIPE_SECTION>
          <STEP_INDEX/>
          <PREV_STEP_INDEX/>
          <PROGRAM>Newbler contig assembly</PROGRAM>
          <VERSION>2.0.00.20</VERSION>
        </PIPE_SECTION>
      </PIPELINE>
    </PROCESSING>
  </EXPERIMENT>
</EXPERIMENT_SET>
