home > bioproject > PRJEB37669
identifier PRJEB37669
type bioproject
sameAs
organism
title Haplotype resolved chromosome level assembly of Apricot generated by application of gamete binning on single cell sequencing data of gametes.
description We developed and applied a novel gamete binning strategy on single cell sequencing data of gametes to generate haplotype resolved chromosome level assembly of Apricot genome.
This study includes the data created in the work of "Chromosome-level and haplotype-resolved genome assembly enabled by high-throughput single-cell sequencing of gamete genomes”, where a novel method has been developed to achieve the haploid assemblies in heterozygous species. As an application, two haploid genomes were assembled for an apricot tree (cultivar Rojo Pasion;isolate pruArmRojPasFocal), using PacBio reads sequenced from the somatic tissues and 10x Genomics/Illumina short reads sequenced from the hundreds of gamete genomes.Specifically, here are the assembly and annotation of the two haploid genomes from a single diploid heterozygous individual (isolate pruArmRojPasFocal), referred to as "Currot" and “Orange Red” haplotypes.
Following are the details of the assemblies.
  • Currot haplotype (Assembly Name : pruArmRojPasHapCUR ; Assembly ID : GCA_903112645 ; URL : http://www.ebi.ac.uk/ena/data/view/GCA_903112645 )
  • Orange Red haplotype (Assembly Name : pruArmRojPasHapORARED ; Assembly ID : GCA_903114435 ; URL : http://www.ebi.ac.uk/ena/data/view/GCA_903114435 )

The project also contains raw read data for
  • PacBio reads from the focal individual
  • Single Cell data from gametes (Pollen)
  • RNA seq from several tissues of the focal individual
  • Hi-C sequencing of the focal individual genome
  • Illumina sequencing of the actual parental cultivars (not used for genome assembly)
data type Genome sequencing and assembly
organization
publication
properties 
{...}
dbXrefs
sra-run  ERR4092030ERR4092031ERR4092032ERR4092033ERR4092034ERR4092035ERR4092036ERR4092037ERR4092038ERR4092039 More
sra-submission  ERA2539885ERA2539899ERA2540600ERA2541644ERA2931077
biosample  SAMEA6811881SAMEA6811882SAMEA6811883SAMEA6811884SAMEA6811885SAMEA6811886SAMEA6811888SAMEA6811889SAMEA6812185SAMEA6812187 More
sra-study  ERP120999
sra-sample  ERS4539527ERS4539528ERS4539529ERS4539530ERS4539531ERS4539532ERS4539534ERS4539535ERS4539830ERS4539832 More
sra-experiment  ERX4087532ERX4087533ERX4087529ERX4087531ERX4087530ERX4087528ERX4087539ERX4087540ERX4088521ERX4089092 More
distribution JSONJSON-LD
Download
bioproject.xml  HTTPS FTP
status public
visibility unrestricted-access
dateCreated 2020-04-12T00:00:00Z
dateModified 2020-04-12T00:00:00Z
datePublished