home > bioproject > PRJDB2229
identifier PRJDB2229
type bioproject
sameAs
sra-study  DRP000372
organism Homo sapiens
title Unamplified Cap Analysis of Gene Expression on a single molecule sequencer (HeliScopeCAGE)
description We report the development of a simplified Cap Analysis of Gene Expression (CAGE) protocol adapted for single molecule sequencers which avoids second strand synthesis, ligation, digestion and PCR. HeliScopeCAGE directly sequences the 3’ end of cap trapped first strand cDNAs. As with previous versions of CAGE, we better define transcription start sites (TSS) than known models, identify novel regions of transcription and alternative promoters, and find two major classes of TSS signal, sharp peaks and broad regions. However using this protocol we observe reproducible evidence of regulation at the much finer level of individual TSS positions. The libraries are quantitative over 5 orders of magnitude and highly reproducible (pearson’s correlation coefficient of 0.987). We have also scaled down the sample requirement to 5ug total RNA for a standard HeliScopeCAGE library and 100ng for a low quantity version. When the same RNA was run as 5ug and 100ng versions, the 100ng was still able to detect expression for ~60% of the 13468 loci detected by a 5ug library using the same threshold, allowing comparative analysis of even rare cell populations. Testing the protocol for differential gene expression measurements on triplicate HeLa and THP-1 samples, we find the log fold change compared to Illumina microarray measurements is highly correlated (0.871). In addition, HeliScopeCAGE finds differential expression for 1000s more loci including those with probes on the array. Finally although the majority of tags are 5’ associated we also observe a low level of signal on exons which is useful for defining gene-structures. HeliScopeCAGE protocol: First strand cDNA is generated from total RNA using an excess of random primer. 5’ end complete first strand cDNAs are captured for capped RNAs. First strand cDNA is polyA tailed and blocked and then loaded directly onto the HeliScope flow cell for sequencing. Data processing: After the filtering with filterSMS in HeliSphere package, the obtained reads are aligned to the human genome (hg18) with indexDPgenomic and only the alignments with the best alignment score in unique location are selected.
data type DDBJ SRA Study
publication
21596820
external link
properties 
{...}
dbXrefs
sra-run  DRR000945DRR000946DRR000948DRR000949DRR000950DRR000951DRR000952DRR000953DRR000954DRR000955 More
sra-submission  DRA000368
sra-analysis  DRZ000008DRZ000009DRZ000010DRZ000011DRZ000012DRZ000013DRZ000014DRZ000015DRZ000016DRZ000017 More
biosample  SAMD00009391SAMD00009392
sra-study  DRP000372
sra-sample  DRS000624DRS000625
sra-experiment  DRX000595DRX000596DRX000598DRX000599DRX000600DRX000601DRX000602DRX000603DRX000604DRX000605 More
distribution JSONJSON-LD
status public
visibility unrestricted-access
dateCreated 2014-05-12T00:46:27+0000
dateModified 2014-05-12T00:46:27+0000
datePublished 2014-05-12T00:46:27+0000