<?xml version="1.0" encoding="UTF-8"?>
<STUDY_SET xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <STUDY center_name="GEO" alias="GSE159964" accession="SRP288349">
    <IDENTIFIERS>
      <PRIMARY_ID>SRP288349</PRIMARY_ID>
      <EXTERNAL_ID namespace="BioProject" label="primary">PRJNA670967</EXTERNAL_ID>
      <EXTERNAL_ID namespace="GEO">GSE159964</EXTERNAL_ID>
    </IDENTIFIERS>
    <DESCRIPTOR>
      <STUDY_TITLE>Epigenomic Tensor Predicts Disease Subtypes and Reveals Constrained Tumor Evolution [RNA-Seq II]</STUDY_TITLE>
      <STUDY_TYPE existing_study_type="Transcriptome Analysis"/>
      <STUDY_ABSTRACT>Understanding the epigenomic evolution and specificity of disease subtypes from complex patient data remains a major biomedical problem. We here present DeCET (Decomposition and Classification of Epigenomic Tensors), an integrative computational approach for simultaneously analyzing hierarchical heterogeneous data, to identify robust epigenomic differences between tissue types, differentiation states, and disease subtypes. Applying DeCET to our own data from 21 uterine benign tumor (leiomyoma) patients identifies distinct epigenomic features discriminating normal myometrium and leiomyoma subtypes. Leiomyomas possess preponderant alterations in distal enhancers and long-range histone modifications confined to chromatin contact domains that constrain the evolution of pathological epigenomes. Moreover, we demonstrate the power and advantage of DeCET on multiple publicly available epigenomic datasets representing different cancers and cellular states. Epigenomic features extracted by DeCET can thus help improve our understanding of disease states, cellular development, and differentiation, thereby facilitating future therapeutic, diagnostic and prognostic strategies. Overall design: Primary myometrial cells were obtained from three patients and treated with empty or V5-tagged HOXA13 vector using lentiviral transduction. RNA was harvested from these cells and used for next generation sequencing. Basecalls were performed using RTA v.2.4.11 on Nextseq instrument. Reads were adapter trimmed using TrimGalore v0.4.4 with parameters --illumina --stringency 13 --paired. After adapter trimming reads were aligned to the hg19 genome with the UCSC knownGene annotation with STAR options --quantmode. The fourth column of the quantmode output was extracted to obtain reads aligning to features in the correct orientation.</STUDY_ABSTRACT>
      <CENTER_PROJECT_NAME>GSE159964</CENTER_PROJECT_NAME>
    </DESCRIPTOR>
    <STUDY_LINKS>
      <STUDY_LINK>
        <XREF_LINK>
          <DB>pubmed</DB>
          <ID>33789109</ID>
        </XREF_LINK>
      </STUDY_LINK>
    </STUDY_LINKS>
    <STUDY_ATTRIBUTES>
      <STUDY_ATTRIBUTE>
        <TAG>parent_bioproject</TAG>
        <VALUE>PRJNA596649</VALUE>
      </STUDY_ATTRIBUTE>
    </STUDY_ATTRIBUTES>
  </STUDY>
</STUDY_SET>
