<?xml version="1.0" encoding="UTF-8"?>
<STUDY_SET xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <STUDY accession="ERP135110" alias="Genotyping complete sequence of CYP2D6-CYP2D7 locus" broker_name="European Bioinformatics Institute" center_name="Ghent University">
    <IDENTIFIERS>
      <PRIMARY_ID>ERP135110</PRIMARY_ID>
      <EXTERNAL_ID namespace="BioProject">PRJEB50521</EXTERNAL_ID>
      <SUBMITTER_ID namespace="Ghent University">Genotyping complete sequence of CYP2D6-CYP2D7 locus</SUBMITTER_ID>
    </IDENTIFIERS>
    <DESCRIPTOR>
      <STUDY_TITLE>Obtaining complex allele-specific sequences spanning the CYP2D6-CYP2D7 locus using nCATS-CoLoRGen assay</STUDY_TITLE>
      <STUDY_TYPE existing_study_type="Other"/>
      <STUDY_ABSTRACT>CYP2D6 is one of the most challenging pharmacogenes to genotype due to the high similarity with its neighboring pseudogenes and the frequent occurrence of CYP2D6-CYP2D7 hybrids. Consequently, most current genotyping methods cannot correctly determine the complete CYP2D6-CYP2D7 sequence. Therefore, we developed a genotyping assay to elucidate comprehensive genotypes of complex regions, such as the CYP2D6-CYP2D7 locus. This assay consists of an optimized PCR-free nanopore Cas9-targeted sequencing (nCATS) protocol combined with adaptive sequencing (AS) as enrichment strategy, and a newly developed comprehensive long read genotyping (CoLoRGen) pipeline to perform the data analysis. The CoLoRGen pipeline generates a complete consensus sequence of both alleles and determines both large structural and small variants to ultimately assign the correct star-alleles. To evaluate the performance of the optimized assay, we used three different DNA samples that contain complex structural variants in the CYP2D6-CYP2D7 region. The complete allele-specific sequences spanning the entire CYP2D6-CYP2D7 locus of all three samples could be determined, confirming the presence of CYP2D6-CYP2D7 large structural variants and smaller single nucleotide variants (SNV) and insertions and deletions (INDELs) that go undetected by other current methods. Moreover, our results provide direct evidence that the CYP2D6 genotype of the NA12878 DNA should include the CYP2D6-CYP2D7 *68 hybrid and several additional single nucleotide variants (SNV) compared to existing references. Overall, the nCATS-CoLoRGen genotyping assay allows for more accurate gene function predictions by enabling the possibility to detect and phase de novo mutations in addition to known large structural and small variants.</STUDY_ABSTRACT>
      <STUDY_DESCRIPTION>CYP2D6 is one of the most challenging pharmacogenes to genotype due to the high similarity with its neighboring pseudogenes and the frequent occurrence of CYP2D6-CYP2D7 hybrids. Consequently, most current genotyping methods cannot correctly determine the complete CYP2D6-CYP2D7 sequence. Therefore, we developed a genotyping assay to elucidate comprehensive genotypes of complex regions, such as the CYP2D6-CYP2D7 locus. This assay consists of an optimized PCR-free nanopore Cas9-targeted sequencing (nCATS) protocol combined with adaptive sequencing (AS) as enrichment strategy, and a newly developed comprehensive long read genotyping (CoLoRGen) pipeline to perform the data analysis. The CoLoRGen pipeline generates a complete consensus sequence of both alleles and determines both large structural and small variants to ultimately assign the correct star-alleles. To evaluate the performance of the optimized assay, we used three different DNA samples that contain complex structural variants in the CYP2D6-CYP2D7 region. The complete allele-specific sequences spanning the entire CYP2D6-CYP2D7 locus of all three samples could be determined, confirming the presence of CYP2D6-CYP2D7 large structural variants and smaller single nucleotide variants (SNV) and insertions and deletions (INDELs) that go undetected by other current methods. Moreover, our results provide direct evidence that the CYP2D6 genotype of the NA12878 DNA should include the CYP2D6-CYP2D7 *68 hybrid and several additional single nucleotide variants (SNV) compared to existing references. Overall, the nCATS-CoLoRGen genotyping assay allows for more accurate gene function predictions by enabling the possibility to detect and phase de novo mutations in addition to known large structural and small variants.</STUDY_DESCRIPTION>
    </DESCRIPTOR>
    <STUDY_LINKS>
      <STUDY_LINK>
        <XREF_LINK>
          <DB>PUBMED</DB>
          <ID>36149915</ID>
        </XREF_LINK>
      </STUDY_LINK>
    </STUDY_LINKS>
    <STUDY_ATTRIBUTES>
      <STUDY_ATTRIBUTE>
        <TAG>ENA-FIRST-PUBLIC</TAG>
        <VALUE>2023-08-25</VALUE>
      </STUDY_ATTRIBUTE>
      <STUDY_ATTRIBUTE>
        <TAG>ENA-LAST-UPDATE</TAG>
        <VALUE>2023-09-05</VALUE>
      </STUDY_ATTRIBUTE>
    </STUDY_ATTRIBUTES>
  </STUDY>
</STUDY_SET>
