List of all project ids: submitter asigned, archive assigned, ....
At least one of ArchiveID, CenterID or LocalID should present.
Unique project identifier across all archives.
Contains host archive name, unique NCBI database ID (assigned only by NCBI DB staff) and unique accession (see accession description).
Element is optional since it will be created on submission. However it is required in the DB.
List of all project identifiers which were replaced by this record across. Similarly to Genbank secondary accession it is a cumulative field. .
Contains host archive name and accession (see accession description).
Element is optional.
List of center-specific internal tracking IDs for the project.
Common description of a project : title, publication, etc...
Very short descriptive name of the project for caption, labels, etc. For example: 1000 Genomes Project
Short, but informative title of the projects, single phrase, single line. For example:
Sequencing the southern Chinese HAN population from the 1000 Genomes Project.
Informative paragraph
Link to external resources (as an URL) or to external databases (DB name, ID).
This is not a project-to-project relation.
May be an URL to any external web resource or link to specified by name database.
Funding information for a project.
Date of public release.
Intended to be used in support of queries
Major impact categories for the project.
Could include bio-remediation, bio-fuels and other areas of research where there are areas of
mass production
Unspecified major impact categories to be defined here.
Attribute represents a key ; element text() represents a value
Attribute represents a term
Category to group the terms
Units - when value represent a measurement
If element with "RefSeq" tag present, the project is a refseq project. To be created by NCBI only.
The source of the annotation. For example, propogated from INSDC, RefSeq curation, etc.
Used sequence source
External authority (if one exists)
Project type specific fields. Created as "choice" - for future expansion.
An administrative project with the following attributes:
a. Unique tax_id is required. This will generally be at the single species-level, but could be at sub-species or a level higher than species. For example, with Canis lupus, dog is a sub-species of gray wolf.
b. Only one top single project for a tax_id
c. Only created by archive database collaborators (NCBI/EBI/DDBJ)
d. Top-single includes a description of the organism
e. It may include structured descriptors; for instance, information about habitat, chromosomes (how many, what are they named), genome size etc. These descriptors are extendable.
f. May have 1 to many submitter-level project types attached
g. Attached projects likely are not related by collaboration or funding (e.g. independent groups A, B, and C generate independent marker maps or transcriptome projects etc.)
An administrative project with the following attributes:
a. Tax_id is optional; may be a species- or higher-level tax_id (e.g., ‘primates’)
b. Primarily created by archive database collaborators (NCBI/EBI/DDBJ)
c. Submitters can request creation
d. May reflect a large multi-disciplinary project initiated by a funding agency
e. Or, Arbitrary grouping; e.g. all sequences (from a grant) submitted by different process flows; any grouping that does not cleanly fit into the first two classes.
f. May have subtypes, e.g. a controlled vocabulary of descriptors including:
i. Comparative genomics
ii. Disease
iii. Metagenome
If subtype eOther is chosen, explain details here
Please, fill Description element when choose "eOther"
A submitter level project based on actual experiment whose intent is to produce and submit data to one or more archives.
Target of the experiment. See @target_type for possible choices
Source of biomaterial used as target for this data project
Optionally provide description especially when "eOther" is selected
Identifier of the BioSample when known
The scope and purity of the biological sample used for the study
Samples are traceable to one or more isolated organisms/subjects/individuals
Samples are taken from the same species but multiple individuals/organisms with the goal of accessing variability among individuals
Sample is taken from multiple species
Content of species in a sample is not known, i.e. microbiome,metagenome, etc..
Synthetic material
Please, fill Description element when choose "eOther"
The type of material that is isolated from the sample for the experimental study
DNA material
Incompleted DNA material from (separate chromosome, etc...)
RNA material
Material obtained by precipitation, specific chemical reaction, etc....
Protein material
Phenotypes are collected
Please, fill Description element when choose "eOther"
The scope of results that the experimental study is capturing, e.g. the focus of the study
The project makes use of the whole sample material, e.g. the sample or method does not result in capturing a pre-defined class of data
Capture by bacterial cloning
Exome capture
specific loci are captured
The project is defined as a random partial survey of the sample material. For instance, a genome scan rather than a whole genome sequence approach
Please, fill Description element when choose "eOther"
Set of Targets references to BioSamples
The core experimental approach used to obtain the data that is submitted to archival databases
Sequencing using Sanger, 454, Illumina, etc with
Sequence is determined by hybridization arrays
Please, provide method description in the body of the element
The type of data that results from the experimental study; e.g., the type of data that will be submitted to archival databases
Submission of raw sequencing information as it comes out of machine
Sequence which is not raw - meaning processed (clipped, matepaired, oriented)
Higher level interpretation of the data
Experiment will result in assemblies (genome or transcriptome)
Experiment wil result in Annotation
Submission of variations
Experiment will result in Epigenetic markers
Experiment will result in submission to Peptidome
Experiment will result in cytogenetic, physical, Rh, etc.. maps
Experiment will deliver phenotypes
Please, provide data description in the body of the element
Types of data intended to be submitted for the project.
Data type based on actual data information. Ignored at submission time.
Free form citation.
Journal title in ISO-JTO format
Use for Unpublished, in press or other databases
Unique publication identifier in the specified database that is specific to the project.
Publication date.
Unique identifier in the specified database.
The external database to which the ID is referring.
Text which shown on the project presentation page for this link.
Used in situations to distinguish a project that is not a strain, breed or cultivar. For example,
this could be a common library name or a known sample name, such as Coco the gorilla.
A strain, breed or cultivar.
New items: Organism, Morphology, Environment, Phenotype, Relevance
Physical attributes of the organism
rod-shaped
spherical-shaped
spiral-shaped
elongated coccal form
filament-shaped (bacilli thar occur in long threads)
vibrio-shaped (short, slightly curved rods)
fusiform or spindle-shaped (rods with tapered ends)
Number ot chromosomes for the project or organism per replicon type
CentiMorgan
Arbitrary measurements
Host archive: NCBI, EMBL, DDBJ, .....
Host archive integer id (projectID for NCBI), optional. May be assigned only by NCBI. Should be omitted when created by other archive and reassigned later by NCBI.
Required unique accession. For NCBI: PRJNA12345, for EMBL: PRJEA12345, for DDBJ: PRJDA12345, where 12345 is an archive-unique internal number
Center-specific project IDs, assumed to be unique for a center.
Center internal abbreviation for the project.
Can have an integer synonym if needed.
Alternative internal IDs that may be optionally provided within a submission stream.
For example, there may be a localID from the Center and a different localID from the submitting DAC.
Replicon type is a text inside of element. Location and isSingle are attributes.
Explanation of an unusual chromosome features
Optionally: List all accepted synonyms for this chromosome
Chromosome number for organisms whith established numbering system :1,2,3,X,Y
Old historical prefixes may be one- or two-character long. Verification to be provided at API level