DNA Data Bank of Japan DNA Database Release 62.1, August 4 2005, including 45,249,444 entries, 49,158,155,283 bases -------------------------------------------------------------------------------- DDBJ release 62.0 revised as 62.1 at August 4, 2005 -------------------------------------------------------------------------------- Duplication of the AE006470 was found in the DDBJ release 62.0 (released on June 2005). Nature of inconsistency: The entry AE006470 was found in both BCT and CON divisions. Measures implemented: The AE006470 in the BCT division is the latest one, so we removed the old AE006470 in the CON division and released again on August 4, 2005. Corrected file: ddbjcon.seq We DDBJ regret our mistake. ------------------------------------------------------------------------------- Table of contents ------------------------------------------------------------------------------- 1. Introduction 1.1. Announcement for changes in the present release 1.2. Announcement for the forthcoming changes 2. DDBJ flat file format 2.1. LOCUS line 2.2. DEFINITION line 2.3. ACCESSION line 2.4. VERSION line 2.5. KEYWORDS line 2.6. SOURCE line 2.7. REFERENCE line 2.8. COMMENT line 2.9. FEATURES line 2.10. BASE COUNT line 2.11. ORIGIN line 3. Dataset categories 3.1. Division categories 3.2. TPA separated from primary dataset 3.3. Notice for patented data 4. DDBJ staff 5. Acknowledgment 6. File Contents 6.1. File categories 6.2. File types 7. Sample of the contents in each file 7.1. Part of the contents in the file 'ddbjbct1.seq' 7.2. Part of the contents in the file 'ddbjbct1.acc' 7.3. Part of the contents in the file 'ddbjbct1.aut' 7.4. Part of the short directory in DDBJ style in the file 'ddbjbct1.dir' 7.5. Part of the contents in the file 'ddbjbct1.idx' 7.6. Part of the contents in the file 'ddbjbct1.jou' 7.7. Part of the contents in the file 'ddbjbct1.key' 7.8. Part of the contents in the file 'ddbjbct1.org' 7.9. Part of the short directory file in DDBJ style in the file 7.10. Part of the accession number index file in the 'ddbjacc.idx' 7.11. Part of the keyword phrase index file in the 'ddbjkey.idx' 7.12. Part of the journal citation index file in 'ddbjjou.idx' 7.13. Part of the gene name index file in 'ddbjgen.idx' 8. Release history 9. File list ------------------------------------------------------------------------------- 1. Introduction This database contains nucleotide sequence data for any organism, not only those with DNA genomes but also those with RNA genomes. This database may be copied and redistributed without permission on the condition that all the statements in this release note are reproduced in each copy. See also '3.3. Notice for patented data' below. The present release contains the newest data prepared by the DNA Data Bank of Japan (DDBJ), GenBank (*), and European Molecular Biology Laboratory/European Bioinformatics Institute (EMBL/EBI) as of May 26, 2005. This unified database was made possible thanks to the international collaboration among the three data banks. All the entries have accordingly been annotated using the feature keys common to them. *'GenBank' is a trademark of NIH, USA, and is operated by National Center for Biotechnology Information (NCBI) at NIH. 1.1. Announcement for changes in the present release Release version number is introduced: DDBJ has started to include the item, 'version', for its release note, which indicates a version for its periodical release. It is expressed like '62.0', in which the digit(s) after the period is a version number. The reason for adding the version number is that a released data is sometimes revised due to urgent and necessary corrections. The number is increased by one every time when a revised periodical release is made public until the next release. Introduction of ENV division: Recently, the submissions of the sequences derived from environmental samples have rapidly increased. To accommodate such submissions, a new division, ENV, has been created (See also '3.1. Division categories'). This division contains the sequences obtained via direct molecular isolation such as PCR, DGGE, or any anonymous method. In the past, the sequences derived from environmental samples belonged to taxonomic divisions, mainly BCT. At DDBJ, the retrofit to transfer relevant entries from taxonomic divisions to the ENV division starts in the present release, and ends by the next periodical release. Please note that during this transitional period, some entries to be eventually placed in the ENV division will be found in other divisions. Strand information is removed: The strand information of LOCUS line in the flat file has been removed as shown below. See also '2.1. LOCUS line'. ---------------------------------------------------------------------------- Old (-rel. 61): 44-44 space 45-47 spaces, ss- (single-stranded), ds- (double-stranded), or ms- (mixed-stranded) New (rel. 62-): 44-47 spaces ---------------------------------------------------------------------------- 1.2. Announcement for the forthcoming changes Nothing particular. 2. DDBJ flat file format The database is a collection of "entry" which is the unit of the data. The entries submitted to databanks were processed and publicized according to the DDBJ format for distribution (flat file). The flat file includes the sequence and the information of submitters, references, source organisms, and "feature" information, etc. The items of the DDBJ flat file are explained at following; -------------------------------------------------------------------------------- LOCUS AB000000 450 bp mRNA linear HUM 08-JUL- 2002 DEFINITION Homo sapiens GAPD mRNA for glyceraldehyde-3-phosphate dehydrogenase, partial cds. ACCESSION AB000000 VERSION AB000000.1 KEYWORDS . SOURCE Homo sapiens ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 450) AUTHORS Mishima,H. and Shizuoka,T. TITLE Direct Submission JOURNAL Submitted (30-NOV-2000) to the DDBJ/EMBL/GenBank databases. Hanako Mishima, National Institute of Genetics, DNA Data Bank of Japan; Yata 1111, Mishima, Shizuoka 411-8540, Japan (E-mail:mishima@supernig.nig.ac.jp, Tel:81-55-981-6853, Fax:81-55-981-6849) REFERENCE 2 (sites) AUTHORS Mishima,H., Shizuoka,T. and Fuji,I. TITLE Glyceraldehyde-3-phosphate dehydrogenase expressed in human liver JOURNAL Unpublished (2002) COMMENT Human cDNA sequencing project. FEATURES Location/Qualifiers source 1..450 /chromosome="12" /clone="GT200015" /clone_lib="lambda gt11 human liver cDNA (GeneTech. No.20)" /map="12p13" /mol_type="mRNA" /organism="Homo sapiens" /tissue_type="liver" CDS 86..>450 /codon_start=1 /gene="GAPD" /product="glyceraldehyde-3-phosphate dehydrogenase" /protein_id="BAA12345.1" /transl_table=1 /translation="MAKIKIGINGFGRIGRLVARVALQSDDVELVAVNDPFIT TDYMT YMFKYDTVHGQWKHHEVKVKDSKTLLFGEKEVTVFGCRNPKEIP WGETSAEFVVEYTG VFTDKDKAVAQLKGGAKKV" BASE COUNT 102 a 119 c 131 g 98 t ORIGIN 1 cccacgcgtc cggtcgcatc gcacttgtag ctctcgaccc ccgcatctca tccctcctct 61 cgcttagttc agatcgaaat cgcaaatggc gaagattaag atcgggatca atgggttcgg 121 gaggatcggg aggctcgtgg ccagggtggc cctgcagagc gacgacgtcg agctcgtcgc 181 cgtcaacgac cccttcatca ccaccgacta catgacatac atgttcaagt atgacactgt 241 gcacggccag tggaagcatc atgaggttaa ggtgaaggac tccaagaccc ttctcttcgg 301 tgagaaggag gtcaccgtgt tcggctgcag gaaccctaag gagatcccat ggggtgagac 361 tagcgctgag tttgttgtgg agtacactgg tgttttcact gacaaggaca aggccgttgc 421 tcaacttaag ggtggtgcta agaaggtctg // -------------------------------------------------------------------------------- 2.1. LOCUS line The format of LOCUS line in the flat file is shown below; --------- -------- Positions Contents --------- -------- 01-05 'LOCUS' 06-12 spaces 13-28 Locus name 29-29 space 30-40 Length of sequence, right-justified 41-41 space 42-43 'bp' 44-47 spaces 48-53 DNA, RNA, mRNA, pre-RNA, rRNA, scRNA, snRNA, snoRNA, tRNA, left justified 54-55 spaces 56-63 'linear' followed by two spaces, or 'circular' 64-64 space 65-67 The division code (see '3.1. Division categories.') 68-68 space 69-79 Date, in the form dd-MMM-yyyy (e.g., 08-JUL-2002) ------------------------------------------------------------------------------ 2.2. DEFINITION line The definition briefly describes the information of gene(s). "DEFINITION" is constructed by each of the three data banks. 2.3. ACCESSION line This line shows accession number of the entry data. A unique accession number is issued to the data submitter by each of the three data banks. The accession number is composed of 1 alphabet character and 5 digits (ex. A12345) or 2 alphabet characters and 6 digits (ex. AB123456). The former style was used in 1980s, but later the latter style was introduced because of data explosion. All the entries designated by the accession numbers with the prefixes given below have been collected and processed by DDBJ, and the rest have been done by GenBank and EMBL/EBI. ------------------------------------------------------------------------------ C, D, E, AB, AG, AK, AP, AT, AU, AV, BA, BB, BD, BJ, BP, BR, BS, BW, BY, CJ, DD, DE ------------------------------------------------------------------------------ If multiple entries are united to an entry, or if an entry is extensively modified after the submission, the responsible data banks may assign a new accession number to it. In these cases, the new accession number is called the primary accession number, and the old accession number(s) is/are called the secondary accession number(s). In the flat file, the primary accession number is indicated first, then the secondary accession number(s) follows. You can find the same updated entry with both the primary and the secondary accession numbers. 2.4. VERSION line This line consists of an accession number and a version number, like "AB123456.1", in which the digit(s) after the period is a version number. The data open to public for the first time is version number as "1". The reason for adding VERSION is that since a released sequence sometimes revised by the submitter, the accession number alone cannot specify the sequence in question causing the user a trouble. The number is increased by one every time when a revised sequence is made public. 2.5. KEYWORDS line The data banks describe this line, if necessary. In many cases, the categories of the data (EST, HTG etc.), gene names and product names included in "KEYWORDS". 2.6. SOURCE line This line shows the scientific name on organism from which the sequence is obtained and an organelle type if the sequence is derived from an organelle other than the nucleus. 2.7. REFERENCE line The information on the submitters and references related to the submitted sequence is indicated in REFERENCE line. 2.8. COMMENT line. The information about an entry that can not be described using FEATURES or the other fields. 2.9. FEATURES line Biological features of a submitted sequence data are described with "Feature" key (the biological nature of the annotated feature), "Location" (the region of the sequence which corresponds to Feature), and "Qualifier" (supplementary information about Feature). The "Feature" and "Qualifier" keys used in the present release is defined by DDBJ/EMBL/GenBank Feature Table: Definition (version 6.3, April 15, 2005). The document is continuously updated every half year. You can find its newest version on URL; http://www.ddbj.nig.ac.jp/FT/full_index.html 2.10. BASE COUNT line In the BASE COUNT line of the DDBJ flat file, 9 digits are allocated for each number of a (adenine), c (cytosine), g(guanine) and t (thymine). In the case of RNA sequence, uracil is indicated as "t" according to the rule of the international nucleotide database. In accordance with the relaxation of sequence length limitation, GenBank had already dropped the BASE COUNT line from their flat file format from GenBank Release 138 (Oct. 2003). DDBJ has decided to maintain the BASE COUNT line in our flat file format from the view that GC contents are still important information to characterize the sequence. 2.11. ORIGIN line The sequence data starts from the next line of ORIGIN. The sequence is indicated as lower case letters, delimited by space per 10 bases, starts a new line by 60 bases. The numbers described at left side of lines mean the ordinal number of the top base of the line. 3. Dataset categories There have been a number of genome projects going on worldwide. Among them human genome projects have probably been most productive and yielded a large number of ordinary sequences, huge amounts of genome sequences and EST (expressed sequence tags). Thus, we DDBJ have the human (HUM) division solely for human sequences and the primate (PRI) division for non-human primate sequences, while PRI division of GenBank database contains human sequences too. Note that the other divisions such as EST, GSS, and HTC may also contain human sequences. The present release is divided into 21 categories of organisms and others. See also '6.1. File categories' and '9. File list' below. The contents of the 21 categories are shown in the following. 3.1. Division categories The first 20 divisions are given below; HUM; human PRI; primates (other than human) ROD; rodents MAM; mammals (other than primates and rodents) VRT; vertebrates (other than mammals) INV; invertebrates (animals other than vertebrates) PLN; plants, fungi, plastids (eukaryotes other than animals) BCT; bacteria (including both Eubacteria and Archaea) VRL; viruses PHG; bacteriophages ENV; sequences obtained via environmental sampling methods SYN; synthetic constructs EST; expressed sequence tags; short single pass cDNA sequences GSS; genome survey sequences; short single pass genomic sequences HTC; high throughput cDNA sequences; The sequence submitted from full length cDNA sequencing projects. This division is to include unfinished high throughput cDNA sequences, each of which has 5'UTR and 3'UTR at both ends and part of a coding region. The sequence may also include introns. When the sequence becomes finished later, it moves to the corresponding taxonomic division. HTG; high throughput genomic sequences The sequence submitted mainly from genome sequencing projects which regarded a clone as a sequencing unit. STS; sequence tagged sites The tag site for genome sequencing. The information of chromosome, map, PCR_condition is mandatory for this division. PAT; patented data The data submitted to JPO (Japan Patent Office), EPO (European Patent Office), or USPTO (United States Patent and Trademark Office). See also '3.3. Notice for patented data' in below. UNA; the data not annotated The UNA division is not used for recently submitted sequences. CON; Contig / Constructed To conjugate a series of entries, such as those submitted from a genome project, each of data banks constructs an entry and assign an accession number to a large scale sequence dataset. Such entries are classified into the CON division. The entry in the CON division has the information of joined accession numbers instead of the sequence data. The corresponding entries of the CON entry have been submitted to other divisions. The entries and bases in the CON division are not counted in the released numbers given on the top of the release note. 3.2. TPA separated from primary dataset TPA (Third Party Annotation) data are also available. The TPA data are a complement to the existing DDBJ/EMBL/GenBank comprehensive database of primary nucleotide sequences, which typically result from direct sequencing of cDNAs, ESTs, genomic DNAs etc. Primary entries are defined to be data for which the submitting group has done the sequencing and annotation, and as 'owner' of these data has privileges to submit updates/corrections etc. Primary entries used to build a TPA sequence are those that have been experimentally determined and are publicly available in the DDBJ/EMBL/GenBank databases. They may not be from a proprietary database. The entries and bases in TPA are not counted in the released numbers given on the top of the release note. 3.3. Notice for patented data This release includes PAT division for patented data as described above. The patented data are those which the Japanese Patent Office (JPO), United States Patent and Trademark Office (USPTO), and the European Patent Office (EPO) collected, processed and released. The prefixes of accession numbers for the patented data are shown below; ----------------------- JPO : E, BD, DD USPTO: I, AR EPO : A, AX, CQ, CS ----------------------- Note also that unauthorized use of the patented data may cause legal issues for which DDBJ takes no responsibility. 4. DDBJ staff This release is published by the following DDBJ staff. Gojobori T, Tateno Y, Nishikawa K, Sugawara H, Saitou N, Okubo K, Ikeo K, Suzuki Y, Fukuchi S, Kinjo A, Barrero R, Abe T, Sumiyama K, Aono H, Atsumi T, Ejima M, Fukuda D, Gojobori M, Hikino Y, Hirai T, Honda K, Hoshi N, Ichikawa K, Ishida K, Kato T, Kawamoto T, Kohira J, Koike T, Kosuge T, Kusakabe A, Kuwana Y, Lin Y, Maesako H, Mamiya K, Maruyama N, Mashima J, Mimura K, Miyazawa S, Mizunuma T, Murakata N, Nagira S, Nagura M, Nishinomiya N, Okido T, Sakai K, Shigemoto Y, Shiozawa H, Sugiyama F, Suzuki M, Suzuki S, Tsuboi M, Tsutsui H, Yamamoto M, and Yokoyama E Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Research Organization of Information and Systems Mishima 411-8540, Japan Phone: +81 55 981 6853 FAX: +81 55 981 6849 E-mail: ddbj@ddbj.nig.ac.jp (for general inquiry) ddbjsub@ddbj.nig.ac.jp (for data submission) ddbjupdt@ddbj.nig.ac.jp (for updates and notification of publication) WWW: http://www.ddbj.nig.ac.jp/ (for DDBJ WWW server) http://sakura.ddbj.nig.ac.jp/ (for DDBJ sequence data submission system) 5. Acknowledgment We are grateful to NCBI and EMBL/EBI for a firm friendship and an excellent collaboration with us. We also thank the Japanese Patent Office for a steady cooperation with us. The operation of DDBJ is supported by the Ministry of Education, Culture, Sports, Science and Technology, and we would gratefully note this here. DDBJ uses the Super-SINET computer network for data collection, data exchange and various services. 6. File Contents 6.1. File categories This release covers 21 categories (see also '3. Dataset categories'.) of organisms and others as follows: ------------------------------------------------------------------------------ ddbjbct*** Category for bacteria ddbjcon*** Category for CON (contig sequences) ddbjenv*** Category for ENV (environmental samples) ddbjest*** Category for EST (expressed sequence tags) ddbjgss*** Category for GSS (genome survey sequences) ddbjhtc*** Category for HTC (high throughput cDNA sequences) ddbjhtg*** Category for HTG (high throughput genomic sequences) ddbjhum*** Category for human ddbjinv*** Category for invertebrates ddbjmam*** Category for mammals other than primates and rodents ddbjpat*** Category for patents ddbjphg*** Category for phages ddbjpln*** Category for plants ddbjpri*** Category for primates other than human ddbjrod*** Category for rodents ddbjsts*** Category for STS (sequence tagged sites) ddbjsyn*** Category for synthetic DNAs ddbjtpa*** Category for TPA (Third Party Annotation) ddbjuna*** Category for unannotated sequences ddbjvrl*** Category for viruses ddbjvrt*** Category for vertebrates other than mammals ------------------------------------------------------------------------------ Some of above in the present release were recorded in multiple ddbj***.seq files, each of which had 300 MB storage capacity as follows, respectively. --------------------- ddbjbct : 9 files ddbjenv : 2 files ddbjest : 300 files ddbjgss : 114 files ddbjhtc : 6 files ddbjhtg : 54 files ddbjhum : 22 files ddbjinv : 6 files ddbjpat : 18 files ddbjpln : 13 files ddbjrod : 15 files ddbjsts : 12 files ddbjvrl : 4 files ddbjvrt : 7 files --------------------- 6.2. File types The index files are not presented in this release except for ddbjacc.idx, ddbjgen.idx, ddbjjou.idx, and ddbjkey.idx. Instead, we have included a program by which to make the index files not presented in this release. For the use of the program, see the files, seq2indexes.doc, seq2indexes.c, and seq2indexes.h in this release. Each category then has the following nine types of files. Note that all the files except for ddbj***.seq are created by the user by use of seq2indexes as mentioned above. ------------------------------------------------------------------------------ ddbj***.seq List of an entry in DDBJ format, see 7.1. ddbj***.acc List of the accession numbers, see 7.2. ddbj***.aut List of the authors, see 7.3. ddbj***.dir List of the short directory in DDBJ style, see 7.4. ddbj***.idx List of indices, see 7.5. ddbj***.jou List of the journals, see 7.6. ddbj***.key List of the key words, see 7.7. ddbj***.org List of the species names, see 7.8. ddbj***.sdr List of the short directory in DDBJ style, see 7.9. ------------------------------------------------------------------------------ 7. Sample of the contents in each file 7.1. Part of the contents in the file 'ddbjbct1.seq' This shows all pieces of information on one entry in DDBJ format. ------------------------------------------------------------------------------ LOCUS D87069 993 bp mRNA linear BCT 14-APR- 2000 DEFINITION Escherichia coli mRNA for RNA polymerase sigma subunit, truncated form of sigma-38, complete cds. ACCESSION D87069 VERSION D87069.1 KEYWORDS RNA polymerase sigma subunit, truncated form of sigma-38. SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 993) AUTHORS Jishage,M. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) to the DDBJ/EMBL/GenBank databases. Miki Jishage, National Institute of Genetics, Molecular Genetics; Yata 1111, Mishima, Shizuoka 411, Japan (E-mail:mjishage@lab.nig.ac.jp, Tel:0559-81-6742, Fax:0559-81-6746) REFERENCE 2 (bases 1 to 993) AUTHORS Jishage,M. and Ishihama,A. TITLE Variation in RNA polymerase sigma subunit composition within different stocks of Escherichia coli starin W3110 JOURNAL Unpublished (1996) REFERENCE 3 AUTHORS Ivanova,A., Renshaw,M., Guntaka,R. and Eisenstark,A. TITLE DNA base sequence variability in katF (putative sigma factor) gene Escherichia coli JOURNAL Nucleic Acids Res. 20, 5479-5480 (1992) REFERENCE 4 AUTHORS Takayanagi,Y., Tanaka,K. and Takahashi,H. TITLE Structure of the 5' upstream region and the regulation of the rpoS gene of Escherichia coli JOURNAL Mol. Gen. Genet. 243, 525-531 (1994) COMMENT FEATURES Location/Qualifiers source 1..993 /mol_type="mRNA" /organism="Escherichia coli" /strain="W3110" CDS 1..810 /note="the gene has four single base changes, resulting in two amino acid substitutions and an amber mutation" /product="RNA polymerase sigma subunit, truncated form of sigma-38" /protein_id="BAA13238.1" /transl_table=11 /translation="MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEYEPS DNDLAEEE LLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVAS RRRMIESNLRLV VKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATW WIRQTIERAIMN QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVD DVSRMLRLNER ITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWL FELNAK" variation 75 /citation=[3] /replace="t" variation 97 /citation=[3] /replace="t" variation 99 /citation=[3] /replace="t" variation 808 /citation=[3] /replace="t" BASE COUNT 254 a 223 c 291 g 225 t ORIGIN 1 atgagtcaga atacgctgaa agttcatgat ttaaatgaag atgcggaatt tgatgagaac 61 ggagttgagg tttttgacga aaaggcctta gtagaatatg aacccagtga taacgatttg 121 gccgaagagg aactgttatc gcagggagcc acacagcgtg tgttggacgc gactcagctt 181 taccttggtg agattggtta ttcaccactg ttaacggccg aagaagaagt ttattttgcg 241 cgtcgcgcac tgcgtggaga tgtcgcctct cgccgccgga tgatcgagag taacttgcgt 301 ctggtggtaa aaattgcccg ccgttatggc aatcgtggtc tggcgttgct ggaccttatc 361 gaagagggca acctggggct gatccgcgcg gtagagaagt ttgacccgga acgtggtttc 421 cgcttctcaa catacgcaac ctggtggatt cgccagacga ttgaacgggc gattatgaac 481 caaacccgta ctattcgttt gccgattcac atcgtaaagg agctgaacgt ttacctgcga 541 accgcacgtg agttgtccca taagctggac catgaaccaa gtgcggaaga gatcgcagag 601 caactggata agccagttga tgacgtcagc cgtatgcttc gtcttaacga gcgcattacc 661 tcggtagaca ccccgctggg tggtgattcc gaaaaagcgt tgctggacat cctggccgat 721 gaaaaagaga acggtccgga agataccacg caagatgacg atatgaagca gagcatcgtc 781 aaatggctgt tcgagctgaa cgccaaatag cgtgaagtgc tggcacgtcg attcggtttg 841 ctggggtacg aagcggcaac actggaagat gtaggtcgtg aaattggcct cacccgtgaa 901 cgtgttcgcc agattcaggt tgaaggcctg cgccgtttgc gcgaaatcct gcaaacgcag 961 gggctgaata tcgaagcgct gttccgcgag taa // ------------------------------------------------------------------------------ 7.2. Part of the contents in the file 'ddbjbct1.acc' The first column refers to the secondary accession number, second column to the locus name, and third to the primary accession number. The primary number may be the same as the secondary number. They are arranged in the ascending order of the secondary accession numbers. ------------------------------------------------------------------------------ D00681 -> AB028210 AB028210 D10012 -> AB010832 AB010832 D10013 -> AB010832 AB010832 D10048 -> AB008452 AB008452 D13563 -> AB018435 AB018435 D13614 -> AB006206 AB006206 D13762 -> AB063629 AB063629 D14537 -> AB040412 AB040412 D14604 -> AB001637 AB001637 D14607 -> AB027308 AB027308 ------------------------------------------------------------------------------ 7.3. Part of the contents in the file 'ddbjbct1.aut' For each author name given on the left to the arrow, the corresponding locus name and primary accession number are respectively listed on the right. They are arranged in the alphabetical order of the author names. ------------------------------------------------------------------------------ Aarestrup,F.M.Threlfall,E.J. -> AF393510 AF393510 Aarnikunnas,J. -> AY090766 AY090766 Aarnio,T. -> AY792975 AY792975 Aarnio,T. -> AY792976 AY792976 Aarnio,T. -> AY792977 AY792977 Aarnio,T. -> AY792978 AY792978 Aarnio,T. -> AY792979 AY792979 Aarnio,T. -> AY792980 AY792980 Aarnio,T. -> AY792981 AY792981 Aarnio,T. -> AY792982 AY792982 ------------------------------------------------------------------------------ 7.4. Part of the short directory in DDBJ style in the file 'ddbjbct1.dir' For each locus name given in the first column, the corresponding primary accession number, molecular type, number of nucleotide pairs, and description for the locus are respectively listed. They are arranged in the alphabetical order of the locus names. ------------------------------------------------------------------------------ AAC133631 AJ133631 DNA 1482 Alicyclobacillus acidoterrestris 16S rRNA gene, strain DSM 3922T. AAC133789 AJ133789 DNA 3097 Alicyclobacillus acidocaldarius cyclomaltodex trinase gene region. AAC243194 AJ243194 DNA 1720 Alicyclobacillus acidocaldarius kdpA gene. AAC252160 AJ252160 DNA 1638 Alicyclobacillus acidocaldarius cysA gene for putative ABC-transporter ATP-binding protein. AAC252161 AJ252161 DNA 8690 Alicyclobacillus acidocaldarius maltose/maltodextrine transport gene region (malEFGR genes, cdaA gene and glcA gene). AAC289685 AJ289685 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain CCUG13227 T (ATCC33384, NCTC9710). AAC289686 AJ289686 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain HK666. AAC289687 AJ289687 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain HK1662. AAC289694 AJ289694 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain HK1651. ------------------------------------------------------------------------------ 7.5. Part of the contents in the file 'ddbjbct1.idx' The first column refers to the locus name, second column to the starting site of the locus in byte, and third to its ending site in byte. They are arranged in the alphabetical order of the locus names. ------------------------------------------------------------------------------ %***************************** #AAC133631 1125664347 1125667802 #AAC133789 1125786300 1125794416 #AAC243194 1136871421 1136876093 #AAC252160 1146107711 1146112332 #AAC252161 1146112333 1146131350 #AAC289685 1155353186 1155356047 #AAC289686 1155356048 1155358781 #AAC289687 1155358782 1155361517 #AAC289694 1155377972 1155380768 #AAC308623 1167555889 1167560983 ------------------------------------------------------------------------------ 7.6. Part of the contents in the file 'ddbjbct1.jou' This gives information on the journal in which sequence data were published. ------------------------------------------------------------------------------ Yi Chuan Xue Bao 29 (12), 1111-1117 (2002) -> AY601668 AY601668 Yi Chuan Xue Bao 29 (12), 1111-1117 (2002) -> AY603420 AY603420 Yi Chuan Xue Bao 30 (4), 364-369 (2003) -> AF526379 AF526379 Yonsei Med. J. 39 (6), 520-525 (1998) -> AF373217 AF373217 Yonsei Med. J. 39 (6), 520-525 (1998) -> AF373218 AF373218 Z. Lebensm.-Unters. -Forsch., A Eur. Food Res. Technol. 209, 83-87(1999). -> ABA 7623 AJ007623 Z. Lebensm.-Unters. -Forsch., A Eur. Food Res. Technol. 209, 83-87(1999). -> ABA 7624 AJ007624 Zb. Bioteh. Fak. Univ. Ljubl. Kmet. Supl. 79(1), 107-113(2002). -> ASP012466 AJ 012466 Zb. Bioteh. Fak. Univ. Ljubl. Kmet. Supl. 79(1), 19-26(2002). -> ASP012466 AJ01 2466 Zentralbl. Bakteriol. 286 (1), 1-8 (1997) -> AF192489 AF192489 Zentralbl. Bakteriol. 290, 37-49(2000). -> LPN7311 AJ007311 Zentralbl. Bakteriol. 291, 345-352(2001). -> LPN300467 AJ300467 Zentralbl. Bakteriol. 292, 207-214(2002). -> ECO459584 AJ459584 Zhi Wu Bao Hu Xue Hui Hui Kan 44, 233-244 (2002) -> AF540014 AF540014 Zhi Wu Bing Li Xue Bao 34 (1), 80-85 (2004) -> AY447045 AY447045 Zhi Wu Bing Li Xue Hui Kan 12, 57-64 (2003) -> AF450275 AF450275 Zhiwu Baohuxue Hui Huikan 44, 185-208 (2002) -> AY122057 AY122057 Zhiwu Binglixue Huikan 6, 207-208 (1997) -> AF149819 AF149819 Zhongguo Jiaqin 25 (Supplement 1), 60-71 (2003) -> AY615358 AY615358 Zhongguo Kang Sheng Su Za Zhi 21, 94-97 (2003) -> AY293073 AY293073 Zhongguo Kang Sheng Su Za Zhi 28, 96-100 (2003) -> AY293071 AY293071 Zhongguo Kang Sheng Su Za Zhi 28, 96-100 (2003) -> AY293072 AY293072 Zhongguo Lin Chuang Yao Li Xue Za Zhi 19, 190-195 (2003) -> AY536040 AY536040 Zhongguo Nong Ye Ke Xue 36(1), 17-25 (2003) -> AY555768 AY555768 Zhonghua Liu Xing Bing Xue Za Zhi 24, 291-295 (2003) -> AY279003 AY279003 Zhonghua Min Guo Wei Sheng Wu Ji Mian Yi Xue Za Zhi (2003) In press -> AY299484 AY299484 Zhonghua Min Guo Wei Sheng Wu Ji Mian Yi Xue Za Zhi 22 (5), 489-492 (2002) -> AY 382160 AY382160 Zool. Sci. 14, 701-706 (1997) -> AB002286 AB002286 Zool. Sci. 14, 701-706 (1997) -> AB002287 AB002287 Zool. Sci. 14, 701-706 (1997) -> AB002288 AB002288 Zool. Sci. 14, 701-706 (1997) -> AB002289 AB002289 Zool. Sci. 14, 701-706 (1997) -> AB002290 AB002290 Zool. Sci. 14, 701-706 (1997) -> AB002291 AB002291 Zoolog. Sci. 17, 983-989 (2000) -> AB038366 AB038366 Zoolog. Sci. 17, 983-989 (2000) -> AB038367 AB038367 Zoolog. Sci. 17, 983-989 (2000) -> AB038368 AB038368 Zoolog. Sci. 17, 983-989 (2000) -> AB038369 AB038369 Zoolog. Sci. 17, 983-989 (2000) -> AB038370 AB038370 ------------------------------------------------------------------------------ 7.7. Part of the contents in the file 'ddbjbct1.key' For the locus and accession number respectively given on the right to the arrow, the corresponding keywords are listed on the left. ------------------------------------------------------------------------------ Synechococcus sp. DNA for intrinsic membrane protein, malK-like protein, cyanase, complete cds. -> AB000100 AB000100 cynS; cyanase; cynD; malK-like protein; cynB; intrinsic membrane protein. -> AB000100 AB000100 Direct Submission -> AB000100 AB000100 Identification and nitrogen regulation of the cyanase gene from the cyanobacteri a Synechocystis sp. strain PPC 6803 and Synechococcus sp. strain PPC 7942 -> AB000100 AB000100 Sequence updated (31-Mar-1997) by: Tatsuo Omata Sequence updated (14-Aug-1997) -> AB000100 AB000100 Sphingomonas sp. 16S ribosomal RNA. -> AB000106 AB000106 16S rRNA. -> AB000106 AB000106 Direct Submission -> AB000106 AB000106 Sphingomonas sp. VT1 16s rRNA -> AB000106 AB000106 Synechococcus sp. gene for ribosomal proteins, complete cds. -> AB000111 AB000111 tRNA pseudouridine synthase I; 50S ribosomal protein L17; DNA-dircted RNA polymerase alpha chain; 30S Ribosomal Protein S11; 30S ribosomal protein S13; 50S ribosomal protein L36; adenylate kinase; preprotein translocase SecY subunit; 50S ribosomal protein L15; 30S ribosomal protein S5; 50S ribosomal protein L18; 50S ribosomal protein L6; 30S ribosomal protein S8; 50S ribosomal protein L5; 50S ribosomal protein L24; 50S ribosomal protein L14; 30S ribosomal rotein S17; 50S ribosomal protein L29; 50S ribosomal protein L16; 30S ribosomal protein S3; 50S ribosomal protein L22; 30S ribosomal protein S19; 50S ribosomal protein L2; 50S ribosomal protein L23; 50S ribosomal protein L4; 50S ribosomal protein L3. -> AB000111 AB000111 Direct Submission -> AB000111 AB000111 ------------------------------------------------------------------------------ 7.8. Part of the contents in the file 'ddbjbct1.org' For the locus and accession number respectively given on the right to the arrow, the corresponding taxonomic names are listed on the left. They are arranged in the alphabetical order of the species names. ------------------------------------------------------------------------------ 'Flavobacterium' lutescens 'Flavobacterium' lutescens Bacteria; Proteobacteria; gamma subdivision; Pseudomonadaceae; Pseudomonas. -> AB035478 AB035478 'Flavobacterium' lutescens 'Flavobacterium' lutescens Bacteria; Proteobacteria; gamma subdivision; Pseudomonadaceae; Pseudomonas. -> AB042983 AB042983 'Fragaria multicipita' phytoplasma 'Fragaria multicipita' phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma. -> AF036354 AF036354 'Fragaria multicipita' phytoplasma 'Fragaria multicipita' phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma. -> AF190224 AF190224 'Fragaria multicipita' phytoplasma 'Fragaria multicipita' phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma. -> AF190225 AF190225 'Helichrysum bracteatum' phytoplasma 'Helichrysum bracteatum' phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma. -> AF515771 AF515771 'Momordica charantia' phytoplasma 'Momordica charantia' phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma. -> PHY431368 AJ431368 'Rehmannia glutinosa var. purpurea' phytoplasma 'Rehmannia glutinosa var. purpurea' phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma. -> AF335107 AF335107 'Rhizomonas' sp. 'Rhizomonas' sp. Bacteria; Proteobacteria; Alphaproteobacteria; Sphingomonadales; Sphingomonadaceae; Sphingomonas. -> UBA132327 AJ132327 'Vinca minor' phytoplasma 'Vinca minor' phytoplasma Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma. -> AY144608 AY144608 ------------------------------------------------------------------------------ 7.9. Part of the short directory file in DDBJ style in the file 'ddbjbct1.sdr' The short directory file contains brief descriptions of all of the sequence entries contained in the DDBJ style. ------------------------------------------------------------------------------ AC133631 Alicyclobacillus acidoterrestris 16S rRNA gene, strain DSM 1482bp AAC133789 Alicyclobacillus acidocaldarius cyclomaltodextrinase gene 3097bp AAC243194 Alicyclobacillus acidocaldarius kdpA gene. 1720bp AAC252160 Alicyclobacillus acidocaldarius cysA gene for putative 1638bp AAC252161 Alicyclobacillus acidocaldarius maltose/maltodextrine 8690bp AAC289685 Actinobacillus actinomycetemcomitans partial infB gene for 453bp AAC289686 Actinobacillus actinomycetemcomitans partial infB gene for 453bp AAC289687 Actinobacillus actinomycetemcomitans partial infB gene for 453bp AAC289694 Actinobacillus actinomycetemcomitans partial infB gene for 453bp AAC308623 Alicyclobacillus acidocaldarius celA gene for cellulase. 1778bp AAC417690 Actinobacillus actinomycetemcomitans mukB gene. 4491bp AAC419840 Acetobacter aceti 16S rRNA gene, strain LMG 1531. 1440bp AAC430786 Actinobacillus actinomycetemcomitans partial fur gene for 1246bp AAC493667 Alicyclobacillus acidocaldarius subsp. rittmannii 16S rRNA 1472bp AAC496806 Alicyclobacillus acidocaldarius 16S rRNA gene, strain DSM 1507bp ------------------------------------------------------------------------------ In addition to the 9 tables the four following index files are included in this release. These files were prepared for BCT, EST, GSS, HTC, HTG, HUM, INV, MAM, PAT, PHG, PLN, PRI, ROD, STS, SYN, UNA, VRL, VRT divisions. Accession number index file Keyword phrase index file Journal citation index file Gene name index file A brief description is given for each file in the following. 7.10. Part of the accession number index file in the 'ddbjacc.idx' The following excerpt from the accession number index file illustrates the format of the index. ------------------------------------------------------------------------------ D00001 ECPBPA BCT X04516 D00002 ECPYRC BCT X04469 D00003 HUMP450M HUM D00003 D00004 FLBFLBL40 VRL D00004 D00005 IBAMEM682 VRL D00005 D00006 BACPNS1981 BCT D00006 D00007 CHKCALGRP VRT D00007 D00008 ECPNTAB BCT X04195 D00009 DROPER1 INV D00009 ------------------------------------------------------------------------------ 7.11. Part of the keyword phrase index file in the 'ddbjkey.idx' Keyword phrases consist of names for gene products and other characteristics of sequence entries. ------------------------------------------------------------------------------ "COAT PROTEIN SMO511347 VRL AJ511347 'TNPA GENE UBA564903 BCT AJ564903 'ZINC-FINGER' MOTIF PRNS53 VRL X60546 (+) MATING TYPE SURFACE PROTEIN ABGPSSP PLN M94861 (1,3 TABETGLUB PLN Z22874 (1,3)-BETA-D-GLUCAN BINDING PROTEIN AJ606470 INV AJ606470 (1,3)BETA-GLUCAN SYNTHASE NCU09275 PLN U09275 (1,4)-BETA-D-ARABINOXYLAN ARABINOFURANOHYDROLASE ANAXHA PLN Z78011 ANTUAXHA PLN Z78010 (1,6)-BETA-GLUCAN BIOSYNTHESIS YSAKRE1A PLN M81588 (1-3)-BETA-GLUCANASE NTSP41AGN PLN X81560 PA13BGPT PLN X57794 (1-3,1-4)-BETA-D-GLUCANASE HVBDG PLN X52572 (1-4)-BETA-MANNAN ENDOHYDROLASE CAR278996 PLN AJ278996 CAR293305 PLN AJ293305 (2',5'-OLIGOISOADENYLATE SYNTHETASE-DEPENDENT) AL138776 HUM AL138776 (2'-5') OLIGO(A) SYNTHASE E16 SSO4G06 EST F14610 (2'-5')OLIGOADENYLATE SYNTHETASE HSA225089 HUM AJ225089 HUMSYN25A HUM D00068 SSA225090 MAM AJ225090 (6')-IB' AMINOGLYCOSIDE ACETYLTRANSFERASE AXY278514 BCT AJ278514 PAE291609 BCT AJ291609 (8,11)-LINOLEOYL DESATURASE COF245938 PLN AJ245938 ------------------------------------------------------------------------------ 7.12. Part of the journal citation index file in 'ddbjjou.idx' The journal citation index file lists all of the citations that appear in the references. ------------------------------------------------------------------------------ (ER) AAPS PHARMSCI. 4 (3), DOI 10.1208/PS040315 (2002) AY170916 ROD AY170916 (ER) AM. J. HUM. GENET. 76 (1) (2004) IN PRESS AY753209S1 HUM AY753209 AY753209S2 HUM AY753210 (ER) ARCH. VIROL. (2004) IN PRESS AF531505 VRL AF531505 AY518899 VRL AY518899 AY518900 VRL AY518900 AY518901 VRL AY518901 AY518902 VRL AY518902 AY518903 VRL AY518903 AY518904 VRL AY518904 AY518905 VRL AY518905 AY518906 VRL AY518906 AY518907 VRL AY518907 AY518908 VRL AY518908 AY518909 VRL AY518909 AY518910 VRL AY518910 AY518911 VRL AY518911 AY518912 VRL AY518912 AY518913 VRL AY518913 AY518914 VRL AY518914 AY518915 VRL AY518915 AY518916 VRL AY518916 AY518917 VRL AY518917 AY518918 VRL AY518918 AY518919 VRL AY518919 AY518920 VRL AY518920 AY518921 VRL AY518921 AY518922 VRL AY518922 AY518923 VRL AY518923 AY518924 VRL AY518924 AY518925 VRL AY518925 AY518926 VRL AY518926 AY518927 VRL AY518927 AY518928 VRL AY518928 AY518929 VRL AY518929 AY518930 VRL AY518930 AY518931 VRL AY518931 AY518932 VRL AY518932 AY521234 VRL AY521234 AY521235 VRL AY521235 AY521236 VRL AY521236 AY521237 VRL AY521237 AY521238 VRL AY521238 (ER) ARTERIOSCLER. THROMB. VASC. BIOL. (2004) IN PRESS AY563557 HUM AY563557 (ER) BIOCHEM. BIOPHYS. RES. COMMUN. 325 (1), 203-214 (2004) AY563137 HUM AY563137 (ER) BIOCHEM. J./10.1042/BJ20030293 HSA496460 HUM AJ496460 ------------------------------------------------------------------------------ 7.13. Part of the gene name index file in 'ddbjgen.idx' This file lists all the gene names that appear in the feature table. ------------------------------------------------------------------------------ 'ARR BX927156 BCT BX927156 'BGLG BX927156 BCT BX927156 'BGLS BX927148 BCT BX927148 'BGLY' BX927156 BCT BX927156 'BRNQ AF305888 BCT AF305888 'COMK AL591983 BCT AL591983 AL596172 BCT AL596172 'CRCB BX927155 BCT BX927155 'CRTI BX927155 BCT BX927155 'DPPE LDDIPEP BCT Z34898 'FIC BX936398 BCT BX936398 ------------------------------------------------------------------------------ 8. Release history Release Date Entries Bases Comments 62 06/05 45,249,444 49,158,155,283 ENV started Version for release note started 61 03/05 43,118,204 47,099,081,750 Style of release note changed 60 12/04 40,583,945 44,416,752,273 /db_xref="H-inv:**" started 59 09/04 37,926,117 42,245,956,937 58 06/04 34,917,581 39,812,635,108 57 03/04 32,693,678 38,008,449,840 56 12/03 30,405,173 36,079,046,032 55 09/03 27,753,140 34,280,225,489 54 06/03 25,149,821 32,162,041,177 53 02/03 23,250,813 29,711,299,332 52 12/02 20,354,812 26,931,456,316 51 09/02 18,401,358 22,782,404,136 TPA started 50 06/02 17,260,693 20,158,357,982 49 04/02 16,503,157 18,579,627,226 48 01/02 15,016,100 16,197,713,855 47 10/01 13,266,610 14,145,671,645 46 07/01 12,313,759 13,037,646,166 45 04/01 11,434,113 12,207,092,905 HTC division started 44 01/01 10,165,597 11,136,298,841 43 10/00 8,666,551 10,034,532,698 42 07/00 7,554,995 8,880,721,093 41 04/00 5,962,608 6,409,581,885 CON division started 40 01/00 5,388,125 4,762,696,173 RNA division terminated 39 10/99 4,810,773 3,728,000,562 NID and PID discarded 38 07/99 4,294,369 3,098,519,597 37 03/99 3,311,627 2,375,261,951 VERSION, /protein_id started 36 01/99 3,073,166 2,190,425,560 35 10/98 2,759,261 1,957,341,169 34 07/98 2,412,785 1,708,580,623 33 04/98 2,174,769 1,479,303,279 32 01/98 1,956,669 1,300,950,613 31 10/97 1,731,532 1,139,869,464 Adoption of the unified taxonomy database 30 07/97 1,534,115 992,788,339 NID and PID terminated 29 04/97 1,270,194 841,415,232 28 01/97 1,154,120 756,785,219 HTG division started ORG division terminated 27 10/96 936,697 608,103,057 GSS division started 26 07/96 835,552 551,932,448 25 04/96 744,490 499,300,364 /translation started 24 01/96 637,508 431,771,652 23 10/95 569,757 390,694,350 22 07/95 437,588 322,982,425 HUM division started 21 04/95 274,596 250,875,023 20 01/95 239,689 231,299,557 19 10/94 204,332 205,274,131 18 07/94 185,230 192,473,021 17 04/94 169,957 179,942,209 16 01/94 154,626 165,017,628 15 10/93 131,649 147,224,690 14 07/93 120,350 138,686,333 13 04/93 112,067 129,784,445 12 01/93 97,683 120,815,244 EST division started 11 07/92 65,693 84,839,075 10 01/92 59,317 77,805,556 GenBank/EMBL inclusion started 9 07/91 1,130 2,002,124 8 01/91 879 1,573,442 7 07/90 681 1,154,211 6 01/90 496 841,236 5 07/89 395 679,378 4 01/89 302 535,985 3 07/88 230 345,850 2 01/88 142 199,392 1 07/87 66 108,970 Started with DDBJ only ------------------ Since release 61 ------------------ The style of release note (this file) has considerably been changed. Some entries have the sequential format for the secondary accession numbers in the ACCESSION line to make the expression of secondary accession numbers in the past short. For example; ------------------------------------------------------------------------------ Before; ACCESSION AB000802 D85885 D85886 D85887 After; ACCESSION AB000802 D85885-D85887 ------------------------------------------------------------------------------ See also '2.3. ACCESSION line'. ------------------ Since release 60 ------------------ The cross-reference to the H-invitational has been included. ------------------ Since release 56 ------------------ The three data banks have agreed that the maximum length limitation (350 kb) of a submitted sequence be relaxed. The BASE COUNT line of the DDBJ flat file format has been changed, corresponding to the relaxation of the maximum sequence length restriction in the entry that had been practiced at DDBJ/EMBL/GenBank International Nucleotide Sequence Databases. In the BASE COUNT line of the DDBJ flat file, 6 digits had been allocated for each number of a, c, g, t and other bases in the sequence. Hereafter, in the new flat file format, 9 digits are allocated for each number of a, c, g and t, while the numbers of other bases are removed. In accordance with the relaxation of sequence length limitation, GenBank had already dropped the BASE COUNT line from their flat file format from GenBank Release 138 (Oct. 2003). We DDBJ have decided to maintain the BASE COUNT line in our flat file format from the view that GC contents are still important information to characterize the sequence. The changes in the BASE COUNT line are shown below. ---------------------------------------------------------------------------- Old (-rel. 55): 1 6 11 16 21 26 31 36 41 46 51 56 61 66 71 |----|----|----|----|----|----|----|----|----|----|----|----|----|----| BASE COUNT 123456 a 123456 c 123456 g 123456 t 123456 others New (rel. 56-): 1 6 11 16 21 26 31 36 41 46 51 56 61 66 71 |----|----|----|----|----|----|----|----|----|----|----|----|----|----| BASE COUNT 123456789 a 123456789 c 123456789 g 123456789 t ---------------------------------------------------------------------------- The SOURCE in the flat file is revisited and revised if necessary in accordance with the unified taxonomy database common to the three data banks. ------------------ Since release 54 ------------------ '/sequenced_mol' qualifier has been changed to '/mol_type' qualifier. We accordingly completed retrofitting the pertinent entries. This change was made on the agreement at the international collaborative meeting in 2002. ------------------ Since release 51 ------------------ The TPA (Third Party Annotation) dataset has been available. The dataset is a complement to the existing DDBJ/EMBL/GenBank database of the primary nucleotide sequences which were obtained from direct sequencing of cDNAs, ESTs, genomic DNAs etc. The format of LOCUS line in the flat file has been changed as shown below to adjust to the GenBank format. ------------------------------------------------------------------------------ Old (-rel. 50): LOCUS AB000001 660 bp DNA PLN 01-FEB-2001 New (rel. 51-): LOCUS AB000001 660 bp DNA linear PLN 01-FEB-2001 ------------------------------------------------------------------------------ ------------------ Since release 45 ------------------ The HTC (High Throughput cDNA) division has been included. This is to include unfinished high throughput cDNA sequences, each of which has 5'UTR and 3'UTR at both ends and part of a coding region. The sequence may also include introns. When the sequence becomes finished later, it moves to the corresponding taxonomic division. The sequence is accompanied with a keyword, HTC (High Throughput cDNA), which is dropped when the sequence is finished and moved to a taxonomic division. ------------------ Since release 41 ------------------ The CON division has been included. This division is to show the order of related sequences in a genome, and expressed by join and the accession numbers of the sequences. The contents of the CON division are compiled by the three data banks not by the data submitter. ------------------ Since release 40 ------------------ The RNA division was terminated . The RNA data have been redistributed according to the category of the organism. Therefore, you will find a human RNA sequence, for example, in the HUM division. ------------------ Since release 37 ------------------ The three data banks include the item VERSION in the flat file, which indicates a version of a submitted nucleotide sequence. It is expressed like AB123456.1, in which the digit(s) after the period is a version number. The reason for adding VERSION is that since a released sequence sometimes revised by the submitter, the accession number alone cannot specify the sequence in question causing the user a trouble. The number is increased by one every time when a revised sequence is made public. Accordingly, the translated protein sequence will be accompanied with a /protein_id which is expressed as BAA12345.1, in which the digit(s) after the period is again a version number. The number is increased by one when the corresponding nucleotide sequence is revised and the protein sequence is changed as a result, and when the revised protein sequence is made public. ------------------ Since release 31 ------------------ We have started adopting the unified taxonomy database to unify the biological source of the sequence. The database is made up with scientific names, ID of unidentified organisms, and synthetic constructs etc. ------------------ Since release 30 ------------------ NID and PID were terminated. This change was made on the agreement at the international collaborative meeting in 1999. ------------------ Since release 28 ------------------ The HTG (High Throughput Genomic sequence) has been included. This division was created to cope with genome project teams which deal with a clone as a sequencing unit. We terminated the ORG (Organelle) division. Thus, if you are interested in human mitochondrial sequences, for example, you are now advised to refer to the HUM division. ------------------ Since release 27 ------------------ The GSS division has been included. GSS stands for Genome Survey Sequence, which is similar to EST, except that GSS is genomic DNA whereas EST is cDNA. ------------------ Since release 25 ------------------ DDBJ release contains amino acid sequences that were translated from the corresponding nucleotide sequences. In the translation we paid much attention to the fact that some species or organella have a codon different from the universal one, and used the proper codon table. ------------------ Since release 22 ------------------ The HUM division has been included. Human genome projects have probably been most productive and yielded a large number of sequences. Thus, we have the human (HUM) division solely for human sequences and the primate (PRI) division for non-human primate sequences. ------------------ Since release 12 ------------------ The EST (Expressed Sequence Tag) division has been included. The number of ESTs has been increasing at an enormous rate and is expected to be growing even more rapidly in the future. Thus, we created a division for ESTs ------------------ Since release 10 ------------------ The sequences submitted to GenBank or EMBL have been included in the DDBJ database. 9. File list. The files in this release are arranged in the following order with non-labeled format. ----------------------------------------------------------------------- file name number of entries number of bases file size ----------------------------------------------------------------------- ddbjrel.txt (DDBJ release note) 97060 ddbjacc.idx (Accession number index file) 1764385053 ddbjgen.idx (Gene name index file) 75040226 ddbjjou.idx (Journal citation index file) 1879630040 ddbjkey.idx (Keyword phrase index file) 1663655032 ----------------------------------------------------------------------- file name number of entries number of bases file size ----------------------------------------------------------------------- ddbjbct1.seq 35051 118744679 299011333 ddbjbct2.seq 7153 130249717 299188217 ddbjbct3.seq 692 136290044 299000643 ddbjbct4.seq 66669 105809720 299001446 ddbjbct5.seq 32406 116286262 299001462 ddbjbct6.seq 51757 116477133 299802556 ddbjbct7.seq 458 129705309 299885582 ddbjbct8.seq 7321 130854777 299001497 ddbjbct9.seq 33433 73272839 194160261 ddbjenv1.seq 114027 85704440 299000344 ddbjenv2.seq 10452 11220811 30609734 ddbjest1.seq 89191 33694302 299001244 ddbjest2.seq 94408 37822048 299001937 ddbjest3.seq 94451 36816309 299001412 ddbjest4.seq 89497 28249390 299000238 ddbjest5.seq 93386 35875420 299000087 ddbjest6.seq 96879 38445490 299000351 ddbjest7.seq 97789 37886387 299001222 ddbjest8.seq 97422 37461451 299002141 ddbjest9.seq 97129 38384682 299001040 ddbjest10.seq 98824 38850153 299002524 ddbjest11.seq 96735 38410869 299002130 ddbjest12.seq 96049 42810896 299001233 ddbjest13.seq 102947 42225630 299002606 ddbjest14.seq 101541 40600693 299002653 ddbjest15.seq 97337 40360538 299002081 ddbjest16.seq 95196 41887290 299000596 ddbjest17.seq 95377 39225234 299000745 ddbjest18.seq 97136 41496830 299001128 ddbjest19.seq 96231 43021845 299002352 ddbjest20.seq 93695 37925276 299001342 ddbjest21.seq 100482 44817082 299001250 ddbjest22.seq 130742 60497322 299002229 ddbjest23.seq 121317 63675573 299000212 ddbjest24.seq 98009 53782232 299002395 ddbjest25.seq 99804 76621606 299001452 ddbjest26.seq 119035 61368927 299000769 ddbjest27.seq 121276 62867435 299001612 ddbjest28.seq 123742 61705321 299000271 ddbjest29.seq 126674 57990876 299001467 ddbjest30.seq 97890 33998401 299001465 ddbjest31.seq 92634 24596417 299001754 ddbjest32.seq 82930 24666976 299001315 ddbjest33.seq 60670 16569595 299004079 ddbjest34.seq 60630 16355688 299003588 ddbjest35.seq 102908 42417834 299002210 ddbjest36.seq 118658 55242185 299001365 ddbjest37.seq 105538 54535024 299002602 ddbjest38.seq 110741 53591956 299001384 ddbjest39.seq 123443 62921689 299000231 ddbjest40.seq 96697 41654590 299000544 ddbjest41.seq 86857 38138824 299004300 ddbjest42.seq 91616 39428212 299001594 ddbjest43.seq 101898 41300688 299001183 ddbjest44.seq 92505 36198224 299002258 ddbjest45.seq 84564 37736190 299002035 ddbjest46.seq 95033 42284791 299001650 ddbjest47.seq 96429 43432315 299001354 ddbjest48.seq 96120 34796289 299001739 ddbjest49.seq 108254 48019381 299000622 ddbjest50.seq 65208 19499878 299002017 ddbjest51.seq 60358 17513466 299000345 ddbjest52.seq 60475 18602823 299002554 ddbjest53.seq 60450 18921130 299002938 ddbjest54.seq 60492 18883497 299001080 ddbjest55.seq 60417 18087564 299003095 ddbjest56.seq 61100 18613320 299002270 ddbjest57.seq 61797 18848292 299002538 ddbjest58.seq 61697 19494269 299004761 ddbjest59.seq 62473 17501017 299000385 ddbjest60.seq 58145 34932973 299003855 ddbjest61.seq 55066 24741318 299004379 ddbjest62.seq 54657 24219721 299002880 ddbjest63.seq 54181 22492681 299003093 ddbjest64.seq 60722 24670906 299002063 ddbjest65.seq 93240 39367846 299001020 ddbjest66.seq 95520 38593616 299001201 ddbjest67.seq 96121 54159405 299000630 ddbjest68.seq 98363 52511905 299000262 ddbjest69.seq 97360 47508492 299001364 ddbjest70.seq 94616 52857656 299003064 ddbjest71.seq 93692 45336185 299002602 ddbjest72.seq 95704 54206845 299002674 ddbjest73.seq 92172 46860699 299002471 ddbjest74.seq 89936 54818978 299001166 ddbjest75.seq 89799 44217011 299002975 ddbjest76.seq 93646 54724646 299001826 ddbjest77.seq 92131 56223357 299003791 ddbjest78.seq 91291 54947971 299001606 ddbjest79.seq 90831 37612782 299001599 ddbjest80.seq 85007 41920860 299002716 ddbjest81.seq 87089 48822910 299000387 ddbjest82.seq 88753 57166910 299003009 ddbjest83.seq 96289 43135383 299002666 ddbjest84.seq 95455 34628114 299002871 ddbjest85.seq 94799 42465132 299001730 ddbjest86.seq 84140 42728289 299002334 ddbjest87.seq 100814 59622000 299000272 ddbjest88.seq 100273 57423440 299000992 ddbjest89.seq 85907 53477687 299001359 ddbjest90.seq 93427 61064982 299000067 ddbjest91.seq 89161 56103370 299001182 ddbjest92.seq 94032 56271704 299001759 ddbjest93.seq 95091 60117575 299000119 ddbjest94.seq 94002 63071182 299000922 ddbjest95.seq 97267 57647251 299002277 ddbjest96.seq 97083 37547165 299002669 ddbjest97.seq 103882 55848335 299001260 ddbjest98.seq 95155 55367478 299002478 ddbjest99.seq 82470 43274265 299002729 ddbjest100.seq 92192 51507778 299001085 ddbjest101.seq 84322 45173945 299000412 ddbjest102.seq 90381 53657436 299002098 ddbjest103.seq 92649 58585668 299002951 ddbjest104.seq 86829 51933416 299002358 ddbjest105.seq 93140 53115107 299002092 ddbjest106.seq 82810 49871185 299000285 ddbjest107.seq 97423 53423266 299002467 ddbjest108.seq 115325 65016335 299002744 ddbjest109.seq 97398 50848598 299002051 ddbjest110.seq 130048 68735182 299001070 ddbjest111.seq 122936 68309452 299000233 ddbjest112.seq 95581 56669162 299001235 ddbjest113.seq 114181 66109971 299002280 ddbjest114.seq 119226 71487604 299002245 ddbjest115.seq 102956 60660410 299001183 ddbjest116.seq 85891 43388433 299002512 ddbjest117.seq 79483 35813985 299000305 ddbjest118.seq 75137 35908400 299000270 ddbjest119.seq 81044 41077484 299000263 ddbjest120.seq 87388 52722363 299002828 ddbjest121.seq 87335 56933450 299001973 ddbjest122.seq 98408 63485385 299000330 ddbjest123.seq 78356 40801646 299003840 ddbjest124.seq 82686 48098125 299001290 ddbjest125.seq 84206 40800478 299001559 ddbjest126.seq 81758 57996304 299002868 ddbjest127.seq 110961 53488276 299000470 ddbjest128.seq 104128 58289375 299000488 ddbjest129.seq 119452 69213004 299002139 ddbjest130.seq 117267 65897127 299002763 ddbjest131.seq 97137 45881450 299000582 ddbjest132.seq 94064 43405061 299000744 ddbjest133.seq 111638 53989873 299002662 ddbjest134.seq 102536 42360362 299000008 ddbjest135.seq 95605 51766092 299002864 ddbjest136.seq 87427 52885183 299000160 ddbjest137.seq 77437 49197361 299000407 ddbjest138.seq 90714 45783038 299000792 ddbjest139.seq 97337 42873203 299002991 ddbjest140.seq 94659 54730787 299001577 ddbjest141.seq 87576 44381280 299003398 ddbjest142.seq 87771 59574707 299000749 ddbjest143.seq 87592 55365978 299001666 ddbjest144.seq 88171 45430030 299002064 ddbjest145.seq 89286 74626457 299000816 ddbjest146.seq 89297 44243333 299003316 ddbjest147.seq 89997 57989343 299000173 ddbjest148.seq 88406 71879855 299002325 ddbjest149.seq 82615 60642429 299001359 ddbjest150.seq 82365 60609033 299003389 ddbjest151.seq 83711 59142710 299001583 ddbjest152.seq 82139 61354963 299002931 ddbjest153.seq 81876 48656770 299002690 ddbjest154.seq 82838 46628055 299001640 ddbjest155.seq 94766 51459261 299000800 ddbjest156.seq 107490 69460902 299001228 ddbjest157.seq 98642 60193454 299000861 ddbjest158.seq 131206 81204206 299001622 ddbjest159.seq 134362 79432395 299001339 ddbjest160.seq 130370 82254046 299000635 ddbjest161.seq 139175 74384765 299000331 ddbjest162.seq 125858 72439433 299002275 ddbjest163.seq 131918 60940202 299000078 ddbjest164.seq 100394 55485028 299002468 ddbjest165.seq 80755 52242029 299002699 ddbjest166.seq 91931 79344195 299002079 ddbjest167.seq 99947 56883100 299001261 ddbjest168.seq 101212 59299678 299002220 ddbjest169.seq 96605 68778330 299001317 ddbjest170.seq 74437 41461471 299004135 ddbjest171.seq 58458 21939759 299001439 ddbjest172.seq 57184 20154987 299001453 ddbjest173.seq 57159 20672390 298999951 ddbjest174.seq 56486 22002978 299002427 ddbjest175.seq 56225 23081076 299004339 ddbjest176.seq 58320 19825470 299003569 ddbjest177.seq 58676 23188292 299000408 ddbjest178.seq 56526 24328039 299004087 ddbjest179.seq 55817 22873241 299002645 ddbjest180.seq 55875 23811993 299003412 ddbjest181.seq 56508 22752902 299001685 ddbjest182.seq 56237 23142574 299002098 ddbjest183.seq 53000 36323784 299000493 ddbjest184.seq 101938 36944741 299001311 ddbjest185.seq 104574 53700880 299003105 ddbjest186.seq 87441 56877402 299002905 ddbjest187.seq 88235 56503409 299000325 ddbjest188.seq 83729 52512880 299000091 ddbjest189.seq 96752 55022017 299003384 ddbjest190.seq 82008 59476273 299000856 ddbjest191.seq 79849 39315570 299001909 ddbjest192.seq 128524 60395389 299000824 ddbjest193.seq 100231 54933588 299001401 ddbjest194.seq 83220 40916865 299000076 ddbjest195.seq 90344 51941294 299001005 ddbjest196.seq 94149 45226839 298999976 ddbjest197.seq 90580 47171322 299000730 ddbjest198.seq 90695 54461353 299001207 ddbjest199.seq 82428 46673359 299000984 ddbjest200.seq 102104 53181959 299000931 ddbjest201.seq 97116 55284366 299000577 ddbjest202.seq 100738 70444381 299000756 ddbjest203.seq 130027 64510598 299000346 ddbjest204.seq 107669 56813743 299000924 ddbjest205.seq 91527 56911298 299002801 ddbjest206.seq 96374 46428938 299002732 ddbjest207.seq 97453 41611693 299000310 ddbjest208.seq 86369 49858143 299003504 ddbjest209.seq 75851 48533505 299003623 ddbjest210.seq 93011 55116669 299003197 ddbjest211.seq 80508 52619984 299001410 ddbjest212.seq 98896 50466122 299000169 ddbjest213.seq 101554 51746420 299003791 ddbjest214.seq 101288 64159430 299001231 ddbjest215.seq 144221 65182969 299001879 ddbjest216.seq 120758 55354266 299000393 ddbjest217.seq 88467 46093142 299001016 ddbjest218.seq 91469 59458105 299001331 ddbjest219.seq 98419 49047066 299001766 ddbjest220.seq 89988 54858513 299002119 ddbjest221.seq 81484 45060834 299000077 ddbjest222.seq 70856 40869183 299000342 ddbjest223.seq 89301 62445584 299003769 ddbjest224.seq 90313 54948719 299002536 ddbjest225.seq 86621 50039561 299002514 ddbjest226.seq 96524 52370247 299000210 ddbjest227.seq 94011 57135044 299002836 ddbjest228.seq 94897 58806439 299002974 ddbjest229.seq 86850 67989766 299001899 ddbjest230.seq 81418 57182835 299000110 ddbjest231.seq 80983 48780077 299001707 ddbjest232.seq 93208 58840581 299001146 ddbjest233.seq 90066 46016588 299003158 ddbjest234.seq 89813 48041543 299003290 ddbjest235.seq 79850 40681820 299001449 ddbjest236.seq 83884 53320478 299001460 ddbjest237.seq 103364 54953251 299002753 ddbjest238.seq 86323 51745692 299002721 ddbjest239.seq 88636 52387530 299000291 ddbjest240.seq 92040 58598759 299000331 ddbjest241.seq 108943 64349793 299001707 ddbjest242.seq 103560 56526786 298999983 ddbjest243.seq 82349 48662231 299003111 ddbjest244.seq 94263 48677919 299000660 ddbjest245.seq 67770 36099599 299003747 ddbjest246.seq 81908 46363815 299001470 ddbjest247.seq 101826 55545527 299001117 ddbjest248.seq 109477 55777640 299008165 ddbjest249.seq 94680 65620682 299000489 ddbjest250.seq 94735 57905932 299001600 ddbjest251.seq 78845 44333707 299002279 ddbjest252.seq 109337 48854751 299002773 ddbjest253.seq 79988 52610636 299000619 ddbjest254.seq 81181 50625952 299001752 ddbjest255.seq 129689 71877473 299000845 ddbjest256.seq 85999 48822315 299002898 ddbjest257.seq 94238 48008884 299000310 ddbjest258.seq 59932 36370198 299000035 ddbjest259.seq 77895 47678651 299000394 ddbjest260.seq 102638 54919919 299002397 ddbjest261.seq 96344 56555334 299001911 ddbjest262.seq 86842 48787716 299002925 ddbjest263.seq 85927 52503996 299001486 ddbjest264.seq 79591 47324362 299002510 ddbjest265.seq 100424 38007280 299001655 ddbjest266.seq 94374 44033277 299000294 ddbjest267.seq 87971 49549927 299000949 ddbjest268.seq 96679 53954953 299000020 ddbjest269.seq 89258 58545117 299000819 ddbjest270.seq 87670 52663691 299001449 ddbjest271.seq 85521 48764566 299001879 ddbjest272.seq 93371 57762825 299001024 ddbjest273.seq 104463 61794768 299001477 ddbjest274.seq 88171 54472889 299000918 ddbjest275.seq 75633 53937760 299000008 ddbjest276.seq 74245 52745564 299002195 ddbjest277.seq 84007 52960113 299000834 ddbjest278.seq 83965 50710445 299001357 ddbjest279.seq 75546 48082486 299002402 ddbjest280.seq 74066 51931317 299000379 ddbjest281.seq 77594 53785121 299001698 ddbjest282.seq 63803 45557681 299001763 ddbjest283.seq 97492 49939407 299003609 ddbjest284.seq 67078 52182146 299004268 ddbjest285.seq 86679 56094282 299001818 ddbjest286.seq 75753 46650755 299002412 ddbjest287.seq 138273 71364188 299000166 ddbjest288.seq 105622 65845574 299001158 ddbjest289.seq 85062 59424269 299003132 ddbjest290.seq 58059 49644121 299003304 ddbjest291.seq 60688 75310297 299002459 ddbjest292.seq 85977 57523630 299001550 ddbjest293.seq 91607 56830242 299000491 ddbjest294.seq 94997 41943046 299000948 ddbjest295.seq 91394 32659606 299000905 ddbjest296.seq 93059 34298550 299000713 ddbjest297.seq 95938 34387324 299000605 ddbjest298.seq 97896 34211152 299001458 ddbjest299.seq 88505 36987285 299001071 ddbjest300.seq 17923 5334959 46616927 ddbjgss1.seq 104246 75949965 299001249 ddbjgss2.seq 101454 70698239 299002236 ddbjgss3.seq 106262 59033289 299001523 ddbjgss4.seq 79842 68406005 299000415 ddbjgss5.seq 80477 67045905 299001300 ddbjgss6.seq 81263 65434735 299003233 ddbjgss7.seq 83843 67148474 299000333 ddbjgss8.seq 97846 75623437 299000567 ddbjgss9.seq 104115 54512507 299002121 ddbjgss10.seq 98236 75613291 299000219 ddbjgss11.seq 82231 68746542 299000641 ddbjgss12.seq 80529 72268523 299002671 ddbjgss13.seq 75208 65081691 299000477 ddbjgss14.seq 107262 45075149 299001984 ddbjgss15.seq 112481 46975884 299000233 ddbjgss16.seq 113778 50927196 299001380 ddbjgss17.seq 107051 52864699 299000341 ddbjgss18.seq 99701 52535961 299001226 ddbjgss19.seq 100060 49870908 299000801 ddbjgss20.seq 98005 49956410 298999933 ddbjgss21.seq 94179 47580581 299002026 ddbjgss22.seq 95613 54017846 299003169 ddbjgss23.seq 89828 46593130 299000271 ddbjgss24.seq 95695 50038443 299001702 ddbjgss25.seq 92617 61685390 299002916 ddbjgss26.seq 89945 41062787 299001109 ddbjgss27.seq 98215 57419928 299000221 ddbjgss28.seq 88272 40518726 299003088 ddbjgss29.seq 73009 37331832 299000189 ddbjgss30.seq 73782 35244569 299002334 ddbjgss31.seq 79328 41634114 299003496 ddbjgss32.seq 79916 40329586 299002554 ddbjgss33.seq 78129 47881131 299002363 ddbjgss34.seq 88656 37976179 299001915 ddbjgss35.seq 74353 33694811 299003573 ddbjgss36.seq 90233 43046103 299002787 ddbjgss37.seq 81634 44212977 299002167 ddbjgss38.seq 95289 52409451 299003026 ddbjgss39.seq 91562 55296670 299002966 ddbjgss40.seq 101026 52660087 299001230 ddbjgss41.seq 96553 51145380 299002499 ddbjgss42.seq 111323 73959729 299000597 ddbjgss43.seq 109003 62049477 299002599 ddbjgss44.seq 111298 67080195 299002571 ddbjgss45.seq 110219 46814228 299000480 ddbjgss46.seq 103735 54260417 299001710 ddbjgss47.seq 118160 78170425 299000632 ddbjgss48.seq 105460 48088573 299000832 ddbjgss49.seq 112633 77210210 299002505 ddbjgss50.seq 93706 63109977 299000809 ddbjgss51.seq 92343 60837095 299002957 ddbjgss52.seq 100823 52763444 299001314 ddbjgss53.seq 104102 66774208 299002452 ddbjgss54.seq 108857 85486785 299000085 ddbjgss55.seq 110372 63832219 299001892 ddbjgss56.seq 113778 74241477 299001861 ddbjgss57.seq 101849 61735335 299002649 ddbjgss58.seq 97921 58622743 299001144 ddbjgss59.seq 116204 58321591 299001787 ddbjgss60.seq 108800 59254486 299002115 ddbjgss61.seq 101945 81488121 299001994 ddbjgss62.seq 92950 99842709 299003008 ddbjgss63.seq 115988 85981348 299002307 ddbjgss64.seq 91250 59400343 299003247 ddbjgss65.seq 87484 61198946 299002009 ddbjgss66.seq 113661 85242269 299002051 ddbjgss67.seq 108577 67229959 299001151 ddbjgss68.seq 111975 72888828 299001877 ddbjgss69.seq 121969 72780536 299001091 ddbjgss70.seq 127406 69250651 299000622 ddbjgss71.seq 128170 68252507 299002222 ddbjgss72.seq 129529 66471418 299001742 ddbjgss73.seq 129552 66440218 299000296 ddbjgss74.seq 130267 65504531 299000974 ddbjgss75.seq 127648 68933594 299000928 ddbjgss76.seq 115507 86271817 299002538 ddbjgss77.seq 113594 89406820 299001601 ddbjgss78.seq 111411 88309020 299001887 ddbjgss79.seq 112965 81668170 299002634 ddbjgss80.seq 118660 45020656 299002287 ddbjgss81.seq 118809 40317147 299000370 ddbjgss82.seq 108534 63156867 299000067 ddbjgss83.seq 105172 64416445 299000460 ddbjgss84.seq 109491 75381271 299001529 ddbjgss85.seq 98894 100205481 299001583 ddbjgss86.seq 99641 77849562 299001389 ddbjgss87.seq 116844 62589049 299001352 ddbjgss88.seq 97987 53972744 298999953 ddbjgss89.seq 96652 78887011 299001787 ddbjgss90.seq 109731 64225439 299002427 ddbjgss91.seq 106868 76447848 299001784 ddbjgss92.seq 107444 73792374 299002092 ddbjgss93.seq 113083 69249051 299001494 ddbjgss94.seq 122889 91670193 299000657 ddbjgss95.seq 138988 86506002 299001550 ddbjgss96.seq 127639 77233326 299000446 ddbjgss97.seq 104704 63505771 299001593 ddbjgss98.seq 106798 66061121 299001564 ddbjgss99.seq 97603 54451501 299002037 ddbjgss100.seq 95168 58978421 299001891 ddbjgss101.seq 95306 58721190 299000219 ddbjgss102.seq 95313 58469196 299001464 ddbjgss103.seq 96624 60211414 299002329 ddbjgss104.seq 108098 73327548 299000038 ddbjgss105.seq 105083 69136129 299002551 ddbjgss106.seq 109748 65777159 299000247 ddbjgss107.seq 105222 79699048 299001702 ddbjgss108.seq 110495 57866868 299001133 ddbjgss109.seq 117205 66570301 299002528 ddbjgss110.seq 98919 66982519 299000883 ddbjgss111.seq 106891 81063892 299001382 ddbjgss112.seq 97106 58027490 299000968 ddbjgss113.seq 97526 53646014 299001667 ddbjgss114.seq 14129 7376059 42360799 ddbjhtc1.seq 37990 68269939 299006248 ddbjhtc2.seq 53823 61213385 299001201 ddbjhtc3.seq 82880 76279240 299001567 ddbjhtc4.seq 87337 94353259 299001642 ddbjhtc5.seq 110557 105380536 299003331 ddbjhtc6.seq 6988 17202394 41427026 ddbjhtg1.seq 1568 225539909 299111757 ddbjhtg2.seq 3363 221431537 299020636 ddbjhtg3.seq 3023 223621145 299084872 ddbjhtg4.seq 1911 223413959 299077745 ddbjhtg5.seq 1523 221613801 299259214 ddbjhtg6.seq 1497 221708555 299098773 ddbjhtg7.seq 1538 221196310 299091175 ddbjhtg8.seq 1332 225509415 299114954 ddbjhtg9.seq 1765 218855509 299311187 ddbjhtg10.seq 1139 228915248 299080685 ddbjhtg11.seq 899 229947867 299074228 ddbjhtg12.seq 883 229972521 299269312 ddbjhtg13.seq 967 229813925 299201272 ddbjhtg14.seq 920 229866989 299132241 ddbjhtg15.seq 2098 210557010 299087378 ddbjhtg16.seq 1274 224498524 299006549 ddbjhtg17.seq 1217 225933477 299129828 ddbjhtg18.seq 938 229476177 299144658 ddbjhtg19.seq 1115 227638164 299099174 ddbjhtg20.seq 1051 228241613 299303610 ddbjhtg21.seq 920 229698459 299084985 ddbjhtg22.seq 1003 229074218 299228617 ddbjhtg23.seq 1066 228293835 299024867 ddbjhtg24.seq 1127 227836081 299543428 ddbjhtg25.seq 1095 228344527 299122277 ddbjhtg26.seq 1090 228038791 299047921 ddbjhtg27.seq 1048 228841876 299211649 ddbjhtg28.seq 1159 227151246 299279780 ddbjhtg29.seq 1005 229193439 299070086 ddbjhtg30.seq 999 229455778 299220614 ddbjhtg31.seq 1107 227706701 299119026 ddbjhtg32.seq 1115 228240251 299057253 ddbjhtg33.seq 1124 227683593 299083142 ddbjhtg34.seq 1323 225388093 299029864 ddbjhtg35.seq 1414 224280154 299080165 ddbjhtg36.seq 1523 225135574 299202463 ddbjhtg37.seq 1386 224539195 299462967 ddbjhtg38.seq 1326 229123884 299201229 ddbjhtg39.seq 1503 228173178 299151980 ddbjhtg40.seq 1400 229076949 299074217 ddbjhtg41.seq 1412 228797964 299233705 ddbjhtg42.seq 1334 227540348 299118417 ddbjhtg43.seq 1330 227153260 299141126 ddbjhtg44.seq 1772 224481681 299128053 ddbjhtg45.seq 1622 226332374 299030051 ddbjhtg46.seq 1617 225954039 299183947 ddbjhtg47.seq 1608 226389719 299249725 ddbjhtg48.seq 1389 229501012 299280040 ddbjhtg49.seq 1393 228559288 299038293 ddbjhtg50.seq 1674 228946125 299041670 ddbjhtg51.seq 1207 231706214 299201351 ddbjhtg52.seq 1229 231538616 299136493 ddbjhtg53.seq 1184 227469959 299048358 ddbjhtg54.seq 158 15105331 19644492 ddbjhum1.seq 14675 190074835 299099964 ddbjhum2.seq 1647 214192804 299012831 ddbjhum3.seq 1612 222832426 299031118 ddbjhum4.seq 1415 216564702 299001260 ddbjhum5.seq 1468 215125352 299017555 ddbjhum6.seq 1491 213539619 299115307 ddbjhum7.seq 1556 208418688 299006296 ddbjhum8.seq 1682 216710746 299167694 ddbjhum9.seq 1564 213982282 299149897 ddbjhum10.seq 1847 214672824 299251946 ddbjhum11.seq 1938 213651990 299204682 ddbjhum12.seq 39201 160144216 299004451 ddbjhum13.seq 68569 95791023 299003420 ddbjhum14.seq 15726 182445592 299209542 ddbjhum15.seq 2782 220444436 299025622 ddbjhum16.seq 2081 222878440 299033609 ddbjhum17.seq 2521 220799828 299103073 ddbjhum18.seq 3991 216711467 299001598 ddbjhum19.seq 9257 210789671 299013567 ddbjhum20.seq 52342 95907984 299001852 ddbjhum21.seq 39053 128685806 299004788 ddbjhum22.seq 58042 97047194 242585140 ddbjinv1.seq 15788 199259609 299109783 ddbjinv2.seq 1067 195391875 299230514 ddbjinv3.seq 79896 97733986 299004865 ddbjinv4.seq 68340 104663208 299001435 ddbjinv5.seq 89637 88306217 299001169 ddbjinv6.seq 28816 97275381 212811090 ddbjmam.seq 74035 133779188 319969040 ddbjpat1.seq 253014 87770034 299000052 ddbjpat2.seq 212254 101235929 299000934 ddbjpat3.seq 183950 117525640 299003214 ddbjpat4.seq 163812 98837651 299000359 ddbjpat5.seq 129076 131584870 299000357 ddbjpat6.seq 159824 105523890 299026675 ddbjpat7.seq 151525 108102619 299000532 ddbjpat8.seq 176153 68568585 298999946 ddbjpat9.seq 124850 74913989 299001457 ddbjpat10.seq 134984 61084186 299000134 ddbjpat11.seq 135393 77230733 299000296 ddbjpat12.seq 172012 57663943 299000975 ddbjpat13.seq 174227 83372009 299000181 ddbjpat14.seq 131401 129425348 299000331 ddbjpat15.seq 143793 117180135 299119796 ddbjpat16.seq 122440 125007050 299001643 ddbjpat17.seq 155179 80611712 299000297 ddbjpat18.seq 28935 7232882 30038326 ddbjphg.seq 2570 16071646 39919153 ddbjpln1.seq 32920 155471564 299009694 ddbjpln2.seq 1795 217911670 299046426 ddbjpln3.seq 51950 130816616 299000285 ddbjpln4.seq 87292 94627763 299001229 ddbjpln5.seq 75593 60551957 299007057 ddbjpln6.seq 32189 97065922 299219012 ddbjpln7.seq 1601 201470107 299014957 ddbjpln8.seq 1492 225983866 318858088 ddbjpln9.seq 8 248890531 315329420 ddbjpln10.seq 49584 128818875 299000761 ddbjpln11.seq 91934 92217074 299001162 ddbjpln12.seq 53430 123326182 300909159 ddbjpln13.seq 53570 105540163 267684176 ddbjpri.seq 35211 284770812 435041366 ddbjrod1.seq 8760 205791443 299234213 ddbjrod2.seq 1100 207420483 299006140 ddbjrod3.seq 1084 208427527 299253642 ddbjrod4.seq 1106 209429291 299055031 ddbjrod5.seq 1177 215599570 299222061 ddbjrod6.seq 1175 216734820 299061403 ddbjrod7.seq 1201 218028475 299071566 ddbjrod8.seq 1191 219252689 299187509 ddbjrod9.seq 1194 220424232 299080746 ddbjrod10.seq 26884 184219632 299001745 ddbjrod11.seq 10364 210126359 299144791 ddbjrod12.seq 1437 231531456 299129813 ddbjrod13.seq 19248 201062865 299000432 ddbjrod14.seq 33839 110472811 299126399 ddbjrod15.seq 38141 91783335 200251641 ddbjsts1.seq 96827 44782888 299000538 ddbjsts2.seq 94949 47501590 299000758 ddbjsts3.seq 96182 40610909 299000772 ddbjsts4.seq 64660 38071716 299002888 ddbjsts5.seq 64936 37743788 299003451 ddbjsts6.seq 64660 38139082 299004410 ddbjsts7.seq 66755 46216037 299002313 ddbjsts8.seq 68574 52485627 299001472 ddbjsts9.seq 68567 52201994 299001860 ddbjsts10.seq 77946 48131407 299001478 ddbjsts11.seq 102008 43599717 299000751 ddbjsts12.seq 8187 2551991 17552986 ddbjsyn.seq 22364 31683431 101263949 ddbjuna.seq 256 136155 521237 ddbjvrl1.seq 86888 77043906 299108490 ddbjvrl2.seq 85860 77613161 299002541 ddbjvrl3.seq 84350 83675922 299001721 ddbjvrl4.seq 31516 29496899 109594377 ddbjvrt1.seq 73549 117563686 299002342 ddbjvrt2.seq 22405 192058263 299001483 ddbjvrt3.seq 75623 79858891 299009228 ddbjvrt4.seq 10004 195552702 299046617 ddbjvrt5.seq 1541 231004913 299004590 ddbjvrt6.seq 11522 215099607 299048442 ddbjvrt7.seq 35947 147689930 261234202 ------------------------------------------------------------------------------ Total 45249444 49158155283 172497893036 ddbjtpa.seq 4560 14648024 32426168 ddbjcon.seq 459013 0 1164292145 The entries and bases in the CON division and TPA dataset are not counted in the numbers given on the top of the release note or 'Total' on the above table.