DNA Data Bank of Japan
DNA Database Release 61, Mar. 2005, including
43,118,204 entries, 47,099,081,750 bases

--------------------------------------------------------------------------------
DDBJ release 61 revised on Apr. 28, 2005
--------------------------------------------------------------------------------

An inconsistency was found in entry AY732571 of DDBJ release 61 (released in
Mar. 2005).

Nature of inconsistency:
   The description in the "ORGANISM" line of AY732571 was incorrect.

Measures implemented:
   AY732571 was corrected and released again on April 28.

Corrected file:
   ddbjvrt3.seq

We at DDBJ regret our mistake.

--------------------------------------------------------------------------------
Table of contents
--------------------------------------------------------------------------------
1. Introduction
   1.1. Announcement for changes in the present release
   1.2. Announcement for the forthcoming changes
2. DDBJ flat file format
   2.1. LOCUS line
   2.2. DEFINITION line
   2.3. ACCESSION line
   2.4. VERSION line
   2.5. KEYWORDS line
   2.6. SOURCE line
   2.7. REFERENCE line
   2.8. COMMENT line
   2.9. FEATURES line
   2.10. BASE COUNT line
   2.11. ORIGIN line
3. Dataset categories
   3.1. Division categories
   3.2. TPA separated from primary dataset
   3.3. Notice for patented data
4. DDBJ staff
5. Acknowledgment
6. File Contents
   6.1. File categories
   6.2. File types
7. Sample of the contents in each file
   7.1. Part of the contents in the file 'ddbjbct1.seq'
   7.2. Part of the contents in the file 'ddbjbct1.acc'
   7.3. Part of the contents in the file 'ddbjbct1.aut'
   7.4. Part of the short directory in DDBJ style in the file 'ddbjbct1.dir'
   7.5. Part of the contents in the file 'ddbjbct1.idx'
   7.6. Part of the contents in the file 'ddbjbct1.jou'
   7.7. Part of the contents in the file 'ddbjbct1.key'
   7.8. Part of the contents in the file 'ddbjbct1.org'
   7.9. Part of the short directory file in DDBJ style in the file
        'ddbjbct1.sdr'
   7.10. Part of the accession number index file in the 'ddbjacc.idx'
   7.11. Part of the keyword phrase index file in the 'ddbjkey.idx'
   7.12. Part of the journal citation index file in 'ddbjjou.idx'
   7.13. Part of the gene name index file in 'ddbjgen.idx'
8. Release history
9. File list
--------------------------------------------------------------------------------

1. Introduction

This is the International Nucleotide Sequence Database (INSD). The database
contains nucleotide sequence data for any organism, not only those with DNA
genomes but also those with RNA genomes.

This database may be copied and redistributed without permission on the
condition that all the statements in this release note are reproduced in each
copy. See also '3.3. Notice for patented data' below.

The present release contains the newest data prepared by the DNA Data Bank of
Japan (DDBJ), GenBank (*), and European Molecular Biology Laboratory/European
Bioinformatics Institute (EMBL/EBI) as of February 24, 2005. This unified
database was made possible by the international collaboration among the three
data banks. All the entries have accordingly been annotated using the feature
keys common to them.

(*) 'GenBank' is a trademark of NIH, USA; the database is operated by the
    National Center for Biotechnology Information (NCBI) at NIH.

1.1. Announcement for changes in the present release

The style of the release note (this file) has been changed substantially.

Starting with the present release, some entries use a sequential (range) format
for secondary accession numbers in the ACCESSION line, which shortens long runs
of consecutive secondary accession numbers. For example:

------------------------------------------------------------------------------
Before:
ACCESSION   AB000802 D85885 D85886 D85887
After:
ACCESSION   AB000802 D85885-D85887
------------------------------------------------------------------------------

See also '2.3. ACCESSION line' below.

1.2. Announcement for the forthcoming changes

A new division, ENV, for sequences obtained via environmental sampling methods
will be introduced in the next DDBJ release. This new division will include
those sequences for which the source organism is unknown, or can only be
inferred by sequence comparison.

2. DDBJ flat file format

The database is a collection of "entries"; an entry is the unit of data.
Entries submitted to the data banks are processed and released in the DDBJ
format for distribution (the flat file). The flat file includes the sequence
together with information on the submitters, references, source organism,
"features", and so on. The items of the DDBJ flat file are explained below,
using the following example entry:

--------------------------------------------------------------------------------
LOCUS       AB000000                 450 bp    mRNA    linear   HUM 08-JUL-2002
DEFINITION  Homo sapiens GAPD mRNA for glyceraldehyde-3-phosphate dehydrogenase,
            partial cds.
ACCESSION   AB000000
VERSION     AB000000.1
KEYWORDS    .
SOURCE      Homo sapiens
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 450)
  AUTHORS   Mishima,H. and Shizuoka,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-NOV-2000) to the DDBJ/EMBL/GenBank databases.
            Hanako Mishima, National Institute of Genetics, DNA Data Bank of
            Japan; Yata 1111, Mishima, Shizuoka 411-8540, Japan
            (E-mail:mishima@supernig.nig.ac.jp, Tel:81-55-981-6853,
            Fax:81-55-981-6849)
REFERENCE   2  (sites)
  AUTHORS   Mishima,H., Shizuoka,T. and Fuji,I.
  TITLE     Glyceraldehyde-3-phosphate dehydrogenase expressed in human liver
  JOURNAL   Unpublished (2002)
COMMENT     Human cDNA sequencing project.
FEATURES             Location/Qualifiers
     source          1..450
                     /chromosome="12"
                     /clone="GT200015"
                     /clone_lib="lambda gt11 human liver cDNA (GeneTech.
                     No.20)"
                     /map="12p13"
                     /mol_type="mRNA"
                     /organism="Homo sapiens"
                     /tissue_type="liver"
     CDS             86..>450
                     /codon_start=1
                     /gene="GAPD"
                     /product="glyceraldehyde-3-phosphate dehydrogenase"
                     /protein_id="BAA12345.1"
                     /transl_table=1
                     /translation="MAKIKIGINGFGRIGRLVARVALQSDDVELVAVNDPFITTDYMT
                     YMFKYDTVHGQWKHHEVKVKDSKTLLFGEKEVTVFGCRNPKEIPWGETSAEFVVEYTG
                     VFTDKDKAVAQLKGGAKKV"
BASE COUNT        102 a      119 c      131 g       98 t
ORIGIN
        1 cccacgcgtc cggtcgcatc gcacttgtag ctctcgaccc ccgcatctca tccctcctct
       61 cgcttagttc agatcgaaat cgcaaatggc gaagattaag atcgggatca atgggttcgg
      121 gaggatcggg aggctcgtgg ccagggtggc cctgcagagc gacgacgtcg agctcgtcgc
      181 cgtcaacgac cccttcatca ccaccgacta catgacatac atgttcaagt atgacactgt
      241 gcacggccag tggaagcatc atgaggttaa ggtgaaggac tccaagaccc ttctcttcgg
      301 tgagaaggag gtcaccgtgt tcggctgcag gaaccctaag gagatcccat ggggtgagac
      361 tagcgctgag tttgttgtgg agtacactgg tgttttcact gacaaggaca aggccgttgc
      421 tcaacttaag ggtggtgcta agaaggtctg
//
--------------------------------------------------------------------------------

2.1. LOCUS line

The format of the LOCUS line in the flat file is shown below:

---------  --------
Positions  Contents
---------  --------
01-05      'LOCUS'
06-12      spaces
13-28      Locus name
29-29      space
30-40      Length of sequence, right-justified
41-41      space
42-43      'bp'
44-44      space
45-47      spaces, ss- (single-stranded), ds- (double-stranded), or
           ms- (mixed-stranded)
48-53      DNA, RNA, mRNA, pre-RNA, rRNA, scRNA, snRNA, snoRNA, tRNA.
           Left justified.
54-55      spaces
56-63      'linear' followed by two spaces, or 'circular'
64-64      space
65-67      The division code (see '3.1. Division categories')
68-68      space
69-79      Date, in the form dd-MMM-yyyy (e.g., 08-JUL-2002)
------------------------------------------------------------------------------

2.2. DEFINITION line

The definition briefly describes the gene(s) contained in the entry. The
DEFINITION line is composed by each of the three data banks.

2.3. ACCESSION line

This line shows the accession number(s) of the entry.
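
The sequential format for secondary accession numbers announced in section 1.1 can be turned back into a plain list with a few lines of code. The following is only a minimal sketch: the function name `expand_accession_field` is hypothetical, and the sketch assumes that a range such as D85885-D85887 always joins two accession numbers with the same letter prefix and the same number of digits, as in the example of section 1.1. Tokens that do not fit this assumption are passed through unchanged.

```python
import re

# Loose pattern for one accession number: one or two capital letters
# followed by five or six digits (assumption based on the examples
# A12345 and AB123456 in this release note).
_ACC = re.compile(r"^([A-Z]{1,2})(\d{5,6})$")


def expand_accession_field(field):
    """Expand range tokens such as 'D85885-D85887' into single accessions.

    'field' is the text of an ACCESSION line without the leading keyword.
    """
    expanded = []
    for token in field.split():
        first, sep, last = token.partition("-")
        m1, m2 = _ACC.match(first), _ACC.match(last)
        if not sep or not m1 or not m2:
            # Not a range (or not recognisable): keep the token as it is.
            expanded.append(token)
            continue
        prefix, lo, hi = m1.group(1), m1.group(2), m2.group(2)
        if m2.group(1) != prefix or len(lo) != len(hi) or int(hi) < int(lo):
            # Outside the assumption stated above: do not guess.
            expanded.append(token)
            continue
        width = len(lo)
        expanded.extend(
            f"{prefix}{n:0{width}d}" for n in range(int(lo), int(hi) + 1)
        )
    return expanded


print(expand_accession_field("AB000802 D85885-D85887"))
# prints ['AB000802', 'D85885', 'D85886', 'D85887']
```

The first element of the returned list is the primary accession number; the remaining elements are the secondary accession numbers.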
A unique accession number is issued to the data submitter by each of the three
data banks. An accession number consists of one letter followed by five digits
(e.g., A12345) or two letters followed by six digits (e.g., AB123456). The
former style was used in the 1980s; the latter was introduced later because of
the rapid growth in the amount of data.

All the entries whose accession numbers carry one of the prefixes given below
have been collected and processed by DDBJ; the rest have been handled by
GenBank and EMBL/EBI.

-------------------------------------------------------------------------------
C, D, E, AB, AG, AK, AP, AT, AU, AV, BA, BB, BD, BJ, BP, BR, BS, BW, BY, CJ, DD
-------------------------------------------------------------------------------

If multiple entries are merged into one entry, or if an entry is extensively
modified after submission, the responsible data bank may assign a new accession
number to it. In these cases, the new accession number is called the primary
accession number, and the old accession number(s) are called the secondary
accession number(s). In the flat file, the primary accession number is given
first, followed by the secondary accession number(s). The same updated entry
can be found with either the primary or a secondary accession number.

2.4. VERSION line

This line consists of an accession number and a version number, as in
"AB123456.1", where the digit(s) after the period are the version number. Data
released to the public for the first time have version number 1. The VERSION
line was introduced because a released sequence is sometimes revised by the
submitter, so the accession number alone cannot identify a particular version
of the sequence. The version number is increased by one every time a revised
sequence is made public.

2.5. KEYWORDS line

The data banks fill in this line when necessary. In many cases it contains the
category of the data (EST, HTG, etc.), gene names, and product names.

2.6. SOURCE line

This line shows the scientific name of the organism from which the sequence was
obtained, and an organelle type if the sequence is derived from an organelle
other than the nucleus.

2.7. REFERENCE line

Information on the submitters and on references related to the submitted
sequence is given in the REFERENCE line.

2.8. COMMENT line

This line holds information about an entry that cannot be described using
FEATURES or the other fields.

2.9. FEATURES line

Biological features of a submitted sequence are described with a "Feature" key
(the biological nature of the annotated feature), a "Location" (the region of
the sequence to which the feature corresponds), and "Qualifiers" (supplementary
information about the feature). The "Feature" and "Qualifier" keys used in the
present release are defined by the DDBJ/EMBL/GenBank Feature Table: Definition
(Ver. 6.2, Oct. 15, 2004). The document is updated every half year. The newest
version is available at:
http://www.ddbj.nig.ac.jp/FT/full_index.html

2.10. BASE COUNT line

In the BASE COUNT line of the DDBJ flat file, nine digits are allocated for
each of the counts of a (adenine), c (cytosine), g (guanine), and t (thymine).
In an RNA sequence, uracil is written as "t" according to the rule of the
international nucleotide sequence databases.

In accordance with the relaxation of the sequence length limitation, GenBank
dropped the BASE COUNT line from its flat file format as of GenBank Release 138
(Oct. 2003). DDBJ has decided to keep the BASE COUNT line in its flat file
format, because GC content remains important information for characterizing a
sequence.

2.11. ORIGIN line

The sequence data start on the line following ORIGIN. The sequence is written
in lower-case letters, with a space after every 10 bases and a new line after
every 60 bases. The number at the left of each line is the position of the
first base on that line.

3. Dataset categories

There have been a number of genome projects going on worldwide.
Among them, the human genome projects have probably been the most productive,
yielding a large number of ordinary sequences and huge amounts of genome
sequences and ESTs (expressed sequence tags). DDBJ therefore has the human
(HUM) division solely for human sequences and the primate (PRI) division for
non-human primate sequences, whereas the PRI division of the GenBank database
contains human sequences too. Note that other divisions such as EST, GSS, and
HTC may also contain human sequences.

The present release is divided into 20 categories of organisms and others. See
also '6.1. File categories' and '9. File list' below. The contents of the 20
categories are described in the following.

3.1. Division categories

The first 19 divisions are given below:

HUM; human
PRI; primates (other than human)
ROD; rodents
MAM; mammals (other than primates and rodents)
VRT; vertebrates (other than mammals)
INV; invertebrates (animals other than vertebrates)
PLN; plants, fungi, plastids (eukaryotes other than animals)
BCT; bacteria (including both Eubacteria and Archaea)
VRL; viruses
PHG; bacteriophages
SYN; synthetic constructs
EST; expressed sequence tags; short single pass cDNA sequences
GSS; genome survey sequences; short single pass genomic sequences
HTC; high throughput cDNA sequences
     Sequences submitted from full length cDNA sequencing projects. This
     division holds unfinished high throughput cDNA sequences, each of which
     has a 5'UTR and a 3'UTR at its ends and part of a coding region. The
     sequence may also include introns. When the sequence is finished later,
     it is moved to the corresponding taxonomic division.
HTG; high throughput genomic sequences
     Sequences submitted mainly from genome sequencing projects that treat a
     clone as the sequencing unit.
STS; sequence tagged sites
     Tag sites for genome sequencing. The information on chromosome, map, and
     PCR_condition is mandatory for this division.
PAT; patented data
     Data submitted to the JPO (Japan Patent Office), EPO (European Patent
     Office), or USPTO (United States Patent and Trademark Office). See also
     '3.3. Notice for patented data' below.
UNA; data not annotated
     The UNA division is not used for recently submitted sequences.
CON; Contig / Constructed
     To join a series of entries, such as those submitted from a genome
     project, each of the data banks constructs an entry and assigns an
     accession number to the large-scale sequence dataset. Such entries are
     classified into the CON division. An entry in the CON division contains
     the accession numbers of the joined entries instead of sequence data. The
     entries joined by a CON entry have been submitted to other divisions.
     The entries and bases in the CON division are not counted in the release
     totals given at the top of this release note.

3.2. TPA separated from primary dataset

TPA (Third Party Annotation) data are also available. The TPA data complement
the existing DDBJ/EMBL/GenBank comprehensive database of primary nucleotide
sequences, which typically result from direct sequencing of cDNAs, ESTs,
genomic DNAs, etc. Primary entries are defined as data for which the submitting
group has done the sequencing and annotation and, as 'owner' of these data, has
the privilege of submitting updates, corrections, etc. Primary entries used to
build a TPA sequence are those that have been experimentally determined and are
publicly available in the DDBJ/EMBL/GenBank databases. They may not be from a
proprietary database.

The entries and bases in TPA are not counted in the release totals given at the
top of this release note.

3.3. Notice for patented data

This release includes the PAT division for patented data, as described above.
The patented data are those which the Japan Patent Office (JPO), the United
States Patent and Trademark Office (USPTO), and the European Patent Office
(EPO) collected, processed, and released.
The prefixes of accession numbers for the patented data are shown below:

-----------------------
JPO  : E, BD, DD
USPTO: I, AR
EPO  : A, AX, CQ, CS
-----------------------

Note also that unauthorized use of the patented data may cause legal issues,
for which DDBJ takes no responsibility.

4. DDBJ staff

This release is published by the following DDBJ staff.

Gojobori T, Tateno Y, Nishikawa K, Sugawara H, Saitou N, Okubo K, Ikeo K,
Suzuki Y, Fukuchi S, Kinjo A, Itoh K, Barrero R, Abe T, Aono H, Atsumi T,
Ejima M, Endo N, Fukuda D, Gojobori M, Hikino Y, Hirai T, Hoshi N, Ichikawa K,
Ishida K, Ishizaka N, Kato T, Kawamoto T, Kohira J, Koike T, Kosuge T,
Kusakabe A, Kuwana Y, Lin Y, Maesako H, Mamiya K, Maruyama N, Mashima J,
Mimura K, Min H-J, Miyazawa S, Murakata N, Nagira S, Nagura M, Nishinomiya N,
Okido T, Sakai K, Shigemoto Y, Shiozawa H, Sugiyama F, Suzuki M, Suzuki S,
Tsuboi M, Tsutsui H, Yamamoto M, and Yokoyama E

Center for Information Biology and DNA Data Bank of Japan
National Institute of Genetics
Research Organization of Information and Systems
Mishima 411-8540, Japan

Phone:  +81 55 981 6853
FAX:    +81 55 981 6849
E-mail: ddbj@ddbj.nig.ac.jp     (for general inquiry)
        ddbjsub@ddbj.nig.ac.jp  (for data submission)
        ddbjupdt@ddbj.nig.ac.jp (for updates and notification of publication)
WWW:    http://www.ddbj.nig.ac.jp/   (for DDBJ WWW server)
        http://sakura.ddbj.nig.ac.jp/ (for DDBJ sequence data submission system)

5. Acknowledgment

We are grateful to NCBI and EMBL/EBI for their firm friendship and excellent
collaboration. We also thank the Japan Patent Office for its steady
cooperation. The operation of DDBJ is supported by the Ministry of Education,
Culture, Sports, Science and Technology, which we gratefully acknowledge here.

6. File Contents

6.1. File categories

This release covers 20 categories of organisms and others, as follows:

------------------------------------------------------------------------------
ddbjbct***   Category for bacteria
ddbjcon***   Category for CON (contig sequences)
ddbjest***   Category for EST (expressed sequence tag)
ddbjgss***   Category for GSS (genome survey sequence)
ddbjhtc***   Category for HTC (high throughput cDNA)
ddbjhtg***   Category for HTG (high throughput genomic sequence)
ddbjhum***   Category for human
ddbjinv***   Category for invertebrates
ddbjmam***   Category for mammals other than primates and rodents
ddbjpat***   Category for patents
ddbjphg***   Category for phages
ddbjpln***   Category for plants
ddbjpri***   Category for primates other than human
ddbjrod***   Category for rodents
ddbjsts***   Category for STS (sequence tagged site)
ddbjsyn***   Category for synthetic DNAs
ddbjtpa***   Category for TPA (Third Party Annotation)
ddbjuna***   Category for unannotated sequences
ddbjvrl***   Category for viruses
ddbjvrt***   Category for vertebrates other than mammals
------------------------------------------------------------------------------

In the present release, some of the above categories are recorded in multiple
ddbj***.seq files, each with a storage capacity of 300 MB. The number of files
for each of these categories is as follows:

---------------------
ddbjbct :  10 files
ddbjest : 287 files
ddbjgss : 110 files
ddbjhtc :   6 files
ddbjhtg :  53 files
ddbjhum :  22 files
ddbjinv :   6 files
ddbjpat :  17 files
ddbjpln :  13 files
ddbjrod :  14 files
ddbjsts :   8 files
ddbjvrl :   4 files
ddbjvrt :   7 files
---------------------

6.2. File types

The index files are not provided in this release, except for ddbjacc.idx,
ddbjgen.idx, ddbjjou.idx, and ddbjkey.idx. Instead, we have included a program
with which the index files not provided in this release can be generated. For
the use of the program, see the files seq2indexes.doc, seq2indexes.c, and
seq2indexes.h in this release.

Each category then has the following nine types of files.
Note that all the files except ddbj***.seq are to be created by the user with
seq2indexes, as mentioned above.

------------------------------------------------------------------------------
ddbj***.seq   List of an entry in DDBJ format, see 7.1.
ddbj***.acc   List of the accession numbers, see 7.2.
ddbj***.aut   List of the authors, see 7.3.
ddbj***.dir   List of the short directory in DDBJ style, see 7.4.
ddbj***.idx   List of indices, see 7.5.
ddbj***.jou   List of the journals, see 7.6.
ddbj***.key   List of the key words, see 7.7.
ddbj***.org   List of the species names, see 7.8.
ddbj***.sdr   List of the short directory in DDBJ style, see 7.9.
------------------------------------------------------------------------------

7. Sample of the contents in each file

7.1. Part of the contents in the file 'ddbjbct1.seq'

This shows all pieces of information on one entry in DDBJ format.

------------------------------------------------------------------------------
LOCUS       D87069                   993 bp    mRNA    linear   BCT 14-APR-2000
DEFINITION  Escherichia coli mRNA for RNA polymerase sigma subunit, truncated
            form of sigma-38, complete cds.
ACCESSION   D87069
VERSION     D87069.1
KEYWORDS    RNA polymerase sigma subunit, truncated form of sigma-38.
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 993)
  AUTHORS   Jishage,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-AUG-1996) to the DDBJ/EMBL/GenBank databases.
            Miki Jishage, National Institute of Genetics, Molecular Genetics;
            Yata 1111, Mishima, Shizuoka 411, Japan
            (E-mail:mjishage@lab.nig.ac.jp, Tel:0559-81-6742,
            Fax:0559-81-6746)
REFERENCE   2  (bases 1 to 993)
  AUTHORS   Jishage,M. and Ishihama,A.
  TITLE     Variation in RNA polymerase sigma subunit composition within
            different stocks of Escherichia coli starin W3110
  JOURNAL   Unpublished (1996)
REFERENCE   3
  AUTHORS   Ivanova,A., Renshaw,M., Guntaka,R. and Eisenstark,A.
  TITLE     DNA base sequence variability in katF (putative sigma factor) gene
            Escherichia coli
  JOURNAL   Nucleic Acids Res. 20, 5479-5480 (1992)
REFERENCE   4
  AUTHORS   Takayanagi,Y., Tanaka,K. and Takahashi,H.
  TITLE     Structure of the 5' upstream region and the regulation of the rpoS
            gene of Escherichia coli
  JOURNAL   Mol. Gen. Genet. 243, 525-531 (1994)
COMMENT
FEATURES             Location/Qualifiers
     source          1..993
                     /mol_type="mRNA"
                     /organism="Escherichia coli"
                     /strain="W3110"
     CDS             1..810
                     /note="the gene has four single base changes, resulting in
                     two amino acid substitutions and an amber mutation"
                     /product="RNA polymerase sigma subunit, truncated form of
                     sigma-38"
                     /protein_id="BAA13238.1"
                     /transl_table=11
                     /translation="MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEYEPSDNDLAEEE
                     LLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLV
                     VKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMN
                     QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNER
                     ITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAK"
     variation       75
                     /citation=[3]
                     /replace="t"
     variation       97
                     /citation=[3]
                     /replace="t"
     variation       99
                     /citation=[3]
                     /replace="t"
     variation       808
                     /citation=[3]
                     /replace="t"
BASE COUNT        254 a      223 c      291 g      225 t
ORIGIN
        1 atgagtcaga atacgctgaa agttcatgat ttaaatgaag atgcggaatt tgatgagaac
       61 ggagttgagg tttttgacga aaaggcctta gtagaatatg aacccagtga taacgatttg
      121 gccgaagagg aactgttatc gcagggagcc acacagcgtg tgttggacgc gactcagctt
      181 taccttggtg agattggtta ttcaccactg ttaacggccg aagaagaagt ttattttgcg
      241 cgtcgcgcac tgcgtggaga tgtcgcctct cgccgccgga tgatcgagag taacttgcgt
      301 ctggtggtaa aaattgcccg ccgttatggc aatcgtggtc tggcgttgct ggaccttatc
      361 gaagagggca acctggggct gatccgcgcg gtagagaagt ttgacccgga acgtggtttc
      421 cgcttctcaa catacgcaac ctggtggatt cgccagacga ttgaacgggc gattatgaac
      481 caaacccgta ctattcgttt gccgattcac atcgtaaagg agctgaacgt ttacctgcga
      541 accgcacgtg agttgtccca taagctggac catgaaccaa gtgcggaaga gatcgcagag
      601 caactggata agccagttga tgacgtcagc cgtatgcttc gtcttaacga gcgcattacc
      661 tcggtagaca ccccgctggg tggtgattcc gaaaaagcgt tgctggacat cctggccgat
      721 gaaaaagaga acggtccgga agataccacg caagatgacg atatgaagca gagcatcgtc
      781 aaatggctgt tcgagctgaa cgccaaatag cgtgaagtgc tggcacgtcg attcggtttg
      841 ctggggtacg aagcggcaac actggaagat gtaggtcgtg aaattggcct cacccgtgaa
      901 cgtgttcgcc agattcaggt tgaaggcctg cgccgtttgc gcgaaatcct gcaaacgcag
      961 gggctgaata tcgaagcgct gttccgcgag taa
//
------------------------------------------------------------------------------

7.2. Part of the contents in the file 'ddbjbct1.acc'

The first column gives the secondary accession number, the second column the
locus name, and the third the primary accession number. The primary number may
be the same as the secondary number. The lines are arranged in ascending order
of the secondary accession numbers.

------------------------------------------------------------------------------
D00681 -> AB028210 AB028210
D10012 -> AB010832 AB010832
D10013 -> AB010832 AB010832
D10048 -> AB008452 AB008452
D13563 -> AB018435 AB018435
D13614 -> AB006206 AB006206
D13762 -> AB063629 AB063629
D14537 -> AB040412 AB040412
D14604 -> AB001637 AB001637
D14607 -> AB027308 AB027308
------------------------------------------------------------------------------

7.3. Part of the contents in the file 'ddbjbct1.aut'

For each author name given to the left of the arrow, the corresponding locus
name and primary accession number are listed, in that order, on the right. The
lines are arranged in alphabetical order of the author names.

------------------------------------------------------------------------------
Aarestrup,F.M.Threlfall,E.J. -> AF393510 AF393510
Aarnikunnas,J. -> AY090766 AY090766
Aarnio,T. -> AY792975 AY792975
Aarnio,T. -> AY792976 AY792976
Aarnio,T. -> AY792977 AY792977
Aarnio,T. -> AY792978 AY792978
Aarnio,T. -> AY792979 AY792979
Aarnio,T. -> AY792980 AY792980
Aarnio,T. -> AY792981 AY792981
Aarnio,T. -> AY792982 AY792982
------------------------------------------------------------------------------

7.4.
Part of the short directory in DDBJ style in the file 'ddbjbct1.dir' For each locus name given in the first column, the corresponding primary accession number, molecular type, number of nucleotide pairs, and description for the locus are respectively listed. They are arranged in the alphabetical order of the locus names. ------------------------------------------------------------------------------ AAC133631 AJ133631 DNA 1482 Alicyclobacillus acidoterrestris 16S rRNA gen e, strain DSM 3922T. AAC133789 AJ133789 DNA 3097 Alicyclobacillus acidocaldarius cyclomaltodex trinase gene region. AAC243194 AJ243194 DNA 1720 Alicyclobacillus acidocaldarius kdpA gene. AAC252160 AJ252160 DNA 1638 Alicyclobacillus acidocaldarius cysA gene for putative ABC-transporter ATP-binding protein. AAC252161 AJ252161 DNA 8690 Alicyclobacillus acidocaldarius maltose/malto dextrine transport gene region (malEFGR genes, cdaA gene and glcA gene). AAC289685 AJ289685 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain CCUG13227 T (ATCC33384, NCTC9710). AAC289686 AJ289686 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain HK666. AAC289687 AJ289687 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain HK1662. AAC289694 AJ289694 DNA 453 Actinobacillus actinomycetemcomitans partial infB gene for translation initiation factor IF2, strain HK1651. ------------------------------------------------------------------------------ 7.5. Part of the contents in the file 'ddbjbct1.idx' The first column refers to the locus name, second column to the starting site of the locus in byte, and third to its ending site in byte. They are arranged in the alphabetical order of the locus names. 
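
The three columns described for 'ddbjbct1.idx' (locus name, starting byte, ending byte) lend themselves to direct extraction of a single entry. The sketch below is illustrative only: the function names are hypothetical, the leading '#' on record lines and the '%' banner line are taken from the sample excerpt, and the release note does not say which file the offsets refer to, nor whether they count from 0 or 1. The code assumes 0-based offsets with an inclusive end offset; adjust it if the files produced by seq2indexes behave differently.

```python
def parse_idx_line(line):
    """Return (locus, start, end) for a line like '#AAC133631 1125664347 1125667802'.

    Lines that do not carry a locus record, such as the '%****' banner line
    in the sample, yield None.
    """
    if not line.startswith("#"):
        return None
    parts = line[1:].split()
    if len(parts) != 3:
        return None
    locus, start, end = parts
    return locus, int(start), int(end)


def read_entry(seq_path, start, end):
    """Read the bytes from 'start' to 'end' of a sequence file.

    ASSUMPTION (not stated in the release note): offsets are 0-based and
    the end offset is inclusive.
    """
    with open(seq_path, "rb") as handle:
        handle.seek(start)
        return handle.read(end - start + 1)


record = parse_idx_line("#AAC252160 1146107711 1146112332")
print(record)
# prints ('AAC252160', 1146107711, 1146112332)
```

Opening the file in binary mode keeps the byte offsets exact regardless of the platform's newline conventions.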
------------------------------------------------------------------------------ %***************************** #AAC133631 1125664347 1125667802 #AAC133789 1125786300 1125794416 #AAC243194 1136871421 1136876093 #AAC252160 1146107711 1146112332 #AAC252161 1146112333 1146131350 #AAC289685 1155353186 1155356047 #AAC289686 1155356048 1155358781 #AAC289687 1155358782 1155361517 #AAC289694 1155377972 1155380768 #AAC308623 1167555889 1167560983 ------------------------------------------------------------------------------ 7.6. Part of the contents in the file 'ddbjbct1.jou' This gives information on the journal in which sequence data were published. ------------------------------------------------------------------------------ Yi Chuan Xue Bao 29 (12), 1111-1117 (2002) -> AY601668 AY601668 Yi Chuan Xue Bao 29 (12), 1111-1117 (2002) -> AY603420 AY603420 Yi Chuan Xue Bao 30 (4), 364-369 (2003) -> AF526379 AF526379 Yonsei Med. J. 39 (6), 520-525 (1998) -> AF373217 AF373217 Yonsei Med. J. 39 (6), 520-525 (1998) -> AF373218 AF373218 Z. Lebensm.-Unters. -Forsch., A Eur. Food Res. Technol. 209, 83-87(1999). -> ABA 7623 AJ007623 Z. Lebensm.-Unters. -Forsch., A Eur. Food Res. Technol. 209, 83-87(1999). -> ABA 7624 AJ007624 Zb. Bioteh. Fak. Univ. Ljubl. Kmet. Supl. 79(1), 107-113(2002). -> ASP012466 AJ 012466 Zb. Bioteh. Fak. Univ. Ljubl. Kmet. Supl. 79(1), 19-26(2002). -> ASP012466 AJ01 2466 Zentralbl. Bakteriol. 286 (1), 1-8 (1997) -> AF192489 AF192489 Zentralbl. Bakteriol. 290, 37-49(2000). -> LPN7311 AJ007311 Zentralbl. Bakteriol. 291, 345-352(2001). -> LPN300467 AJ300467 Zentralbl. Bakteriol. 292, 207-214(2002). 
-> ECO459584 AJ459584 Zhi Wu Bao Hu Xue Hui Hui Kan 44, 233-244 (2002) -> AF540014 AF540014 Zhi Wu Bing Li Xue Bao 34 (1), 80-85 (2004) -> AY447045 AY447045 Zhi Wu Bing Li Xue Hui Kan 12, 57-64 (2003) -> AF450275 AF450275 Zhiwu Baohuxue Hui Huikan 44, 185-208 (2002) -> AY122057 AY122057 Zhiwu Binglixue Huikan 6, 207-208 (1997) -> AF149819 AF149819 Zhongguo Jiaqin 25 (Supplement 1), 60-71 (2003) -> AY615358 AY615358 Zhongguo Kang Sheng Su Za Zhi 21, 94-97 (2003) -> AY293073 AY293073 Zhongguo Kang Sheng Su Za Zhi 28, 96-100 (2003) -> AY293071 AY293071 Zhongguo Kang Sheng Su Za Zhi 28, 96-100 (2003) -> AY293072 AY293072 Zhongguo Lin Chuang Yao Li Xue Za Zhi 19, 190-195 (2003) -> AY536040 AY536040 Zhongguo Nong Ye Ke Xue 36(1), 17-25 (2003) -> AY555768 AY555768 Zhonghua Liu Xing Bing Xue Za Zhi 24, 291-295 (2003) -> AY279003 AY279003 Zhonghua Min Guo Wei Sheng Wu Ji Mian Yi Xue Za Zhi (2003) In press -> AY299484 AY299484 Zhonghua Min Guo Wei Sheng Wu Ji Mian Yi Xue Za Zhi 22 (5), 489-492 (2002) -> AY 382160 AY382160 Zool. Sci. 14, 701-706 (1997) -> AB002286 AB002286 Zool. Sci. 14, 701-706 (1997) -> AB002287 AB002287 Zool. Sci. 14, 701-706 (1997) -> AB002288 AB002288 Zool. Sci. 14, 701-706 (1997) -> AB002289 AB002289 Zool. Sci. 14, 701-706 (1997) -> AB002290 AB002290 Zool. Sci. 14, 701-706 (1997) -> AB002291 AB002291 Zoolog. Sci. 17, 983-989 (2000) -> AB038366 AB038366 Zoolog. Sci. 17, 983-989 (2000) -> AB038367 AB038367 Zoolog. Sci. 17, 983-989 (2000) -> AB038368 AB038368 Zoolog. Sci. 17, 983-989 (2000) -> AB038369 AB038369 Zoolog. Sci. 17, 983-989 (2000) -> AB038370 AB038370 ------------------------------------------------------------------------------ 7.7. Part of the contents in the file 'ddbjbct1.key' For the locus and accession number respectively given on the right to the arrow, the corresponding keywords are listed on the left. ------------------------------------------------------------------------------ Synechococcus sp. 
DNA for intrinsic membrane protein, malK-like protein, cyanase , complete cds. -> AB000100 AB000100 cynS; cyanase; cynD; malK-like protein; cynB; intrinsic membrane protein. -> AB000100 AB000100 Direct Submission -> AB000100 AB000100 Identification and nitrogen regulation of the cyanase gene from the cyanobacteri a Synechocystis sp. strain PPC 6803 and Synechococcus sp. strain PPC 7942 -> AB000100 AB000100 Sequence updated (31-Mar-1997) by: Tatsuo Omata Sequence updated (14-Aug-1997) -> AB000100 AB000100 Sphingomonas sp. 16S ribosomal RNA. -> AB000106 AB000106 16S rRNA. -> AB000106 AB000106 Direct Submission -> AB000106 AB000106 Sphingomonas sp. VT1 16s rRNA -> AB000106 AB000106 Synechococcus sp. gene for ribosomal proteins, complete cds. -> AB000111 AB000111 tRNA pseudouridine synthase I; 50S ribosomal protein L17; DNA-dircted RNA polyme rase alpha chain; 30S Ribosomal Protein S11; 30S ribosomal protein S13; 50S ribo somal protein L36; adenylate kinase; preprotein translocase SecY subunit; 50S ri bosomal protein L15; 30S ribosomal protein S5; 50S ribosomal protein L18; 50S ri bosomal protein L6; 30S ribosomal protein S8; 50S ribosomal protein L5; 50S ribo somal protein L24; 50S ribosomal protein L14; 30S ribosomal protein S17; 50S rib osomal protein L29; 50S ribosomal protein L16; 30S ribosomal protein S3; 50S rib osomal protein L22; 30S ribosomal protein S19; 50S ribosomal protein L2; 50S rib osomal protein L23; 50S ribosomal protein L4; 50S ribosomal protein L3. -> AB000111 AB000111 Direct Submission -> AB000111 AB000111 ------------------------------------------------------------------------------ 7.8. Part of the contents in the file 'ddbjbct1.org' For the locus and accession number respectively given on the right to the arrow, the corresponding taxonomic names are listed on the left. They are arranged in the alphabetical order of the species names. 
------------------------------------------------------------------------------
'Flavobacterium' lutescens 'Flavobacterium' lutescens Bacteria; Proteobacteria;
    gamma subdivision; Pseudomonadaceae; Pseudomonas. -> AB035478 AB035478
'Flavobacterium' lutescens 'Flavobacterium' lutescens Bacteria; Proteobacteria;
    gamma subdivision; Pseudomonadaceae; Pseudomonas. -> AB042983 AB042983
'Fragaria multicipita' phytoplasma 'Fragaria multicipita' phytoplasma Bacteria;
    Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus
    Phytoplasma. -> AF036354 AF036354
'Fragaria multicipita' phytoplasma 'Fragaria multicipita' phytoplasma Bacteria;
    Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus
    Phytoplasma. -> AF190224 AF190224
'Fragaria multicipita' phytoplasma 'Fragaria multicipita' phytoplasma Bacteria;
    Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus
    Phytoplasma. -> AF190225 AF190225
'Helichrysum bracteatum' phytoplasma 'Helichrysum bracteatum' phytoplasma
    Bacteria; Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae;
    Candidatus Phytoplasma. -> AF515771 AF515771
'Momordica charantia' phytoplasma 'Momordica charantia' phytoplasma Bacteria;
    Firmicutes; Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus
    Phytoplasma. -> PHY431368 AJ431368
'Rehmannia glutinosa var. purpurea' phytoplasma
    'Rehmannia glutinosa var. purpurea' phytoplasma Bacteria; Firmicutes;
    Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma.
    -> AF335107 AF335107
'Rhizomonas' sp. 'Rhizomonas' sp. Bacteria; Proteobacteria; Alphaproteobacteria;
    Sphingomonadales; Sphingomonadaceae; Sphingomonas. -> UBA132327 AJ132327
'Vinca minor' phytoplasma 'Vinca minor' phytoplasma Bacteria; Firmicutes;
    Mollicutes; Acholeplasmatales; Acholeplasmataceae; Candidatus Phytoplasma.
    -> AY144608 AY144608
------------------------------------------------------------------------------

7.9. Part of the short directory file in DDBJ style in the file 'ddbjbct1.sdr'

The short directory file contains brief descriptions, in DDBJ style, of all the
sequence entries.
------------------------------------------------------------------------------
AC133631  Alicyclobacillus acidoterrestris 16S rRNA gene, strain DSM  1482bp
AAC133789 Alicyclobacillus acidocaldarius cyclomaltodextrinase gene   3097bp
AAC243194 Alicyclobacillus acidocaldarius kdpA gene.                  1720bp
AAC252160 Alicyclobacillus acidocaldarius cysA gene for putative      1638bp
AAC252161 Alicyclobacillus acidocaldarius maltose/maltodextrine       8690bp
AAC289685 Actinobacillus actinomycetemcomitans partial infB gene for   453bp
AAC289686 Actinobacillus actinomycetemcomitans partial infB gene for   453bp
AAC289687 Actinobacillus actinomycetemcomitans partial infB gene for   453bp
AAC289694 Actinobacillus actinomycetemcomitans partial infB gene for   453bp
AAC308623 Alicyclobacillus acidocaldarius celA gene for cellulase.    1778bp
AAC417690 Actinobacillus actinomycetemcomitans mukB gene.             4491bp
AAC419840 Acetobacter aceti 16S rRNA gene, strain LMG 1531.           1440bp
AAC430786 Actinobacillus actinomycetemcomitans partial fur gene for   1246bp
AAC493667 Alicyclobacillus acidocaldarius subsp. rittmannii 16S rRNA  1472bp
AAC496806 Alicyclobacillus acidocaldarius 16S rRNA gene, strain DSM   1507bp
------------------------------------------------------------------------------

In addition to the 9 tables above, the following four index files are included
in this release. These files were prepared for the BCT, EST, GSS, HTC, HTG, HUM,
INV, MAM, PAT, PHG, PLN, PRI, ROD, STS, SYN, UNA, VRL and VRT divisions.

    Accession number index file
    Keyword phrase index file
    Journal citation index file
    Gene name index file

A brief description of each file is given below.

7.10. Part of the accession number index file in the 'ddbjacc.idx'

The following excerpt from the accession number index file illustrates the
format of the index.
------------------------------------------------------------------------------
D00001  ECPBPA      BCT  X04516
D00002  ECPYRC      BCT  X04469
D00003  HUMP450M    HUM  D00003
D00004  FLBFLBL40   VRL  D00004
D00005  IBAMEM682   VRL  D00005
D00006  BACPNS1981  BCT  D00006
D00007  CHKCALGRP   VRT  D00007
D00008  ECPNTAB     BCT  X04195
D00009  DROPER1     INV  D00009
------------------------------------------------------------------------------

7.11. Part of the keyword phrase index file in the 'ddbjkey.idx'

Keyword phrases consist of names for gene products and other characteristics of
sequence entries.
------------------------------------------------------------------------------
"COAT PROTEIN
    SMO511347   VRL  AJ511347
'TNPA GENE
    UBA564903   BCT  AJ564903
'ZINC-FINGER' MOTIF
    PRNS53      VRL  X60546
(+) MATING TYPE SURFACE PROTEIN
    ABGPSSP     PLN  M94861
(1,3
    TABETGLUB   PLN  Z22874
(1,3)-BETA-D-GLUCAN BINDING PROTEIN
    AJ606470    INV  AJ606470
(1,3)BETA-GLUCAN SYNTHASE
    NCU09275    PLN  U09275
(1,4)-BETA-D-ARABINOXYLAN ARABINOFURANOHYDROLASE
    ANAXHA      PLN  Z78011
    ANTUAXHA    PLN  Z78010
(1,6)-BETA-GLUCAN BIOSYNTHESIS
    YSAKRE1A    PLN  M81588
(1-3)-BETA-GLUCANASE
    NTSP41AGN   PLN  X81560
    PA13BGPT    PLN  X57794
(1-3,1-4)-BETA-D-GLUCANASE
    HVBDG       PLN  X52572
(1-4)-BETA-MANNAN ENDOHYDROLASE
    CAR278996   PLN  AJ278996
    CAR293305   PLN  AJ293305
(2',5'-OLIGOISOADENYLATE SYNTHETASE-DEPENDENT)
    AL138776    HUM  AL138776
(2'-5') OLIGO(A) SYNTHASE E16
    SSO4G06     EST  F14610
(2'-5')OLIGOADENYLATE SYNTHETASE
    HSA225089   HUM  AJ225089
    HUMSYN25A   HUM  D00068
    SSA225090   MAM  AJ225090
(6')-IB' AMINOGLYCOSIDE ACETYLTRANSFERASE
    AXY278514   BCT  AJ278514
    PAE291609   BCT  AJ291609
(8,11)-LINOLEOYL DESATURASE
    COF245938   PLN  AJ245938
------------------------------------------------------------------------------

7.12. Part of the journal citation index file in 'ddbjjou.idx'

The journal citation index file lists all of the citations that appear in the
references.
------------------------------------------------------------------------------
(ER) AAPS PHARMSCI. 4 (3), DOI 10.1208/PS040315 (2002)
    AY170916    ROD  AY170916
(ER) AM. J. HUM. GENET. 76 (1) (2004) IN PRESS
    AY753209S1  HUM  AY753209
    AY753209S2  HUM  AY753210
(ER) ARCH. VIROL. (2004) IN PRESS
    AF531505    VRL  AF531505
    AY518899    VRL  AY518899
    AY518900    VRL  AY518900
    AY518901    VRL  AY518901
    AY518902    VRL  AY518902
    AY518903    VRL  AY518903
    AY518904    VRL  AY518904
    AY518905    VRL  AY518905
    AY518906    VRL  AY518906
    AY518907    VRL  AY518907
    AY518908    VRL  AY518908
    AY518909    VRL  AY518909
    AY518910    VRL  AY518910
    AY518911    VRL  AY518911
    AY518912    VRL  AY518912
    AY518913    VRL  AY518913
    AY518914    VRL  AY518914
    AY518915    VRL  AY518915
    AY518916    VRL  AY518916
    AY518917    VRL  AY518917
    AY518918    VRL  AY518918
    AY518919    VRL  AY518919
    AY518920    VRL  AY518920
    AY518921    VRL  AY518921
    AY518922    VRL  AY518922
    AY518923    VRL  AY518923
    AY518924    VRL  AY518924
    AY518925    VRL  AY518925
    AY518926    VRL  AY518926
    AY518927    VRL  AY518927
    AY518928    VRL  AY518928
    AY518929    VRL  AY518929
    AY518930    VRL  AY518930
    AY518931    VRL  AY518931
    AY518932    VRL  AY518932
    AY521234    VRL  AY521234
    AY521235    VRL  AY521235
    AY521236    VRL  AY521236
    AY521237    VRL  AY521237
    AY521238    VRL  AY521238
(ER) ARTERIOSCLER. THROMB. VASC. BIOL. (2004) IN PRESS
    AY563557    HUM  AY563557
(ER) BIOCHEM. BIOPHYS. RES. COMMUN. 325 (1), 203-214 (2004)
    AY563137    HUM  AY563137
(ER) BIOCHEM. J./10.1042/BJ20030293
    HSA496460   HUM  AJ496460
------------------------------------------------------------------------------

7.13. Part of the gene name index file in 'ddbjgen.idx'

This file lists all the gene names that appear in the feature table.
------------------------------------------------------------------------------
'ARR
    BX927156    BCT  BX927156
'BGLG
    BX927156    BCT  BX927156
'BGLS
    BX927148    BCT  BX927148
'BGLY'
    BX927156    BCT  BX927156
'BRNQ
    AF305888    BCT  AF305888
'COMK
    AL591983    BCT  AL591983
    AL596172    BCT  AL596172
'CRCB
    BX927155    BCT  BX927155
'CRTI
    BX927155    BCT  BX927155
'DPPE
    LDDIPEP     BCT  Z34898
'FIC
    BX936398    BCT  BX936398
------------------------------------------------------------------------------

8. Release history

Release  Date      Entries           Bases  Comments
     61  03/05  43,118,204  47,099,081,750
     60  12/04  40,583,945  44,416,752,273  /db_xref="H-inv:**" started
     59  09/04  37,926,117  42,245,956,937
     58  06/04  34,917,581  39,812,635,108
     57  03/04  32,693,678  38,008,449,840
     56  12/03  30,405,173  36,079,046,032
     55  09/03  27,753,140  34,280,225,489
     54  06/03  25,149,821  32,162,041,177
     53  02/03  23,250,813  29,711,299,332
     52  12/02  20,354,812  26,931,456,316
     51  09/02  18,401,358  22,782,404,136  TPA started
     50  06/02  17,260,693  20,158,357,982
     49  04/02  16,503,157  18,579,627,226
     48  01/02  15,016,100  16,197,713,855
     47  10/01  13,266,610  14,145,671,645
     46  07/01  12,313,759  13,037,646,166
     45  04/01  11,434,113  12,207,092,905  HTC division started
     44  01/01  10,165,597  11,136,298,841
     43  10/00   8,666,551  10,034,532,698
     42  07/00   7,554,995   8,880,721,093
     41  04/00   5,962,608   6,409,581,885  CON division started
     40  01/00   5,388,125   4,762,696,173  RNA division terminated
     39  10/99   4,810,773   3,728,000,562  NID and PID discarded
     38  07/99   4,294,369   3,098,519,597
     37  03/99   3,311,627   2,375,261,951  VERSION, /protein_id started
     36  01/99   3,073,166   2,190,425,560
     35  10/98   2,759,261   1,957,341,169
     34  07/98   2,412,785   1,708,580,623
     33  04/98   2,174,769   1,479,303,279
     32  01/98   1,956,669   1,300,950,613
     31  10/97   1,731,532   1,139,869,464  Adoption of the unified taxonomy database
     30  07/97   1,534,115     992,788,339  NID and PID terminated
     29  04/97   1,270,194     841,415,232
     28  01/97   1,154,120     756,785,219  HTG division started
                                            ORG division terminated
     27  10/96     936,697     608,103,057  GSS division started
     26  07/96     835,552     551,932,448
     25  04/96     744,490     499,300,364  /translation started
     24  01/96     637,508     431,771,652
     23  10/95     569,757     390,694,350
     22  07/95     437,588     322,982,425  HUM division started
     21  04/95     274,596     250,875,023
     20  01/95     239,689     231,299,557
     19  10/94     204,332     205,274,131
     18  07/94     185,230     192,473,021
     17  04/94     169,957     179,942,209
     16  01/94     154,626     165,017,628
     15  10/93     131,649     147,224,690
     14  07/93     120,350     138,686,333
     13  04/93     112,067     129,784,445
     12  01/93      97,683     120,815,244  EST division started
     11  07/92      65,693      84,839,075
     10  01/92      59,317      77,805,556  GenBank/EMBL inclusion started
      9  07/91       1,130       2,002,124
      8  01/91         879       1,573,442
      7  07/90         681       1,154,211
      6  01/90         496         841,236
      5  07/89         395         679,378
      4  01/89         302         535,985
      3  07/88         230         345,850
      2  01/88         142         199,392
      1  07/87          66         108,970  Started with DDBJ only

------------------
Since release 60
------------------
The cross-reference to the H-Invitational database has been included.

------------------
Since release 56
------------------
The three data banks have agreed to relax the maximum length limit (350 kb) on
a submitted sequence.

The BASE COUNT line of the DDBJ flat file format has been changed to reflect
this relaxation of the maximum sequence length that had been applied to entries
in the DDBJ/EMBL/GenBank International Nucleotide Sequence Databases. In the
old BASE COUNT line, 6 digits were allocated to each of the counts of a, c, g,
t and other bases in the sequence. In the new flat file format, 9 digits are
allocated to each of the counts of a, c, g and t, and the count of other bases
is no longer given.

In response to the same relaxation, GenBank dropped the BASE COUNT line from
its flat file format as of GenBank Release 138 (Oct. 2003). DDBJ has decided to
keep the BASE COUNT line in its flat file format, because GC content is still
important information for characterizing a sequence.

The changes in the BASE COUNT line are shown below.
----------------------------------------------------------------------------
Old (-rel. 55):
1    6    11   16   21   26   31   36   41   46   51   56   61   66   71
|----|----|----|----|----|----|----|----|----|----|----|----|----|----|
BASE COUNT 123456 a 123456 c 123456 g 123456 t 123456 others

New (rel. 56-):
1    6    11   16   21   26   31   36   41   46   51   56   61   66   71
|----|----|----|----|----|----|----|----|----|----|----|----|----|----|
BASE COUNT 123456789 a 123456789 c 123456789 g 123456789 t
----------------------------------------------------------------------------

The SOURCE in the flat file is reviewed and, if necessary, revised in
accordance with the unified taxonomy database common to the three data banks.

------------------
Since release 54
------------------
The '/sequenced_mol' qualifier has been changed to the '/mol_type' qualifier,
and the pertinent entries have been retrofitted accordingly. This change was
agreed at the INSD international collaborative meeting in 2002.

------------------
Since release 51
------------------
The TPA (Third Party Annotation) dataset has been made available. The dataset
complements the existing DDBJ/EMBL/GenBank database of primary nucleotide
sequences, which were obtained by direct sequencing of cDNAs, ESTs, genomic
DNAs, etc.

The format of the LOCUS line in the flat file has been changed as shown below
to conform to the GenBank format.
------------------------------------------------------------------------------
Old (-rel. 50):
LOCUS AB000001 660 bp DNA PLN 01-FEB-2001

New (rel. 51-):
LOCUS AB000001 660 bp DNA linear PLN 01-FEB-2001
------------------------------------------------------------------------------

------------------
Since release 45
------------------
The HTC (High Throughput cDNA) division has been included. It holds unfinished
high throughput cDNA sequences, each of which has a 5'UTR and a 3'UTR at its
ends and part of a coding region. The sequence may also include introns. When a
sequence is finished, it is moved to the corresponding taxonomic division. Each
sequence carries the keyword HTC (High Throughput cDNA), which is dropped when
the sequence is finished and moved to a taxonomic division.
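The BASE COUNT layout described under release 56 can be read with a few lines of code. The following is a minimal sketch in Python, not part of the DDBJ distribution: the function names parse_base_count and gc_content are hypothetical, and the sketch assumes only that each count is followed by its base letter and separated by whitespace, since exact column positions are not relied upon. An old-style line is also accepted, in which case the "others" count is simply ignored.

```python
import re

# One "<count> <base letter>" pair, e.g. "123456789 a". Only whitespace
# separation is assumed here, not the exact column positions.
_PAIR = re.compile(r"(\d+)\s+([acgt])\b")


def parse_base_count(line):
    """Return a dict of a/c/g/t counts taken from a BASE COUNT line."""
    if not line.startswith("BASE COUNT"):
        raise ValueError("not a BASE COUNT line: %r" % line)
    body = line[len("BASE COUNT"):]
    counts = {base: int(num) for num, base in _PAIR.findall(body)}
    missing = sorted(set("acgt") - set(counts))
    if missing:
        raise ValueError("missing counts for: %s" % ", ".join(missing))
    return counts


def gc_content(counts):
    """Fraction of g and c among the counted a, c, g and t bases."""
    total = sum(counts.values())
    return (counts["g"] + counts["c"]) / total if total else 0.0


if __name__ == "__main__":
    sample = "BASE COUNT        10 a        20 c        30 g        40 t"
    parsed = parse_base_count(sample)
    print(parsed, gc_content(parsed))
```

Matching on count and letter pairs rather than on fixed columns keeps the sketch usable for both the 6-digit and the 9-digit layout.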
------------------
Since release 41
------------------
The CON division has been included. This division shows the order of related
sequences in a genome, expressed by join and the accession numbers of the
sequences. The contents of the CON division are compiled by the three data
banks, not by the data submitters.

------------------
Since release 40
------------------
The RNA division was terminated. The RNA data have been redistributed according
to the category of the organism. You will therefore find a human RNA sequence,
for example, in the HUM division.

------------------
Since release 37
------------------
The three data banks include the item VERSION in the flat file, which indicates
the version of a submitted nucleotide sequence. It is expressed like
AB123456.1, in which the digit(s) after the period are the version number.
VERSION was added because a released sequence is sometimes revised by the
submitter, so that the accession number alone cannot specify the sequence in
question, which causes trouble for the user. The number is increased by one
every time a revised sequence is made public.

Accordingly, the translated protein sequence is accompanied by a /protein_id,
which is expressed like BAA12345.1, in which the digit(s) after the period are
again the version number. The number is increased by one when the corresponding
nucleotide sequence is revised, the protein sequence changes as a result, and
the revised protein sequence is made public.

------------------
Since release 31
------------------
We have started adopting the unified taxonomy database to unify the biological
source of the sequence. The database consists of scientific names, IDs of
unidentified organisms, synthetic constructs, etc.

------------------
Since release 30
------------------
NID and PID were terminated. This change was made on the agreement at the INSD
collaborative meeting in 1999.
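The VERSION and /protein_id identifiers described under release 37 share the form "accession, period, version number". The following minimal Python sketch shows one way to split such an identifier and to compare two versions of the same accession. The function names split_version and is_revision_of are hypothetical and chosen only for illustration.

```python
def split_version(identifier):
    """Split an identifier such as 'AB123456.1' into ('AB123456', 1)."""
    accession, dot, version = identifier.strip().rpartition(".")
    # rpartition returns an empty separator when no period is present.
    if not dot or not accession or not (version.isascii() and version.isdigit()):
        raise ValueError("expected <accession>.<version>: %r" % identifier)
    return accession, int(version)


def is_revision_of(newer, older):
    """True if both identifiers share an accession and `newer` has the higher version."""
    new_acc, new_ver = split_version(newer)
    old_acc, old_ver = split_version(older)
    return new_acc == old_acc and new_ver > old_ver


if __name__ == "__main__":
    print(split_version("AB123456.1"))
    print(is_revision_of("AB123456.2", "AB123456.1"))
```

Comparing the version as an integer rather than as text matters once a sequence has been revised ten or more times, since "10" sorts before "9" as a string.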
------------------
Since release 28
------------------
The HTG (High Throughput Genomic sequence) division has been included. This
division was created to accommodate genome project teams that use a clone as
the sequencing unit. We terminated the ORG (Organelle) division. Thus, if you
are interested in human mitochondrial sequences, for example, you should now
refer to the HUM division.

------------------
Since release 27
------------------
The GSS division has been included. GSS stands for Genome Survey Sequence,
which is similar to EST, except that GSS is genomic DNA whereas EST is cDNA.

------------------
Since release 25
------------------
The DDBJ release contains amino acid sequences that were translated from the
corresponding nucleotide sequences of the INSD database. In the translation, we
took care that some species and organelles use a genetic code different from
the universal one, and used the proper codon table.

------------------
Since release 22
------------------
The HUM division has been included. Human genome projects have probably been
the most productive and have yielded a large number of sequences. Thus, we have
the human (HUM) division solely for human sequences and the primate (PRI)
division for non-human primate sequences.

------------------
Since release 12
------------------
The EST (Expressed Sequence Tag) division has been included. The number of ESTs
has been increasing at an enormous rate and is expected to grow even more
rapidly in the future. Thus, we created a division for ESTs.

------------------
Since release 10
------------------
The sequences submitted to GenBank or EMBL have been included in the DDBJ
database.

9. File list

The files in this release are arranged in the following order in non-labeled
format.
file name        number of entries  number of bases   file size
-----------------------------------------------------------------------
ddbjrel.txt (DDBJ release note)                           93367
ddbjacc.idx (Accession number index file)            1681205056
ddbjgen.idx (Gene name index file)                     72369168
ddbjjou.idx (Journal citation index file)            1814941188
ddbjkey.idx (Keyword phrase index file)              1590628199
----------------------------------------------------------------------

file name        number of entries  number of bases   file size
-----------------------------------------------------------------------
ddbjbct1.seq                 33616        119506154   299020893
ddbjbct2.seq                  6726        130595158   299570218
ddbjbct3.seq                  6802        133473442   299011770
ddbjbct4.seq                 78311        102427239   299000157
ddbjbct5.seq                 32082        116878127   299001575
ddbjbct6.seq                 90915         94252253   299001156
ddbjbct7.seq                 20655        132201169   299396165
ddbjbct8.seq                   448        129875576   301226085
ddbjbct9.seq                 40968        120156017   299000591
ddbjbct10.seq                  154          1592466     3759500
ddbjest1.seq                 89192         33694618   299000134
ddbjest2.seq                 94413         37824279   299002848
ddbjest3.seq                 94451         36816451   299001930
ddbjest4.seq                 89497         28248317   299001995
ddbjest5.seq                 93388         35876886   299002188
ddbjest6.seq                 96879         38445676   299002982
ddbjest7.seq                 97789         37885672   299001650
ddbjest8.seq                 97422         37462135   299001767
ddbjest9.seq                 97130         38384450   299000578
ddbjest10.seq                98821         38849876   299003625
ddbjest11.seq                96673         38377758   299000109
ddbjest12.seq                96084         42824568   299000533
ddbjest13.seq               102952         42220586   299001681
ddbjest14.seq               101533         40615898   299002542
ddbjest15.seq                97361         40353432   299000860
ddbjest16.seq                95193         41905311   299002371
ddbjest17.seq                95363         39209026   299000942
ddbjest18.seq                97121         41494958   299002681
ddbjest19.seq                96246         43021712   299001145
ddbjest20.seq                93690         37924527   299001110
ddbjest21.seq               100539         44863983   298999957
ddbjest22.seq               131105         60650471   299001683
ddbjest23.seq               122322         64146812   299000327
ddbjest24.seq                94288         55734107   299001188
ddbjest25.seq               102842         75930568   299002392
ddbjest26.seq               119344         60977432   299000915
ddbjest27.seq               122866         62421255   299001704
ddbjest28.seq               122142         61145640   299001579
ddbjest29.seq               126650         57867973   299000093
ddbjest30.seq                95610         30103144   299002586
ddbjest31.seq                92529         24749244   299000103
ddbjest32.seq                78995         23517099   299002575
ddbjest33.seq                60731         16893997   299000518
ddbjest34.seq                60557         16016463   299003389
ddbjest35.seq               110900         47402414   299000315
ddbjest36.seq               116479         54804309   299001542
ddbjest37.seq               103391         52796887   299000938
ddbjest38.seq               117257         58246846   299000572
ddbjest39.seq               118897         59387413   299002462
ddbjest40.seq                92923         39581835   299000057
ddbjest41.seq                89962         39932166   299000901
ddbjest42.seq                90576         38646954   299001212
ddbjest43.seq               104782         42328629   299000547
ddbjest44.seq                91139         35852449   299001122
ddbjest45.seq                84664         37863339   299000722
ddbjest46.seq                96805         44239075   299002488
ddbjest47.seq                95646         41662165   299000023
ddbjest48.seq                95226         32879959   299002306
ddbjest49.seq               107474         48662108   299000643
ddbjest50.seq                60852         17031275   299001201
ddbjest51.seq                60252         17942680   299000161
ddbjest52.seq                60532         18527167   299002678
ddbjest53.seq                60482         19000306   299002695
ddbjest54.seq                60514         18516012   299002877
ddbjest55.seq                60380         18343719   299004073
ddbjest56.seq                61364         18372883   299004027
ddbjest57.seq                61725         19271786   299000556
ddbjest58.seq                61658         19516219   299003199
ddbjest59.seq                62442         18196106   299000761
ddbjest60.seq                57559         36301798   299002146
ddbjest61.seq                54872         22613216   299002936
ddbjest62.seq                54276         24665741   299003785
ddbjest63.seq                54460         22724108   299005189
ddbjest64.seq                63641         26012491   299002271
ddbjest65.seq                94853         39895072   299002554
ddbjest66.seq                95008         39388083   299001932
ddbjest67.seq                98520         55730912   299001849
ddbjest68.seq                98038         53476273   299000467
ddbjest69.seq                97167         45526330   299002582
ddbjest70.seq                92638         53056207   299002633
ddbjest71.seq                95430         43454083   299001135
ddbjest72.seq                93137         55803919   299002080
ddbjest73.seq                94855         50761755   299002756
ddbjest74.seq                87651         49038090   299002524
ddbjest75.seq                91576         45462310   299001084
ddbjest76.seq                93089         55815749   299003246
ddbjest77.seq                88032         54944902   299004104
ddbjest78.seq                96372         53579452   299002887
ddbjest79.seq                90508         38469990   299001529
ddbjest80.seq                84673         45768467   299002635
ddbjest81.seq                83157         45652386   299002208
ddbjest82.seq                92548         57012123   299001731
ddbjest83.seq                96907         42042936   299001562
ddbjest84.seq                95549         34421777   299001815
ddbjest85.seq                94671         42540306   299000380
ddbjest86.seq                84756         43846373   299000254
ddbjest87.seq               101768         61706893   299000715
ddbjest88.seq                97427         59155287   299000043
ddbjest89.seq                87184         52170703   299002703
ddbjest90.seq                92591         62542718   299000056
ddbjest91.seq                88886         55500827   299000967
ddbjest92.seq                95214         53994376   299001910
ddbjest93.seq                93929         62943558   299002086
ddbjest94.seq                93336         62824057   299001330
ddbjest95.seq                99264         52760210   299002459
ddbjest96.seq                97021         37372910   299000916
ddbjest97.seq               103421         60628870   299001603
ddbjest98.seq                95766         56382488   299001496
ddbjest99.seq                83335         41668338   299001534
ddbjest100.seq               88840         48686780   299001092
ddbjest101.seq               84693         45301690   299003013
ddbjest102.seq               91186         56213870   299001923
ddbjest103.seq               91618         57447151   299001075
ddbjest104.seq               86247         49683877   299003214
ddbjest105.seq               94296         54798449   299001806
ddbjest106.seq               80602         49175634   298999929
ddbjest107.seq              104570         56188829   299000603
ddbjest108.seq              108511         61892921   299000109
ddbjest109.seq              105078         54509775   299002162
ddbjest110.seq              130395         69571286   299001809
ddbjest111.seq              121131         67144660   299002103
ddbjest112.seq               95631         56856014   299002629
ddbjest113.seq              113294         66827135   299002227
ddbjest114.seq              109901         65948516   299001262
ddbjest115.seq               87469         46393302   299001088
ddbjest116.seq               80667         36168647   299000249
ddbjest117.seq               71950         34665203   299002493
ddbjest118.seq               80636         40701427   299003286
ddbjest119.seq               88214         46117003   299001602
ddbjest120.seq               87858         57873623   299001071
ddbjest121.seq               95551         66376318   299002747
ddbjest122.seq               82780         43875678   299000202
ddbjest123.seq               81167         41503819   299000411
ddbjest124.seq               85031         47574191   299000477
ddbjest125.seq               82513         57296379   299001774
ddbjest126.seq              104190         53257557   298999946
ddbjest127.seq              104037         55563241   299001421
ddbjest128.seq              119953         68737655   299001667
ddbjest129.seq              118479         67006677   299000403
ddbjest130.seq               96724         46990764   299000251
ddbjest131.seq               92931         41974794   299002255
ddbjest132.seq              112404         55283415   299000010
ddbjest133.seq              105013         43376587   299000073
ddbjest134.seq               96017         51373440   299002258
ddbjest135.seq               87403         53152804   299001664
ddbjest136.seq               77340         48915013   299002062
ddbjest137.seq               90701         46245989   299002716
ddbjest138.seq               97357         42418520   299000977
ddbjest139.seq               94614         54984152   299001187
ddbjest140.seq               87707         44430998   299002108
ddbjest141.seq               87750         59448312   299002388
ddbjest142.seq               87744         55709043   299002120
ddbjest143.seq               88059         45275224   299002416
ddbjest144.seq               89202         74221342   299002082
ddbjest145.seq               89470         44642332   299002249
ddbjest146.seq               89857         57804002   299000216
ddbjest147.seq               88470         71938460   299001254
ddbjest148.seq               82617         60655164   299001027
ddbjest149.seq               82351         60648226   299000088
ddbjest150.seq               83742         59143255   299002953
ddbjest151.seq               82082         61293586   299001087
ddbjest152.seq               81904         48895531   299003274
ddbjest153.seq               82955         46625021   299002094
ddbjest154.seq               94110         51151653   299000299
ddbjest155.seq              108121         69591080   299002179
ddbjest156.seq               98045         59942479   299000110
ddbjest157.seq              133474         82610053   299000969
ddbjest158.seq              136870         80796044   299000561
ddbjest159.seq              130079         82316125   299001565
ddbjest160.seq              139797         73955535   299001464
ddbjest161.seq               91460         49543955   299000468
ddbjest162.seq               84967         61790952   299002301
ddbjest163.seq               93433         79146254   299002087
ddbjest164.seq              100710         51692207   299000075
ddbjest165.seq              105052         65818608   299002687
ddbjest166.seq               88415         61965728   299000105
ddbjest167.seq               69050         33710478   299004382
ddbjest168.seq               57419         21332144   299004440
ddbjest169.seq               58498         20262281   299003981
ddbjest170.seq               56388         20694873   299003758
ddbjest171.seq               56362         23091384   299003700
ddbjest172.seq               56653         22068464   299003115
ddbjest173.seq               58402         20149601   299001712
ddbjest174.seq               58855         23587591   299001805
ddbjest175.seq               55715         24508184   299000821
ddbjest176.seq               55898         22513723   299003479
ddbjest177.seq               56043         24011530   299004829
ddbjest178.seq               56507         22336401   299003487
ddbjest179.seq               55326         27649140   299002946
ddbjest180.seq               54871         34428404   299003850
ddbjest181.seq              121395         44508860   299000773
ddbjest182.seq               92281         52819601   299000539
ddbjest183.seq               87902         56810831   299000358
ddbjest184.seq               88066         56596187   299002886
ddbjest185.seq               86855         51639309   299000158
ddbjest186.seq               90840         52874531   299003207
ddbjest187.seq               88443         60561678   299001630
ddbjest188.seq               85346         42231367   299000241
ddbjest189.seq              128547         59530658   298999949
ddbjest190.seq               93154         53027667   299000028
ddbjest191.seq               84894         41088890   299001151
ddbjest192.seq               88481         51038865   299000483
ddbjest193.seq               92405         43524483   299001368
ddbjest194.seq               89059         52630010   299002923
ddbjest195.seq               87275         47619585   299003382
ddbjest196.seq               86917         48230679   299000479
ddbjest197.seq              107772         55892224   299001471
ddbjest198.seq               94954         60929930   299001695
ddbjest199.seq              100149         69608213   299001432
ddbjest200.seq              137536         60943675   299002214
ddbjest201.seq              101265         56806303   299002277
ddbjest202.seq               90480         55614325   299001523
ddbjest203.seq               95111         46076280   299002747
ddbjest204.seq               95722         41060142   299002042
ddbjest205.seq               83855         50039442   299003080
ddbjest206.seq               81481         52303866   299001514
ddbjest207.seq               88239         49622047   299000951
ddbjest208.seq               78141         51863400   299003843
ddbjest209.seq              101548         53065022   299000064
ddbjest210.seq              103526         54952794   299002515
ddbjest211.seq              114279         67811052   299000264
ddbjest212.seq              147484         63380895   299000128
ddbjest213.seq              105911         50351369   299001849
ddbjest214.seq               87225         50323750   299000821
ddbjest215.seq               97722         55105138   299002276
ddbjest216.seq               93762         52637353   299002452
ddbjest217.seq               90684         52600345   299003352
ddbjest218.seq               78746         47208205   299001602
ddbjest219.seq               67926         36622916   299001701
ddbjest220.seq               93652         64891733   299000155
ddbjest221.seq               92743         54066693   299000867
ddbjest222.seq               87220         50608598   299003551
ddbjest223.seq               92156         54776233   299002953
ddbjest224.seq              100820         55961134   298999970
ddbjest225.seq               92344         65705773   299000373
ddbjest226.seq               80150         61975541   299002990
ddbjest227.seq               81161         51598559   299003988
ddbjest228.seq               85588         53343426   299002454
ddbjest229.seq               98460         58184762   299000602
ddbjest230.seq               91070         47573908   299000857
ddbjest231.seq               82286         44599524   299000736
ddbjest232.seq               79775         44377679   299003331
ddbjest233.seq               95010         56811538   299000982
ddbjest234.seq               96120         50832996   299000971
ddbjest235.seq               89677         55322069   299002524
ddbjest236.seq               86422         52850949   299000168
ddbjest237.seq               99358         59659804   299000830
ddbjest238.seq              109950         63706509   299001895
ddbjest239.seq               95203         51537741   299001974
ddbjest240.seq               79555         46070564   299002488
ddbjest241.seq               88869         47571102   299000401
ddbjest242.seq               62089         32439651   299002624
ddbjest243.seq               90143         53807300   299000832
ddbjest244.seq              124021         57780360   299001024
ddbjest245.seq               89104         59709470   299000013
ddbjest246.seq              100057         64741356   299003130
ddbjest247.seq               79018         45457988   299001691
ddbjest248.seq              109276         48718683   299000533
ddbjest249.seq               78023         49668571   299003337
ddbjest250.seq               83272         47821580   299000364
ddbjest251.seq              110726         67401381   299000339
ddbjest252.seq              103042         54105026   299000644
ddbjest253.seq               99222         55167757   299002652
ddbjest254.seq               63953         38747836   299007253
ddbjest255.seq               66123         37337053   299002464
ddbjest256.seq              103033         57457379   299000000
ddbjest257.seq              100114         58419294   299003171
ddbjest258.seq               91518         51364418   299001064
ddbjest259.seq               75312         49011569   299002237
ddbjest260.seq               85257         48104058   299001147
ddbjest261.seq               90113         40564036   298999999
ddbjest262.seq              100411         37869331   299001260
ddbjest263.seq               87976         51611511   299001904
ddbjest264.seq               86218         48232727   299002083
ddbjest265.seq               89575         58732248   299001219
ddbjest266.seq               85591         50455289   299001681
ddbjest267.seq               86715         49898942   299000495
ddbjest268.seq               91136         56705264   299000821
ddbjest269.seq              105781         62366637   299001942
ddbjest270.seq               82292         55416129   299002081
ddbjest271.seq               73747         53704328   299000432
ddbjest272.seq               74802         52950645   299001414
ddbjest273.seq               96020         58869131   299003113
ddbjest274.seq               68601         41600924   299002415
ddbjest275.seq               75929         48554177   299000654
ddbjest276.seq               72880         53035326   299003097
ddbjest277.seq               75985         54431272   299003045
ddbjest278.seq               62565         45238807   299001006
ddbjest279.seq               96525         49329501   299000651
ddbjest280.seq               71694         55058220   299001385
ddbjest281.seq               99426         45868736   299000060
ddbjest282.seq               89362         32421768   299001923
ddbjest283.seq               95123         34969782   299000152
ddbjest284.seq               92112         33397120   299002219
ddbjest285.seq              101315         34652117   299001136
ddbjest286.seq               88220         37113941   299002213
ddbjest287.seq               28730          9353138    79349981
ddbjgss1.seq                104256         75961613   299002146
ddbjgss2.seq                101457         70689957   299000264
ddbjgss3.seq                106256         59038630   299002822
ddbjgss4.seq                 79842         68408063   299003043
ddbjgss5.seq                 80477         67045155   299000325
ddbjgss6.seq                 81263         65434438   299002882
ddbjgss7.seq                 83847         67153687   299002287
ddbjgss8.seq                 97846         75613693   299000050
ddbjgss9.seq                103882         54263509   299001741
ddbjgss10.seq                97970         75599400   299001890
ddbjgss11.seq                82076         68608208   298999953
ddbjgss12.seq                80367         72151634   299000021
ddbjgss13.seq                75029         64952145   299001626
ddbjgss14.seq               107201         45086244   299001565
ddbjgss15.seq               112420         46927281   299001362
ddbjgss16.seq               113870         50952147   299000713
ddbjgss17.seq               107256         52960086   299001057
ddbjgss18.seq                99829         52597484   299000479
ddbjgss19.seq               100112         49887663   299000838
ddbjgss20.seq                98081         50026929   299002594
ddbjgss21.seq                94264         47605437   299000734
ddbjgss22.seq                95558         53971292   299002783
ddbjgss23.seq                90156         46764660   299001960
ddbjgss24.seq                95754         50076806   299003180
ddbjgss25.seq                92267         61101479   299002888
ddbjgss26.seq                89448         41185321   299000762
ddbjgss27.seq                98890         57377779   299002629
ddbjgss28.seq                87721         40225281   299000298
ddbjgss29.seq                72960         37467357   299002832
ddbjgss30.seq                73920         34868353   299001433
ddbjgss31.seq                79456         43190178   299000961
ddbjgss32.seq                79671         39090761   299003758
ddbjgss33.seq                78702         48264101   299000271
ddbjgss34.seq                88106         37527799   298999953
ddbjgss35.seq                74273         33911944   299003347
ddbjgss36.seq                90238         43023081   299000226
ddbjgss37.seq                82373         44675231   299003208
ddbjgss38.seq                94824         53068289   299000618
ddbjgss39.seq                92354         53943244   299002226
ddbjgss40.seq               101065         53432847   299000464
ddbjgss41.seq                97620         52331575   299000560
ddbjgss42.seq               119451         79383194   299002357
ddbjgss43.seq               114403         64421043   299002212
ddbjgss44.seq               113710         66622417   299002451
ddbjgss45.seq               110886         45779605   299000653
ddbjgss46.seq               104341         60186484   299000689
ddbjgss47.seq               123429         80505554   299001705
ddbjgss48.seq               108023         49057828   299000783
ddbjgss49.seq               104180         73955697   299000989
ddbjgss50.seq                92346         60893251   299000918
ddbjgss51.seq                92386         60765956   299000985
ddbjgss52.seq               102247         50340329   299001119
ddbjgss53.seq               110295         76097767   299000112
ddbjgss54.seq               109492         81400028   299000879
ddbjgss55.seq               114359         69761575   299001804
ddbjgss56.seq               113450         76653519   299001822
ddbjgss57.seq                93910         49953710   299002389
ddbjgss58.seq               104633         64078803   299001322
ddbjgss59.seq               114170         51573249   299002391
ddbjgss60.seq               111299         68758598   299002295
ddbjgss61.seq                96490         88399140   299000579
ddbjgss62.seq                98542         95613979   299000696
ddbjgss63.seq               116308         81504344   299000282
ddbjgss64.seq                82264         57028324   299000400
ddbjgss65.seq                96229         68223969   299002521
ddbjgss66.seq               113703         83515063   299001721
ddbjgss67.seq               108531         65719805   299001999
ddbjgss68.seq               113859         74719057   299001735
ddbjgss69.seq               126318         70624263   299002213
ddbjgss70.seq               127251         69454722   299002304
ddbjgss71.seq               128416         67930080   299001829
ddbjgss72.seq               129967         65898207   299001865
ddbjgss73.seq               129479         66536921   299001309
ddbjgss74.seq               129917         65963197   299001958
ddbjgss75.seq               124707         72423481   299001445
ddbjgss76.seq               114139         89220490   299001533
ddbjgss77.seq               112879         88996025   298999942
ddbjgss78.seq               111389         88335715   299001175
ddbjgss79.seq               113109         77396802   299000142
ddbjgss80.seq               120397         34914906   299001060
ddbjgss81.seq               117468         45504669   299000944
ddbjgss82.seq               102187         61698709   299002502
ddbjgss83.seq               111808         72528790   299000541
ddbjgss84.seq               105022         80578329   299002117
ddbjgss85.seq               100446        101277448   299002181
ddbjgss86.seq               106547         64133080   299001851
ddbjgss87.seq               106474         61047685   299001546
ddbjgss88.seq               105675         59456427   299000850
ddbjgss89.seq                94475         76191762   299001801
ddbjgss90.seq               104807         70521559   299000712
ddbjgss91.seq               110716         74810286   299001374
ddbjgss92.seq       107077     71858805     299001595
ddbjgss93.seq       109968     78575333     299000013
ddbjgss94.seq       137337     88334783     299001707
ddbjgss95.seq       139916     89482801     299001442
ddbjgss96.seq       125536     68845613     299000805
ddbjgss97.seq        99499     65151927     299000746
ddbjgss98.seq       103017     61361252     299000616
ddbjgss99.seq        96353     56762011     299001246
ddbjgss100.seq       95222     58877022     299000517
ddbjgss101.seq       95331     58613694     299001907
ddbjgss102.seq       95447     58195199     299000765
ddbjgss103.seq      100684     64405681     299002295
ddbjgss104.seq      108880     73602992     299000369
ddbjgss105.seq      105318     66931450     299002506
ddbjgss106.seq      109168     66105478     299001196
ddbjgss107.seq      100633     69314788     299002164
ddbjgss108.seq      121893     61255960     299000463
ddbjgss109.seq      101109     64449636     299003150
ddbjgss110.seq       15281      8754769      43936672
ddbjhtc1.seq         37995     68278438     299006614
ddbjhtc2.seq         53829     61207572     299002100
ddbjhtc3.seq         88679     79859866     299001991
ddbjhtc4.seq         86862    102845033     299000736
ddbjhtc5.seq        107930    106312381     299002855
ddbjhtc6.seq           849      3033300       6157690
ddbjhtg1.seq          1570    225728940     299199131
ddbjhtg2.seq          3363    221497062     299065856
ddbjhtg3.seq          2995    223501492     299000055
ddbjhtg4.seq          1937    223593962     299096805
ddbjhtg5.seq          1525    221472429     299170668
ddbjhtg6.seq          1495    221771008     299009542
ddbjhtg7.seq          1536    221314192     299220116
ddbjhtg8.seq          1333    225556810     299041881
ddbjhtg9.seq          1789    218676960     299164204
ddbjhtg10.seq         1134    229059280     299214871
ddbjhtg11.seq          900    230150524     299318448
ddbjhtg12.seq          886    229879586     299138547
ddbjhtg13.seq          965    229754467     299072153
ddbjhtg14.seq          918    229847059     299084448
ddbjhtg15.seq         1997    212637694     299092709
ddbjhtg16.seq         1265    224739095     299155699
ddbjhtg17.seq         1371    223626681     299040517
ddbjhtg18.seq          931    229494299     299032462
ddbjhtg19.seq         1116    227660031     299091963
ddbjhtg20.seq         1036    228724442     299413412
ddbjhtg21.seq          952    229311702     299126870
ddbjhtg22.seq         1003    229081669     299065544
ddbjhtg23.seq         1030    229198888     299364176
ddbjhtg24.seq         1114    227814684     299149540
ddbjhtg25.seq         1116    227765059     299027830
ddbjhtg26.seq         1109    228086567     299182084
ddbjhtg27.seq         1061    228853267     299101403
ddbjhtg28.seq         1154    227210574     299009409
ddbjhtg29.seq         1034    228931400     299191030
ddbjhtg30.seq         1021    229115086     299141816
ddbjhtg31.seq         1020    228900304     299124829
ddbjhtg32.seq         1121    227683626     299045853
ddbjhtg33.seq         1129    228132470     299022619
ddbjhtg34.seq         1182    227376785     299090080
ddbjhtg35.seq         1417    224034505     299040085
ddbjhtg36.seq         1477    225459764     299247733
ddbjhtg37.seq         1387    225510330     299185459
ddbjhtg38.seq         1341    225461231     299121930
ddbjhtg39.seq         1361    228990491     299108618
ddbjhtg40.seq         1516    228185554     299076375
ddbjhtg41.seq         1438    228937156     299163992
ddbjhtg42.seq         1401    228279205     299088496
ddbjhtg43.seq         1300    227530548     299202249
ddbjhtg44.seq         1285    227540946     299053628
ddbjhtg45.seq         1415    227712919     299077634
ddbjhtg46.seq         1837    223910806     299037057
ddbjhtg47.seq         1349    229838969     299038704
ddbjhtg48.seq         1397    229676302     299987468
ddbjhtg49.seq         1655    228852474     299083740
ddbjhtg50.seq         1207    231395776     299030957
ddbjhtg51.seq         1239    231302185     299165021
ddbjhtg52.seq         1262    230969398     299056829
ddbjhtg53.seq          500     73482527      95245787
ddbjhum1.seq         13640    192014183     299157746
ddbjhum2.seq          1609    211986734     299163803
ddbjhum3.seq          1579    217693918     299135363
ddbjhum4.seq          1350    206434907     299019487
ddbjhum5.seq          1444    213292327     299100254
ddbjhum6.seq          1463    210787823     299099385
ddbjhum7.seq          1551    204111506     299157927
ddbjhum8.seq          1625    213717051     299149702
ddbjhum9.seq          1511    208215245     299119001
ddbjhum10.seq         1786    209393209     299136991
ddbjhum11.seq         1948    213087626     299075101
ddbjhum12.seq        31882    171512524     299008252
ddbjhum13.seq        70372    103241147     299009543
ddbjhum14.seq        18632    169761109     299036689
ddbjhum15.seq         2893    219830543     299014009
ddbjhum16.seq         2027    223285950     299015402
ddbjhum17.seq         2512    221221409     299114457
ddbjhum18.seq         4037    217312671     299145149
ddbjhum19.seq         7031    214671394     299038493
ddbjhum20.seq        51693     97249716     299006030
ddbjhum21.seq        44645    127231249     299001560
ddbjhum22.seq        50528     89939769     218132523
ddbjinv1.seq         14705    203313069     299188392
ddbjinv2.seq          3802    188348232     299001591
ddbjinv3.seq         86695     93034078     299002913
ddbjinv4.seq         69625    104771727     299000969
ddbjinv5.seq         79870     97996478     299043886
ddbjinv6.seq         15918     68108263     141781254
ddbjmam.seq          70323    119330247     291816908
ddbjpat1.seq        255273     89006233     299000330
ddbjpat2.seq        217607    107778253     299000042
ddbjpat3.seq        182119    120483906     299000350
ddbjpat4.seq        170588     95749924     299000222
ddbjpat5.seq        130352    131840948     299000492
ddbjpat6.seq        150804    113933531     299000197
ddbjpat7.seq        164543    101566981     299000711
ddbjpat8.seq        170333     62723932     299001228
ddbjpat9.seq        119674     73186362     299001897
ddbjpat10.seq       142028     63757625     299000557
ddbjpat11.seq       145011     71055426     299001227
ddbjpat12.seq       169923     64116586     299000023
ddbjpat13.seq       189346     81219150     299000647
ddbjpat14.seq       120574    141486153     299000960
ddbjpat15.seq       125400    130416170     299000087
ddbjpat16.seq       140667     91264016     299000071
ddbjpat17.seq        57473     13776807      58887729
ddbjphg.seq           2887     14169296      35976370
ddbjpln1.seq         30825    158145409     299079987
ddbjpln2.seq          1824    220274374     299015626
ddbjpln3.seq         61113    118641058     299002610
ddbjpln4.seq         87527     93575044     299001648
ddbjpln5.seq         68366     58071676     299003090
ddbjpln6.seq         26331    119461161     299008738
ddbjpln7.seq          1630    201657990     299209481
ddbjpln8.seq          1222    229162279     316925413
ddbjpln9.seq             9    264008642     334509961
ddbjpln10.seq        70084     97320448     299000528
ddbjpln11.seq        91187     93097600     299002745
ddbjpln12.seq        42621    132762237     299068267
ddbjpln13.seq        30091     57549859     148951819
ddbjpri.seq          33531    263539779     403604426
ddbjrod1.seq          8481    206099457     299014892
ddbjrod2.seq          1095    207676374     299018397
ddbjrod3.seq          1081    208841759     299022394
ddbjrod4.seq          1141    211288036     299004825
ddbjrod5.seq          1157    215196660     299090872
ddbjrod6.seq          1181    216742335     299000035
ddbjrod7.seq          1203    218524637     299112938
ddbjrod8.seq          1211    220643975     299135544
ddbjrod9.seq         32217    172879780     299002103
ddbjrod10.seq         4211    221702379     299142995
ddbjrod11.seq         1459    231618835     299056789
ddbjrod12.seq        25499    184135357     299005197
ddbjrod13.seq        24514    136561978     299028942
ddbjrod14.seq        37540     48501309     141380451
ddbjsts1.seq        104683     57906348     299002121
ddbjsts2.seq         86242     35520784     299000989
ddbjsts3.seq         94176     44638923     299004468
ddbjsts4.seq         65247     38072339     299001423
ddbjsts5.seq         65077     38369772     299002670
ddbjsts6.seq         64997     38503645     299004054
ddbjsts7.seq        100262     38861926     299001005
ddbjsts8.seq         36747     16604055      92827158
ddbjsyn.seq          15483     24522628      70756925
ddbjuna.seq           2356      1276659       5707803
ddbjvrl1.seq         86511     76814911     299001725
ddbjvrl2.seq         86406     77321642     299000172
ddbjvrl3.seq         85233     83494502     299011835
ddbjvrl4.seq         16965     17421425      61271764
ddbjvrt1.seq         76294    112771951     299009850
ddbjvrt2.seq         23661    190698835     299000660
ddbjvrt3.seq         66657     79380847     299001299
ddbjvrt4.seq          2265    228128044     299193766
ddbjvrt5.seq          1541    231064375     299158133
ddbjvrt6.seq         25532    193292763     299051839
ddbjvrt7.seq         19691     21076599      66602796
------------------------------------------------------------------------------
Total             43118204  47099081750  164689923396

ddbjcon.seq         416054            0    1066298274
ddbjtpa.seq           4523     14616072      32290213

The entries and bases in the CON division and the TPA dataset are not counted
in the numbers given at the top of the release note or in the 'Total' row of
the table above.
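
The exclusion rule in the note above can be illustrated with a minimal Python sketch. It assumes each file-list row is whitespace-separated as "file name, entries, bases, size", where reading the fourth column as a file size is an assumption, so that column is ignored. The names EXCLUDED, SAMPLE_ROWS, parse_row and subtotal are hypothetical and are not part of any DDBJ tool. The four sample rows are copied from the table, so the result is only a subtotal for those rows, not the release total.

```python
# Sketch: sum entries and bases over file-list rows, skipping the CON and TPA
# files, which the release note says are not counted in 'Total'.
# Assumption: each row is "<file name> <entries> <bases> <size>".

EXCLUDED = {"ddbjcon.seq", "ddbjtpa.seq"}  # hypothetical constant name

# Four rows copied from the file list above (illustration only).
SAMPLE_ROWS = """\
ddbjsyn.seq 15483 24522628 70756925
ddbjuna.seq 2356 1276659 5707803
ddbjcon.seq 416054 0 1066298274
ddbjtpa.seq 4523 14616072 32290213
"""


def parse_row(line):
    """Split one row into (file name, entries, bases); the 4th column is ignored."""
    name, entries, bases, _size = line.split()
    return name, int(entries), int(bases)


def subtotal(text, excluded=EXCLUDED):
    """Return (entries, bases) summed over all rows whose file is not excluded."""
    total_entries = 0
    total_bases = 0
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        name, entries, bases = parse_row(line)
        if name in excluded:
            continue  # CON and TPA are left out of the totals
        total_entries += entries
        total_bases += bases
    return total_entries, total_bases


if __name__ == "__main__":
    print(subtotal(SAMPLE_ROWS))
```

Running the same function over every row of the complete file list would be one way to cross-check the 'Total' row, provided the rows are first restored to one file per line.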