DNA Data Bank of Japan DNA Database Release 58, June 2004, including 34,917,581 entries, 39,812,635,108 bases This database may be copied and redistributed without permission on the condition that all the statements in this release note are reproduced in each copy. The present release contains the newest data prepared by the DNA Data Bank of Japan (DDBJ), GenBank, and European Molecular Biology Laboratory/European Bioinformatics Institute (EMBL/EBI) as of May 27, 2004. This unified database was made possible thanks to the international collaboration among the three data banks. All the entries have accordingly been annotated using the feature keys common to them. All the entries designated by the accession numbers with the prefixes "C", "D", "E", "AB", "AG", "AK", "AP", "AT", "AU", "AV", "BA", "BB", "BD", "BJ", "BP", "BS", "BW" and "BY" have been collected and processed by DDBJ, and the rest have been prepared by GenBank and EMBL/EBI. There have been a number of genome projects going on worldwide. Among them human genome projects have probably been most productive and yielded a large number of ordinary sequences, huge amounts of ESTs and quantities of genome sequences. Thus, we have the human(HUM) division solely for human sequences and the primate (PRI) division for non-human primate sequences. The HUM division in this release was recorded in 22 files each of which had 300 MB storage capacity. Incidentally, the BCT, HTC, INV, PLN, ROD, STS, VRL and VRT divisions were recorded in 8, 3, 5, 10, 10, 4, 3, 5 files, respectively. Note that the EST division also contains human sequences. The present release does not have the ORG division. Thus, if you are interested in human mitochondrial sequences, for example, you are now advised to refer to the HUM division. This release includes a division (PAT) for patent data. The patent data are those which the Japanese Patent Office (JPO), United States Patent and Trademark Office (USPTO), and the European Patent Office (EPO) collected and processed. The accession numbers of the patent data collected by the Japanese Patent Office start with the prefix "E" and "BD", those collected and supplied by USPTO and GenBank respectively start with "I" and "AR", and those collected and supplied by EPO and EMBL/EBI respectively start with "A" "AX" and "CQ". The entries with the prefixes "I", "AR", "A", "AX", "CQ", "E" and "BD" were allocated to 14 files (ddbjpat1.seq _ ddbjpat14.seq) in the DDBJ format. Note also that unauthorized use of the patent data may cause legal issues for which we take no responsibility. In the present release, the SOURCE in the flat file was revisited and revised if necessary in accordance with the unified taxonomy database common to the three data banks. The number of ESTs has been increasing at an enormous rate and is expected to be growing even more rapidly in the future. Therefore, EST data were stored in 239 files each of which had the same storage capacity as the file of the HUM division. The present release includes the GSS division. GSS stands for the Genome Survey Sequence, which is similar to EST, except that GSS is genomic DNA whereas EST is cDNA. This division was recorded in 83 files similarly to the HUM division. This release also includes the High Throughput Genomic Sequence (HTGS), which comes mainly from genome project teams which deal with a clone as a sequencing unit. HTGS in this release were recorded in 52 files similarly to the HUM division. The index files are not presented in this release except for ddbjacc.idx, ddbjgen.idx, ddbjjou.idx, and ddbjkey.idx. Instead, we have included a program by which to make the index files not presented in this release. For the use of the program, see the files, seq2indexes.doc, seq2indexes.c, and seq2indexes.h in this release. The present release contains amino acid sequences that were translated from the corresponding nucleotide sequences in our database. In the translation we paid much attention to the fact that some species or organella have a codon different from the universal one, and used the proper codon table. If you find an incorrect codon in a translated sequence, please let us know. The three data banks include the item VERSION in the flat file, which indicates a version of a submitted nucleotide sequence (see Table 1). It is expressed like AB123456.1, in which the digit(s) after the period is a version number. The reason for adding VERSION is that since a released sequence sometimes revised by the submitter, the accession number alone cannot specify the sequence in question causing the user a trouble. The number is increased by one every time when a revised sequence is made public. Accordingly, the translated protein sequence will be accompanied with a /protein_id which is expressed as BAA12345.1, in which the digit(s) after the period is again a version number. The number is increased by one when the corresponding nucleotide sequence is revised and the protein sequence is changed as a result, and when the revised protein sequence is made public. We terminated the RNA division. The RNA data have been redistributed according to the category of the organism. Therefore, you will find a human RNA sequence, for example, in the HUM division. The present release includes a division, CON. The CON division is to show the order of related sequences in a genome, and expressed by join and the accession numbers of the sequences. The contents of the CON division are compiled by the three data banks not by the data submitter. The current number of the entries of this division is 248,865. The entries and bases in the CON division are not counted in the released numbers on the top of the release note. The present release also includes, HTC (High Throughput cDNA). This division is to include unfinished high throughput cDNA sequences, each of which has 5'UTR and 3'UTR at both ends and part of a coding region. The sequence may also include introns. When the sequence becomes finished later, it moves to the corresponding taxonomic division. The sequence is accompanied with a keyword, HTC (High Throughput cDNA), which is dropped when the sequence is finished and moved to a taxonomic division. Since release 51, TPA (Third Party Annotation) data have been available. The entries and bases in TPA are not counted in the released numbers on the top of the release note. Since release 54, '/sequenced_mol' qualifier has been changed to '/mol_type' qualifier. We accordingly completed retrofitting the pertinent entries. This change was made on the agreement at the INSD international collaborative meeting in 2002. /mol_type qualifier Definition: in vivo molecule type Value format: molecule type where molecule type is limited to followings; "genomic DNA", "genomic RNA", "mRNA" (incl. EST), "tRNA", "rRNA", "snoRNA", "snRNA", "scRNA", "pre-mRNA", "other RNA" (incl. synthetic), "other DNA" (incl. synthetic), "unassigned DNA" (incl. unknown), "unassigned RNA" (incl. unknown) The BASE COUNT line of the DDBJ flat file format has been changed since DDBJ release 56, corresponding to the relaxation of the maximum sequence length restriction (350,000 bp/entry) in the entry that had been practised at DDBJ/EMBL/GenBank International Nucleotide Sequence Databases. In the BASE COUNT line of the DDBJ flat file format, 6 digits had been allocated for each number of a, c, g, t and other bases in the sequence. Hereafter, in the new flat file format, 9 digits are allocated for each number of a, c, g and t, while the numbers of other bases are removed. In accordance with the relaxation of sequence length limitation, GenBank had already dropped the BASE COUNT line from their flat file format from GenBank Release 138 (Oct. 2003). We DDBJ have decided to maintain the BASE COUNT line in our flat file format from the view that GC contents are still important information to characterize the sequence. Prior to publication of release 56 in December, 2003, the new DDBJ flat file format is adopted to daily data update from Dec. 3. Following is an example of the new BASE COUNT line. 1 6 11 16 21 26 31 36 41 46 51 56 61 66 71 |----|----|----|----|----|----|----|----|----|----|----|----|----|----| BASE COUNT 123456789 a 123456789 c 123456789 g 123456789 t The three data banks have agreed that the maximum length limitation (350 kb) of a submitted sequence be relaxed. Following the agreement, we are now in preparation for the relaxation. This release is published by the following DDBJ staff. T. Gojobori, Y. Tateno, K. Nishikawa, H. Sugawara, N. Saitou, K. Okubo, K. Ikeo, Y. Suzuki, S. Fukuchi, A. Kinjo, K. Itoh, R. Barrero, T. Abe S. Adachi, H. Aono, M. Ejima, N. Endo, Y. Fujisawa, D. Fukuda, M. Gojobori, Y. Hikino, T. Hirai, N. Hoshi, H. Ichikawa, K. Ichikawa, N. Ishizaka, T. Kato, T. Kawamoto, J. Kohira, Ta. Koike, To. Koike, T. Konno, T. Kosuge, A. Kusakabe, Y. Lin, K. Mamiya, N. Maruyama, J. Mashima, M. Matsuo, K. Mimura, H-J. Min, S. Misu, S. Miyazawa, N. Murakata, S. Nagira, M. Nagura, N. Nishinomiya, T. Okido, K. Sakai, Y. Shigemoto, H. Shiozawa, F. Sugiyama, M. Suzuki, T. Takaki, H. Tsutsui, M. Tsuboi, M. Yamaguchi, Y. Yamamoto, E. Yokoyama Center for Information Biology and DNA Data Bank of Japan National Institute of Genetics Research Organization of Information and Systems Mishima 411-8540, Japan Phone: +81 55 981 6853 FAX: +81 55 981 6849 E-mail: ddbj@ddbj.nig.ac.jp (for general inquiry) ddbjsub@ddbj.nig.ac.jp (for data submission) ddbjupdt@ddbj.nig.ac.jp (for updates and notification of publication) WWW: http://www.ddbj.nig.ac.jp/ (for DDBJ WWW server) http://sakura.ddbj.nig.ac.jp/ (for DDBJ sequence data submission system) Acknowledgement: We are grateful to NCBI and EMBL/EBI for a firm friendship and an excellent collaboration with us. We also thank the Japanese Patent Office for a steady cooperation with us. The operation of DDBJ is supported by the Ministry of Education, Culture, Sports, Science and Technology, and we would gratefully note this here. DDBJ Database Release History Release Date Entries Bases Comments 58 06/04 34,917,581 39,812,635,108 57 03/04 32,693,678 38,008,449,840 56 12/03 30,405,173 36,079,046,032 55 09/03 27,753,140 34,280,225,489 54 06/03 25,149,821 32,162,041,177 53 02/03 23,250,813 29,711,299,332 52 12/02 20,354,812 26,931,456,316 51 09/02 18,401,358 22,782,404,136 TPA started 50 06/02 17,260,693 20,158,357,982 49 04/02 16,503,157 18,579,627,226 48 01/02 15,016,100 16,197,713,855 47 10/01 13,266,610 14,145,671,645 46 07/01 12,313,759 13,037,646,166 45 04/01 11,434,113 12,207,092,905 HTC division started 44 01/01 10,165,597 11,136,298,841 43 10/00 8,666,551 10,034,532,698 42 07/00 7,554,995 8,880,721,093 41 04/00 5,962,608 6,409,581,885 CON division started 40 01/00 5,388,125 4,762,696,173 RNA division terminated 39 10/99 4,810,773 3,728,000,562 NID and PID discarded 38 07/99 4,294,369 3,098,519,597 37 03/99 3,311,627 2,375,261,951 VERSION, /protein_id started 36 01/99 3,073,166 2,190,425,560 35 10/98 2,759,261 1,957,341,169 34 07/98 2,412,785 1,708,580,623 33 04/98 2,174,769 1,479,303,279 32 01/98 1,956,669 1,300,950,613 31 10/97 1,731,532 1,139,869,464 Adoption of the unified taxonomy database 30 07/97 1,534,115 992,788,339 NID and PID terminated 29 04/97 1,270,194 841,415,232 28 01/97 1,154,120 756,785,219 HTG division started ORG division terminated 27 10/96 936,697 608,103,057 GSS division started 26 07/96 835,552 551,932,448 25 04/96 744,490 499,300,364 /translation started 24 01/96 637,508 431,771,652 23 10/95 569,757 390,694,350 22 07/95 437,588 322,982,425 HUM division started 21 04/95 274,596 250,875,023 20 01/95 239,689 231,299,557 19 10/94 204,332 205,274,131 18 07/94 185,230 192,473,021 17 04/94 169,957 179,942,209 16 01/94 154,626 165,017,628 15 10/93 131,649 147,224,690 14 07/93 120,350 138,686,333 13 04/93 112,067 129,784,445 12 01/93 97,683 120,815,244 EST division started 11 07/92 65,693 84,839,075 10 01/92 59,317 77,805,556 GenBank/EMBL inclusion started 9 07/91 1,130 2,002,124 8 01/91 879 1,573,442 7 07/90 681 1,154,211 6 01/90 496 841,236 5 07/89 395 679,378 4 01/89 302 535,985 3 07/88 230 345,850 2 01/88 142 199,392 1 07/87 66 108,970 Started with DDBJ only ------------------------------------------------------------------------ This release covers 20 categories of organisms and others as follows: ------------------------------------------------------------------------------ ddbjbct.*** Category for bacteria ddbjest.*** Category for EST (expressed sequence tag) ddbjcon.*** Category for CON (Contig sequences) ddbjhtc.*** Category for HTC (high throughput cDNA) ddbjhtg.*** Category for HTG (high throughput genomic sequence) ddbjhum.*** Category for human ddbjgss.*** Category for GSS (Genome Survey Sequence) ddbjinv.*** Category for invertebrates ddbjmam.*** Category for mammals other than primates and rodents ddbjpat.*** Category for patents ddbjphg.*** Category for phages ddbjpln.*** Category for plants ddbjpri.*** Category for primates other than human ddbjrod.*** Category for rodents ddbjsts.*** Category for STS (sequence tagged site) ddbjsyn.*** Category for synthetic DNAs ddbjtpa.*** Category for TPA (Third Party Annotation) ddbjuna.*** Category for unannotated sequences ddbjvrl.*** Category for viruses ddbjvrt.*** Category for vertebrates other than mammals ------------------------------------------------------------------------------ Each category then has the following nine files. Note that all the files except for ddbj***.seq are created by the user by use of seq2indexes as mentioned in the release note. ------------------------------------------------------------------------------ ddbj***.seq List of an entry in DDBJ format, see Table 1. ddbj***.acc List of the accession numbers, see Table 2 . ddbj***.aut List of the authors, see Table 3. ddbj***.dir List of the short directory in DDBJ style, see Table 4. ddbj***.idx List of indices, see Table 5. ddbj***.jou List of the journals, see Table 6. ddbj***.key List of the key words, see Table 7. ddbj***.org List of the species names, see Table 8. ddbj***.sdr List of the short directory in DDBJ style, see Table 9. ------------------------------------------------------------------------------ The format of LOCUS line in the flat file was changed as shown below to adjust to the GenBank format from release 51. ------------------------------------------------------------------------------ Old (-rel. 50): LOCUS AB000001 660 bp DNA PLN 01-FEB-2001 Present (rel. 51-): LOCUS AB000001 660 bp DNA linear PLN 01-FEB-2001 New format specification: --------- -------- Positions Contents --------- -------- 01-05 'LOCUS' 06-12 spaces 13-28 Locus name 29-29 space 30-40 Length of sequence, right-justified 41-41 space 42-43 bp 44-44 space 45-47 spaces, ss- (single-stranded), ds- (double-stranded), or ms- (mixed-stranded) 48-53 DNA, RNA, tRNA (transfer RNA), rRNA (ribosomal RNA), mRNA (messenger RNA), uRNA (small nuclear RNA), scRNA, snRNA, snoRNA. Left justified. 54-55 space 56-63 'linear' followed by two spaces, or 'circular' 64-64 space 65-67 The division code 68-68 space 69-79 Date, in the form dd-MMM-yyyy (e.g., 15-MAR-1991) ------------------------------------------------------------------------------ Table 1. Part of the contents in the file 'ddbjbct.seq'. This shows all pieces of information on one entry in DDBJ format. ------------------------------------------------------------------------------ LOCUS D87069 993 bp mRNA linear BCT 14-APR-2000 DEFINITION Escherichia coli mRNA for RNA polymerase sigma subunit, truncated form of sigma-38, complete cds. ACCESSION D87069 VERSION D87069.1 KEYWORDS RNA polymerase sigma subunit, truncated form of sigma-38. SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 993) AUTHORS Jishage,M. TITLE Direct Submission JOURNAL Submitted (14-AUG-1996) to the DDBJ/EMBL/GenBank databases. Miki Jishage, National Institute of Genetics, Molecular Genetics; Yata 1111, Mishima, Shizuoka 411, Japan (E-mail:mjishage@lab.nig.ac.jp, Tel:0559-81-6742, Fax:0559-81-6746) REFERENCE 2 (bases 1 to 993) AUTHORS Jishage,M. and Ishihama,A. TITLE Variation in RNA polymerase sigma subunit composition within different stocks of Escherichia coli starin W3110 JOURNAL Unpublished (1996) REFERENCE 3 AUTHORS Ivanova,A., Renshaw,M., Guntaka,R. and Eisenstark,A. TITLE DNA base sequence variability in katF (putative sigma factor) gene Escherichia coli JOURNAL Nucleic Acids Res. 20, 5479-5480 (1992) REFERENCE 4 AUTHORS Takayanagi,Y., Tanaka,K. and Takahashi,H. TITLE Structure of the 5' upstream region and the regulation of the rpoS gene of Escherichia coli JOURNAL Mol. Gen. Genet. 243, 525-531 (1994) COMMENT FEATURES Location/Qualifiers source 1..993 /mol_type="mRNA" /organism="Escherichia coli" /strain="W3110" CDS 1..810 /note="the gene has four single base changes, resulting in two amino acid substitutions and an amber mutation" /product="RNA polymerase sigma subunit, truncated form of sigma-38" /protein_id="BAA13238.1" /transl_table=11 /translation="MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEYEPSDNDLAEEE LLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLV VKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMN QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNER ITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAK" variation 75 /citation=[3] /replace="t" variation 97 /citation=[3] /replace="t" variation 99 /citation=[3] /replace="t" variation 808 /citation=[3] /replace="t" BASE COUNT 254 a 223 c 291 g 225 t ORIGIN 1 atgagtcaga atacgctgaa agttcatgat ttaaatgaag atgcggaatt tgatgagaac 61 ggagttgagg tttttgacga aaaggcctta gtagaatatg aacccagtga taacgatttg 121 gccgaagagg aactgttatc gcagggagcc acacagcgtg tgttggacgc gactcagctt 181 taccttggtg agattggtta ttcaccactg ttaacggccg aagaagaagt ttattttgcg 241 cgtcgcgcac tgcgtggaga tgtcgcctct cgccgccgga tgatcgagag taacttgcgt 301 ctggtggtaa aaattgcccg ccgttatggc aatcgtggtc tggcgttgct ggaccttatc 361 gaagagggca acctggggct gatccgcgcg gtagagaagt ttgacccgga acgtggtttc 421 cgcttctcaa catacgcaac ctggtggatt cgccagacga ttgaacgggc gattatgaac 481 caaacccgta ctattcgttt gccgattcac atcgtaaagg agctgaacgt ttacctgcga 541 accgcacgtg agttgtccca taagctggac catgaaccaa gtgcggaaga gatcgcagag 601 caactggata agccagttga tgacgtcagc cgtatgcttc gtcttaacga gcgcattacc 661 tcggtagaca ccccgctggg tggtgattcc gaaaaagcgt tgctggacat cctggccgat 721 gaaaaagaga acggtccgga agataccacg caagatgacg atatgaagca gagcatcgtc 781 aaatggctgt tcgagctgaa cgccaaatag cgtgaagtgc tggcacgtcg attcggtttg 841 ctggggtacg aagcggcaac actggaagat gtaggtcgtg aaattggcct cacccgtgaa 901 cgtgttcgcc agattcaggt tgaaggcctg cgccgtttgc gcgaaatcct gcaaacgcag 961 gggctgaata tcgaagcgct gttccgcgag taa // ------------------------------------------------------------------------------ Table 2. Part of the contents in the file 'ddbjbct.acc'. The first column refers to the secondary accession number, second column to the locus name, and third to the primary accession number. The primary number may be the same as the secondary number. They are arranged in the ascending order of the secondary accession numbers. ------------------------------------------------------------------------------ D00001 -> ECOPBPAA X04516 D00002 -> ECOPYRH X04469 D00006 -> PNS981TET D00006 D00020 -> COLE2LYS D00020 D00021 -> COLE31YS D00021 D00038 -> BRLAM330 D00038 D00066 -> BAC139AC D00066 D00067 -> ECONANA M20207 D00069 -> ECOUVRD2 D00069 D00087 -> BACXYNAA D00087 ------------------------------------------------------------------------------ Table 3. Part of the contents in the file 'ddbjbct.aut'. For each author name given on the left to the arrow, the corresponding locus name and primary accession number are respectively listed on the right. They are arranged in the alphabetical order of the author names. ------------------------------------------------------------------------------ Aan,F. -> STYCRR X05210 Aan,F. -> STYENZI M76176 Aaronson,W. -> ECOKPSD M64977 Aaronson,W. -> ECONEUA J05023 Abad-Lapuebla,M.A. -> VIBTDHI D90238 Abdel-Mawgood,A.L. -> CYAPSBHA X16394 Abdel-Meguid,S.S. -> TRNGDRECM J01843 Abdelal,A. -> STYCARA M36540 Abdelal,A. -> STYCARAB X13200 Abdelal,A.H. -> PSENOSA M60717 ------------------------------------------------------------------------------ Table 4. Part of the short directory in DDBJ style in the file 'ddbjbct.dir'. For each locus name given in the first column, the corresponding primary accession number, molecular type, number of nucleotide pairs, and description for the locus are respectively listed. They are arranged in the alphabetical order of the locus names. ------------------------------------------------------------------------------ ABCAARAA M34830 ds-DNA 1624 A.aceti acetic acid resistance protein (aarA) gene, complete cds. ABCADHCC D00635 ds-DNA 4230 A. polyoxogenes alcohol dehydrogenase (EC 1.1.99.8) and cytochrome c genes. ABCALDH D00521 ds-DNA 2683 A.polyoxogenes membrane-bound aldehyde dehydrogenase gene, complete cds and flanks. ABCBCSAA M37202 ds-DNA 9540 A.xylinum bcs B, bcs C and bcs D genes, complete cds and bcs A gene, partial cds. ABCCELA M76548 ds-DNA 1165 Acetobacter xylinum UDP pyrophosphorylase (celA) gene, complete cds. ABCCELSYN X54676 ds-DNA 5363 A. xylinum gene for cellulose biosynthesis ABCIS1380 D10043 ds-DNA 1665 A.pasteurianus insertion sequence IS1380. ACAADH1 D90004 ds-DNA 2467 Acetobacter aceti(K6033) alcohol dehydrogenase subunit gene(adh1). ACCAAC2 M62833 ds-DNA 1123 Acinetobacter baumannii aminoglycoside acetyltr ansferase (aac2) gene, complete cds. ACCACEAA M62822 ds-DNA 1874 A.baumannii chloramphenicol acetyltransferase (cat) gene, complete cds. ------------------------------------------------------------------------------ Table 5. Part of the contents in the file 'ddbjbct.idx'. The first column refers to the locus name, second column to the starting site of the locus in byte, and third to its ending site in byte. They are arranged in the alphabetical order of the locus names. ------------------------------------------------------------------------------ %***************************** #ABCAARAA 0 3211 #ABCADHCC 3212 10608 #ABCALDH 10609 15864 #ABCBCSAA 15865 29583 #ABCCELA 29584 32289 #ABCCELSYN 32290 40960 #ABCIS1380 40961 44711 #ACAADH1 44712 49357 #ACCAAC2 49358 52395 ------------------------------------------------------------------------------ Table 6. Part of the contents in the file 'ddbjbct.jou'. This gives information on the journal in which sequence data were published. ------------------------------------------------------------------------------ (in) Chaloupka,J. and Krumphanzl,V. (Eds.); Extracellular Enzymes of Microorganisms: 129-137, Plenum Press, New York (1987) -> BACAMYABS M57457 (in) Ganesan,A.T., Chang,S. and Hoch,J.A. (Eds.); Molecular Cloning and Gene Regulation in Bacilli: 3-10, Academic Press, New York (1982) -> BACRG16S M55011 (in) Ganesan,A.T., Chang,S. and Hoch,J.A. (Eds.); Molecular Cloning and Gene Regulation in Bacilli: 3-10, Academic Press, New York (1982) -> BACRG16SA M55006 (in) Ganesan,A.T., Chang,S. and Hoch,J.A. (Eds.); Molecular Cloning and Gene Regulation in Bacilli: 3-10, Academic Press, New York (1982) -> BACRG16SB M55008 (in) Hoch,J.A. and Setlow,P. (Eds.); Molecular Biology of Microbial Differentiation: 85-94, American Society for Microbiology, Washington, DC (1985) -> BACSPOII M57606 (in) Holmgren,A. (Ed.); Thioredoxin and Glutaredoxin Systems: Structure and Function: 11-19, Unknown name, Unknown city (1986) -> ECOTRXA1 M54881 (in) Kjeldgaard,N.C. and Maaloe,O. (Eds.); Control of ribosome synthesis: 138-143, Academic Press, New York (1976) -> ECOLAC J01636 (in) Losick,R. and Chamberlin,M. (Eds.); RNA polymerase: 455-472, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1976) -> ECOTGY1 K01197 (in) Sikes,C.S. and Wheeler,A.P. (Eds.); Surface reactive peptides and polymers. Discovery and commercialization.: 186-200, American Chemical Society, Washington, D.C. (1991) -> ECOTGP J01714 (in) Sund,H. and Blauer,G. (Eds.); Protein-Ligand Interactions: 193-207, Walter de Gruyter, New York (1975) -> ECOLAC J01636 (in) Wu,R. and Grossman,L. (Eds.); Methods in Enzymology, Recombinant DNA, part E: In press, Academic Press, New York, N.Y. (1986) -> PLMCG M11320 Acta Microbiol. Pol. 35, 175-190 (1986) -> ECOTGG1 M54893 Actinomycetologica 5, 14-17 (1991) -> STMARGG D00799 Adv. Biophys. 21, 115-133 (1986) -> R10REP M26840 Adv. Biophys. 21, 175-192 (1986) -> ECONUSAA M26839 Adv. Enzyme Regul. 21, 225-237 (1983) -> ECOPURFA M26893 Adv. Exp. Med. Biol. 195, 239-246 (1986) -> ECOAPT M14040 Agric. Biol. Chem. 50, 2155-2158 (1986) -> ECONANA M20207 Agric. Biol. Chem. 50, 2771-2778 (1986) -> BRLAM330 D00038 Agric. Biol. Chem. 51, 2019-2022 (1987) -> BACCGT D00129 Agric. Biol. Chem. 51, 2641-2648 (1987) -> STRSAGP D00219 Agric. Biol. Chem. 51, 2807-2809 (1987) -> BACPGECR M35503 Agric. Biol. Chem. 51, 3133-3135 (1987) -> BACXYLAP D00312 Agric. Biol. Chem. 51, 455-463 (1987) -> BACHDCRY D00117 Agric. Biol. Chem. 51, 953-955 (1987) -> BACXYNAA D00087 Agric. Biol. Chem. 52, 1565-1573 (1988) -> BACIP135 D00348 Agric. Biol. Chem. 52, 1785-1789 (1988) -> BACTMR D00343 Agric. Biol. Chem. 52, 2243-2246 (1988) -> PSEGI D00342 Agric. Biol. Chem. 52, 399-406 (1988) -> BACAMYEB M35517 Agric. Biol. Chem. 52, 479-487 (1988) -> ECAPALI D00217 ------------------------------------------------------------------------------ Table 7. Part of the contents in the file 'ddbjbct.key'. For the locus and accession number respectively given on the right to the arrow, the corresponding key words are listed on the left. ------------------------------------------------------------------------------ A.aceti acetic acid resistance protein (aarA) gene, complete cds. -> ABCAARAA M34830 acetic acid resistance protein. -> ABCAARAA M34830 Cloning of genes responsible for acetic acid resistance in acetobacter aceti -> ABCAARAA M34830 A. polyoxogenes alcohol dehydrogenase (EC 1.1.99.8) and cytochrome c genes. -> ABCADHCC D00635 alcohol dehydrogenase; cytochrome c. -> ABCADHCC D00635 Cloning and sequencing of the gene cluster encoding two subunits of membrane- bound alcohol dehydrogenase from Acetobacter polyoxogenes -> ABCADHCC D00635 These data kindly submitted in computer readable form by: Toshimi Tamaki Nakano Central Biochemical Institute 2-6 Nakamura-cho Handa-shi, Aichi-ken 475 Japan Phone: 0569-21-3331 Fax: 0569-23-8486 -> ABCADHCC D00635 A.polyoxogenes membrane-bound aldehyde dehydrogenase gene, complete cds and flanks. -> ABCALDH D00521 aldehyde dehydrogenase gene; ethanol oxidation; membrane-bound enzyme. -> ABCALDH D00521 Nucleotide sequence of the membrane-bound aldehyde dehydrogenase gene from Acetobacter polyoxogenes -> ABCALDH D00521 ------------------------------------------------------------------------------ Table 8. Part of the contents in the file 'ddbjbct.org'. For the locus and accession number respectively given on the right to the arrow, the corresponding taxonomic names are listed on the left. They are arranged in the alphabetical order of the species names. ------------------------------------------------------------------------------ A. nidulans 6301 DNA. Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> ANIRUBPS X00019 A. nidulans DNA, clone pAN4. Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> ANIRGGX X00343 A. nidulans DNA. Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> ANIRGG X00512 A. polyoxogenes genomic DNA. Acetobacter polyoxogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. - > ABCADHCC D00635 A. quadruplicatum (strain PR-6) DNA, clone pAQPR1. Agmenellum quadruplicatum Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> AQUPCAB K02660 A. quadruplicatum (strain PR6) DNA. Agmenellum quadruplicatum Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> AQUCPCAB K02659 A. vinelandii DNA. Azotobacter vinelandii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. -> AVINIFUSV M17349 A.aceti (strain 10-8) DNA, clone pAR1611. Acetobacter aceti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. -> ABCAARAA M34830 A.actinomycetemcomitans (strain JP2) DNA, clone lambda-OP8. Actinobacillus actinomycetemcomitans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. -> ACNLKTXN M27399 A.anitratum DNA, clone pLJD1. Acinetobacter anitratum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. -> ACCCITSYN M33037 ------------------------------------------------------------------------------ Table 9. Part of the short directory file in DDBJ style in the file 'ddbjbct.sdr'. The short directory file contains brief descriptions of all of the sequence entries contained in the DDBJ style. ------------------------------------------------------------------------------ ABCAARAA A.aceti acetic acid resistance protein (aarA) gene, complete 1624bp ABCADHCC A. polyoxogenes alcohol dehydrogenase (EC 1.1.99.8) and 4230bp ABCALDH A.polyoxogenes membrane-bound aldehyde dehydrogenase gene, 2683bp ABCBCSABCD A.xylinum bcs A, B, C and D genes, complete cds's. 9540bp ABCCELA Acetobacter xylinum UDP pyrophosphorylase (celA) gene, 1165bp ABCCELSYN A. xylinum gene for cellulose biosynthesis 5363bp ABCIS1380 A.pasteurianus insertion sequence IS1380. 1665bp ACAADH1 Acetobacter aceti(K6033) alcohol dehydrogenase subunit 2467bp ACCAAC2 Acinetobacter baumannii aminoglycoside acetyltransferase 1123bp ACCACEAA A.baumannii chloramphenicol acetyltransferase (cat) gene, 1874bp ACCAPHA6 Acinetobacter baumannii aphA-6 gene. 1170bp ACCBENABCA A.calcoaceticus BenA, BenB, BenC, BenD, and BenE proteins 15922bp ACCCAT Acinetobacter calcoaceticus cat operon. 15922bp ACCCATAM A.calcoaceticus catA and catM genes, encoding catechol 1, 5537bp ACCCHMO Acinetobacter sp. cyclohexanone monooxygenase gene, complete 2128bp ACCCITSYN A.anitratum citrate synthase gene, complete cds. 1895bp ------------------------------------------------------------------------------ In addition to the 9 tables the four following index files are included in this release. These files were prepared irrespective of the 10 categories of taxonomic divisions. Accession number index file Keyword phrase index file Journal citation index file Gene name index file A brief description is given for each file in the following. Table 10. Part of the accession number index file in the 'ddbjacc.idx'. The following excerpt from the accession number index file illustrates the format of the index. ------------------------------------------------------------------------------ D00100 PSEASPAA BCT D00100 D00101 RABNP450R MAM D00101 D00102 HUMLTX HUM D00102 D00103 AFARRN5SA BCT D00103 AFRRN5SA BCT X05517 D00104 AFARRN5SB BCT D00104 AFRRN5SB BCT X05518 D00105 AFARRN5S BCT D00105 ASRRN5S BCT X05524 D00106 ACH5SRR BCT D00106 AXRRN5S BCT X05522 AXRRN5SA BCT X05523 D00107 ACH5SRRX BCT D00107 ACRRN5S BCT X05521 ------------------------------------------------------------------------------ Table 11. Part of the keyword phrase index file in the 'ddbjkey.idx'. Keyword phrases consist of names for gene products and other characteristics of sequence entries. ------------------------------------------------------------------------------ A CHANNEL DROCHA INV M17155 A COMPONENT SQLCVEA VRL M38183 A LOCUS GORGOGOA3 PRI X54375 GORGOGOA4 PRI X54376 A LOCUS ALLELE GORA0101 PRI X60258 GORA0201 PRI X60259 GORA0401 PRI X60257 GORA0501 PRI X60256 A MULTI-GENE FAMILY RICGLUTE PLN D00584 A PROTEIN MS2AAR PHG M25187 ST1APCS PHG M25396 A SEQUENCE HS5TOA30 VRL D00148 HS5TOA31 VRL D00147 ------------------------------------------------------------------------------ Table 12. Part of the journal citation index file in 'ddbjjou.idx'. The journal citation index file lists all of the citations that appear in the references. ------------------------------------------------------------------------------ ACTA BIOCHIM. BIOPHYS. SIN. 23, 246-253 (1992) HUMPLASINS HUM M98056 ACTA BIOCHIM. BIOPHYS. SIN. 28, 233-239(1996) TKTII PLN X82230 ACTA BIOCHIM. POL. 24, 301-318 (1977) LUPTRFJ PLN K00345 LUPTRFN PLN K00346 ACTA BIOCHIM. POL. 26, 369-381(1979) HVTRNPHE PLN X02683 ACTA BIOCHIM. POL. 29, 143-149 (1982) EMEMTA PLN M32572 EMEMTB PLN M32573 EMEMTC PLN M32574 EMEMTD PLN M32575 EMEMTE PLN M32576 ACTA BIOCHIM. POL. 34, 21-27 (1987) LUPNOSP PLN M32571 ------------------------------------------------------------------------------ Table 13. Part of the gene name index file in 'ddbjgen.idx'. This file lists all the gene names that appear in the feature table. ------------------------------------------------------------------------------ AACC8 STMAACC8 BCT M55426 AACC9 MPUAACC9 BCT M55427 AACT HUMA1ACM PRI K01500 HUMA1ACMA PRI X00947 HUMA1ACMB PRI M18035 HUMAACT1 PRI M18906 HUMAACT2 PRI M22533 HUMAACTA PRI J05176 AAD INTINTORF BCT L06418 LMOMO229D BCT X17478 AAD A1 ENTAAC3VI BCT M88012 AAD9 ENEAAD9A BCT M69221 AADA LMOMO229A BCT X17479 S52249 BCT S52249 SYNAADA SYN M60473 TRNTAAB BCT M55547 TRNTN21CAS BCT M86913 ------------------------------------------------------------------------------ The files in this release are arranged in the following order with non-labeled format. file name number of entries number of bases file size ddbjrel.txt 66753 (DDBJ release note) ddbjbct1.seq 29697 120053661 299033173 ddbjbct2.seq 6973 130486388 299281528 ddbjbct3.seq 31863 122000911 298999973 ddbjbct4.seq 72000 102333594 299355171 ddbjbct5.seq 5439 138407753 299001270 ddbjbct6.seq 63627 104754149 299469155 ddbjbct7.seq 40768 117970357 299020595 ddbjbct8.seq 166 3989978 8899954 ddbjcon.seq 248865 0 498911158 ddbjest1.seq 90092 33919201 299000961 ddbjest2.seq 94909 38175564 299002718 ddbjest3.seq 95456 37053779 299000508 ddbjest4.seq 89429 28106820 299000560 ddbjest5.seq 94290 36413936 299001554 ddbjest6.seq 98428 39008125 298999953 ddbjest7.seq 98537 38046950 299002713 ddbjest8.seq 97457 37753782 299002153 ddbjest9.seq 98174 38758905 299002778 ddbjest10.seq 99189 38789629 299003488 ddbjest11.seq 97708 39052439 299000430 ddbjest12.seq 96818 43294993 299000350 ddbjest13.seq 105355 42366209 299001347 ddbjest14.seq 101825 41102710 299000878 ddbjest15.seq 97670 40979902 299000989 ddbjest16.seq 94482 42205385 299001727 ddbjest17.seq 96922 39090220 299000890 ddbjest18.seq 97880 42657204 299003077 ddbjest19.seq 96153 41869758 299003056 ddbjest20.seq 95219 39001775 299001067 ddbjest21.seq 108427 52108655 299001375 ddbjest22.seq 131188 56138066 299001024 ddbjest23.seq 91381 58671766 299000310 ddbjest24.seq 104023 74374585 299002338 ddbjest25.seq 120687 61294764 299001482 ddbjest26.seq 124750 62421004 299002767 ddbjest27.seq 122414 61554953 299001751 ddbjest28.seq 125818 57436983 299001405 ddbjest29.seq 93097 26562892 299002108 ddbjest30.seq 94168 25367784 299001321 ddbjest31.seq 74290 22087970 299001238 ddbjest32.seq 60729 16904783 299000834 ddbjest33.seq 60526 15988372 299000371 ddbjest34.seq 117536 51164603 299001865 ddbjest35.seq 114477 55315509 299002570 ddbjest36.seq 101858 50641539 299002755 ddbjest37.seq 121010 61732116 298999966 ddbjest38.seq 117521 57729662 299000365 ddbjest39.seq 90352 37908133 299003037 ddbjest40.seq 93937 41851530 299000049 ddbjest41.seq 90968 38269539 299000816 ddbjest42.seq 105877 42408177 299000650 ddbjest43.seq 91213 36476759 299003021 ddbjest44.seq 83843 37055059 299000199 ddbjest45.seq 97184 45154714 299001581 ddbjest46.seq 97962 40336421 299001156 ddbjest47.seq 95594 33578534 299001414 ddbjest48.seq 100886 45824298 299000426 ddbjest49.seq 61046 16977182 299001487 ddbjest50.seq 60027 18427742 299003037 ddbjest51.seq 60604 18154359 299003969 ddbjest52.seq 60458 19291612 299000247 ddbjest53.seq 60390 18524933 299002136 ddbjest54.seq 60518 18369686 299004053 ddbjest55.seq 61424 18114243 299003703 ddbjest56.seq 61926 19501173 299004291 ddbjest57.seq 61436 19761981 299002384 ddbjest58.seq 61771 21542864 299001583 ddbjest59.seq 57162 33418329 299004500 ddbjest60.seq 54852 22660039 299001940 ddbjest61.seq 53892 24439609 299001612 ddbjest62.seq 54891 22699122 299004274 ddbjest63.seq 73835 32943670 299002173 ddbjest64.seq 94810 37652335 299003034 ddbjest65.seq 94672 39557916 299002355 ddbjest66.seq 98610 56113953 299000816 ddbjest67.seq 98829 54711988 299000480 ddbjest68.seq 99634 48516768 299002264 ddbjest69.seq 92471 52186460 299001280 ddbjest70.seq 93819 43292285 299000183 ddbjest71.seq 91528 51327543 298999991 ddbjest72.seq 99330 58744891 299001110 ddbjest73.seq 85435 42795306 299002395 ddbjest74.seq 93749 48011225 298999992 ddbjest75.seq 94399 57616039 299002536 ddbjest76.seq 89871 56892904 299000396 ddbjest77.seq 94089 48642481 299000628 ddbjest78.seq 89286 38290216 299000033 ddbjest79.seq 83208 44102081 299000906 ddbjest80.seq 90707 51819421 299001304 ddbjest81.seq 91659 51640011 298999994 ddbjest82.seq 96572 41960725 298999924 ddbjest83.seq 96127 35154333 299000287 ddbjest84.seq 93178 50194204 299000022 ddbjest85.seq 89183 48827295 299001411 ddbjest86.seq 102355 53817796 299002528 ddbjest87.seq 93238 61641255 299000437 ddbjest88.seq 87366 53379119 299000356 ddbjest89.seq 92767 61770301 299001249 ddbjest90.seq 92746 56479546 299000184 ddbjest91.seq 96621 58013602 299000208 ddbjest92.seq 92823 61966477 299001111 ddbjest93.seq 94108 60996635 299001490 ddbjest94.seq 101564 50331756 299001379 ddbjest95.seq 100074 41894670 299000368 ddbjest96.seq 99403 56913870 299001070 ddbjest97.seq 93040 54299778 299005044 ddbjest98.seq 87411 43838550 299000339 ddbjest99.seq 86527 49928465 299001882 ddbjest100.seq 86627 47930403 299002447 ddbjest101.seq 92860 54823034 299003026 ddbjest102.seq 88988 55229905 299000121 ddbjest103.seq 92235 52424698 299000796 ddbjest104.seq 88956 54930226 299001039 ddbjest105.seq 80927 46243152 299000178 ddbjest106.seq 119828 65314122 299000986 ddbjest107.seq 87894 49096081 299002013 ddbjest108.seq 128253 67534090 299000772 ddbjest109.seq 129384 70287953 299002388 ddbjest110.seq 116798 66029337 299001009 ddbjest111.seq 97883 56020362 299001421 ddbjest112.seq 117001 72677904 299002310 ddbjest113.seq 89189 51261629 299002447 ddbjest114.seq 84711 38984495 299003427 ddbjest115.seq 71073 33066058 299004111 ddbjest116.seq 81947 40431109 299004112 ddbjest117.seq 76319 39982939 299000684 ddbjest118.seq 98571 65414825 299001591 ddbjest119.seq 87726 57818414 299000985 ddbjest120.seq 92956 52748983 299003376 ddbjest121.seq 79201 39347650 299004040 ddbjest122.seq 85666 49526924 299000580 ddbjest123.seq 82181 54718160 299000723 ddbjest124.seq 94687 52155581 299001372 ddbjest125.seq 110508 54784644 299001225 ddbjest126.seq 102519 60410216 299000221 ddbjest127.seq 104040 55826579 299002303 ddbjest128.seq 87874 56197751 299003453 ddbjest129.seq 77337 44534750 299002186 ddbjest130.seq 88316 53916385 299003892 ddbjest131.seq 94803 35941882 299000235 ddbjest132.seq 91749 56056018 299002894 ddbjest133.seq 94956 47492863 299002767 ddbjest134.seq 88493 50736901 299001654 ddbjest135.seq 89091 64746884 299001489 ddbjest136.seq 87830 46158028 299000987 ddbjest137.seq 88904 64282687 299000240 ddbjest138.seq 87439 54661816 299002990 ddbjest139.seq 92538 51232456 299001003 ddbjest140.seq 90004 75068466 299000850 ddbjest141.seq 82206 60294777 299003267 ddbjest142.seq 81755 60124046 299001483 ddbjest143.seq 83548 59121259 299001319 ddbjest144.seq 85652 64671327 299003760 ddbjest145.seq 78937 50321069 299001171 ddbjest146.seq 82151 44504304 299003247 ddbjest147.seq 85483 48749757 299000580 ddbjest148.seq 114208 66929200 299003541 ddbjest149.seq 90383 57123493 299002382 ddbjest150.seq 130612 82133702 299000771 ddbjest151.seq 135943 81511618 299001277 ddbjest152.seq 133309 81069688 299001000 ddbjest153.seq 118917 77363007 299002477 ddbjest154.seq 81302 44553688 299002380 ddbjest155.seq 89480 73525161 299001931 ddbjest156.seq 97842 73672428 299000747 ddbjest157.seq 99506 49919535 299001632 ddbjest158.seq 105586 72461864 299000517 ddbjest159.seq 83190 56899836 299002380 ddbjest160.seq 63543 26315227 299002625 ddbjest161.seq 56237 20660332 299003465 ddbjest162.seq 58761 20745296 299000469 ddbjest163.seq 56591 20408937 299003049 ddbjest164.seq 56212 23702531 299000780 ddbjest165.seq 57056 21121857 299004622 ddbjest166.seq 58402 21542578 299001382 ddbjest167.seq 58647 23319367 299002223 ddbjest168.seq 55451 24085651 299001562 ddbjest169.seq 55814 23119739 299004344 ddbjest170.seq 56256 23432484 299005171 ddbjest171.seq 56379 22083143 299001440 ddbjest172.seq 54837 31067636 299000219 ddbjest173.seq 59077 31735510 299002633 ddbjest174.seq 127417 50123781 299002101 ddbjest175.seq 94068 57331164 299000311 ddbjest176.seq 89310 57704250 299002559 ddbjest177.seq 89507 56855738 299000518 ddbjest178.seq 95703 55656329 299002466 ddbjest179.seq 85328 57603703 299000026 ddbjest180.seq 82960 46843744 299000710 ddbjest181.seq 108302 52596904 299001772 ddbjest182.seq 115228 56990001 299001415 ddbjest183.seq 89020 48277373 299001821 ddbjest184.seq 85402 42051981 299002194 ddbjest185.seq 91216 54536215 299002343 ddbjest186.seq 90453 42455630 299001408 ddbjest187.seq 95007 59555641 299000876 ddbjest188.seq 82736 46728138 299003461 ddbjest189.seq 93750 46714028 299001454 ddbjest190.seq 107833 60736329 299000807 ddbjest191.seq 102604 71389436 299002726 ddbjest192.seq 125438 65727784 299001728 ddbjest193.seq 113056 56731606 299000556 ddbjest194.seq 93033 57690242 299002699 ddbjest195.seq 95502 48414401 299001019 ddbjest196.seq 96636 42651042 299002576 ddbjest197.seq 88360 49418793 299001022 ddbjest198.seq 78244 48977206 299002043 ddbjest199.seq 94200 56092519 299000313 ddbjest200.seq 81213 53293370 299000961 ddbjest201.seq 99039 50470072 299001502 ddbjest202.seq 100834 52134013 299000770 ddbjest203.seq 113416 67462027 299000893 ddbjest204.seq 147191 63585652 299000252 ddbjest205.seq 107387 50873145 299003020 ddbjest206.seq 87364 49790123 299002398 ddbjest207.seq 99198 56319082 299001404 ddbjest208.seq 94681 53165880 299001495 ddbjest209.seq 91132 52489016 299001921 ddbjest210.seq 79945 48044336 299000465 ddbjest211.seq 66353 37167371 299003147 ddbjest212.seq 86637 50266600 299000212 ddbjest213.seq 92145 57705347 299002838 ddbjest214.seq 85159 50764704 299001074 ddbjest215.seq 91335 62813179 299002453 ddbjest216.seq 80428 61967243 299001364 ddbjest217.seq 80682 52415966 299000598 ddbjest218.seq 87381 53776581 299001409 ddbjest219.seq 103585 59067648 299000272 ddbjest220.seq 88797 48914266 299001712 ddbjest221.seq 74018 37414613 299000616 ddbjest222.seq 83040 49543585 299000702 ddbjest223.seq 104097 54985266 299002298 ddbjest224.seq 90336 57132427 299000369 ddbjest225.seq 92048 54497479 299002328 ddbjest226.seq 87569 56693659 299001542 ddbjest227.seq 111869 66537852 299001866 ddbjest228.seq 105828 57718128 299000354 ddbjest229.seq 78602 47972785 299000765 ddbjest230.seq 96362 49131879 299001737 ddbjest231.seq 63276 33064657 299001949 ddbjest232.seq 85321 49123617 299000287 ddbjest233.seq 112610 61059485 299001748 ddbjest234.seq 101884 34188192 299002140 ddbjest235.seq 92683 34137367 299000619 ddbjest236.seq 94645 33685804 299000856 ddbjest237.seq 100743 34704662 299002856 ddbjest238.seq 92367 36496017 299003101 ddbjest239.seq 69995 26350587 220775757 ddbjgss1.seq 104907 76506584 299000517 ddbjgss2.seq 101582 70413139 299000278 ddbjgss3.seq 119712 50932914 299000682 ddbjgss4.seq 99763 75384982 299000714 ddbjgss5.seq 82163 68623655 299000747 ddbjgss6.seq 80608 71845423 299002395 ddbjgss7.seq 73580 66163340 299000078 ddbjgss8.seq 108582 46433754 299002707 ddbjgss9.seq 111891 45984322 299001799 ddbjgss10.seq 114503 51163068 299002173 ddbjgss11.seq 107744 52859269 299002183 ddbjgss12.seq 100900 53264730 299002035 ddbjgss13.seq 100466 50074289 299001890 ddbjgss14.seq 98746 50312862 299001592 ddbjgss15.seq 94517 47752496 299002343 ddbjgss16.seq 95863 54152382 299002855 ddbjgss17.seq 90222 46768538 299000147 ddbjgss18.seq 96080 50243502 299001906 ddbjgss19.seq 90641 60906820 298999997 ddbjgss20.seq 90502 42855889 299001099 ddbjgss21.seq 98900 55929892 299002480 ddbjgss22.seq 87523 40086607 299001921 ddbjgss23.seq 73547 38074500 299001881 ddbjgss24.seq 74728 34910942 299001041 ddbjgss25.seq 80910 46021993 299000983 ddbjgss26.seq 79208 37017676 299001933 ddbjgss27.seq 81623 49692776 299001027 ddbjgss28.seq 86662 36238449 299001667 ddbjgss29.seq 74678 35047694 299002479 ddbjgss30.seq 90758 43582417 299001947 ddbjgss31.seq 86493 46753254 299000670 ddbjgss32.seq 94631 55519561 299001568 ddbjgss33.seq 96183 52614347 299001890 ddbjgss34.seq 99310 53013218 299000809 ddbjgss35.seq 101454 57170378 299002137 ddbjgss36.seq 119469 79361615 299001506 ddbjgss37.seq 114861 64208115 299001760 ddbjgss38.seq 113036 62211121 299002918 ddbjgss39.seq 108926 44239990 299001308 ddbjgss40.seq 104258 63089543 299001273 ddbjgss41.seq 126455 82923508 299001075 ddbjgss42.seq 96106 45254139 298999944 ddbjgss43.seq 97081 67770607 299001540 ddbjgss44.seq 93076 61364698 299000164 ddbjgss45.seq 98791 57650215 299000497 ddbjgss46.seq 102477 57416230 299001648 ddbjgss47.seq 116138 82423269 299001117 ddbjgss48.seq 105895 74334406 299000574 ddbjgss49.seq 116916 71986395 299001668 ddbjgss50.seq 104348 67224213 299002169 ddbjgss51.seq 96311 53086526 299000578 ddbjgss52.seq 116896 61839134 299001540 ddbjgss53.seq 110778 59169831 299000859 ddbjgss54.seq 106106 77106292 299000150 ddbjgss55.seq 91920 101279667 299000518 ddbjgss56.seq 112596 87278551 299001659 ddbjgss57.seq 98013 64475043 299001106 ddbjgss58.seq 82675 57988867 299003185 ddbjgss59.seq 113406 84493911 299000983 ddbjgss60.seq 109268 69618610 299001748 ddbjgss61.seq 112031 73294467 299001398 ddbjgss62.seq 121936 74567433 299002245 ddbjgss63.seq 127884 69331277 299000822 ddbjgss64.seq 128291 68801207 299000484 ddbjgss65.seq 129821 66806470 299000976 ddbjgss66.seq 129933 66660710 299001493 ddbjgss67.seq 130283 66202736 299000048 ddbjgss68.seq 128835 68092273 299002014 ddbjgss69.seq 117254 83762236 299000232 ddbjgss70.seq 113881 89693025 299001496 ddbjgss71.seq 111771 88475935 299000935 ddbjgss72.seq 112980 82576825 299001067 ddbjgss73.seq 118372 51496262 299001886 ddbjgss74.seq 120117 36972354 299000525 ddbjgss75.seq 108115 65952732 299000866 ddbjgss76.seq 105351 61380525 299002117 ddbjgss77.seq 110136 74856315 299002029 ddbjgss78.seq 99648 97217895 299001966 ddbjgss79.seq 100566 81216654 299001655 ddbjgss80.seq 115242 68796449 299001115 ddbjgss81.seq 100308 78047799 299000296 ddbjgss82.seq 115235 71443161 299002956 ddbjgss83.seq 20938 6819591 66984519 ddbjhtc1.seq 37992 68278358 299005092 ddbjhtc2.seq 47578 84558618 299002190 ddbjhtc3.seq 85995 74074428 248562983 ddbjhtg1.seq 1583 227688957 299086515 ddbjhtg2.seq 3381 224650903 299145341 ddbjhtg3.seq 3089 226146426 299003927 ddbjhtg4.seq 1873 226019784 299065902 ddbjhtg5.seq 1543 224743881 299113395 ddbjhtg6.seq 1514 224807422 299088020 ddbjhtg7.seq 1553 224604597 299099416 ddbjhtg8.seq 1345 227987961 299094156 ddbjhtg9.seq 1796 223243384 299292317 ddbjhtg10.seq 1145 229888584 299280693 ddbjhtg11.seq 896 230167000 299021047 ddbjhtg12.seq 891 230319560 299300952 ddbjhtg13.seq 962 230040255 299132803 ddbjhtg14.seq 914 230213973 299167772 ddbjhtg15.seq 1834 221381125 299049359 ddbjhtg16.seq 1533 224315170 299049054 ddbjhtg17.seq 1338 226436411 299033809 ddbjhtg18.seq 1017 229322728 299048147 ddbjhtg19.seq 973 229694429 299097396 ddbjhtg20.seq 1129 228802579 299057324 ddbjhtg21.seq 1058 229309376 299258368 ddbjhtg22.seq 963 229864174 299315624 ddbjhtg23.seq 1035 229627361 299203718 ddbjhtg24.seq 1003 229663358 299132700 ddbjhtg25.seq 1096 229193423 299310382 ddbjhtg26.seq 1171 228728628 299274730 ddbjhtg27.seq 1118 228926433 299033562 ddbjhtg28.seq 1130 228962807 299330199 ddbjhtg29.seq 1080 229530896 299074931 ddbjhtg30.seq 1096 229081601 299105537 ddbjhtg31.seq 1105 229214491 299092154 ddbjhtg32.seq 1075 229378675 299184621 ddbjhtg33.seq 997 229609898 299114111 ddbjhtg34.seq 1058 229335812 299202131 ddbjhtg35.seq 1165 228744401 299054285 ddbjhtg36.seq 1121 229798729 299326973 ddbjhtg37.seq 1141 229058857 299026864 ddbjhtg38.seq 1381 226858550 299128927 ddbjhtg39.seq 1357 227756761 299052122 ddbjhtg40.seq 1515 227091113 299051200 ddbjhtg41.seq 1331 229017531 299223964 ddbjhtg42.seq 1340 227684636 299169028 ddbjhtg43.seq 1411 228904450 299036455 ddbjhtg44.seq 1486 229791203 299007084 ddbjhtg45.seq 1401 229957351 299016652 ddbjhtg46.seq 1475 229035238 299151762 ddbjhtg47.seq 1261 228468390 299135406 ddbjhtg48.seq 1359 231333727 299139597 ddbjhtg49.seq 1200 232104946 299178009 ddbjhtg50.seq 1224 231064033 299005052 ddbjhtg51.seq 1285 230497738 299097486 ddbjhtg52.seq 422 64974947 84246015 ddbjhum1.seq 13751 192369942 299238430 ddbjhum2.seq 1612 212237397 299080540 ddbjhum3.seq 1576 217542275 299032493 ddbjhum4.seq 1350 206438287 299106598 ddbjhum5.seq 1447 213354794 299012700 ddbjhum6.seq 1463 210804406 299088976 ddbjhum7.seq 1549 204156587 299026585 ddbjhum8.seq 1629 213645825 299081993 ddbjhum9.seq 1508 208414833 299169790 ddbjhum10.seq 1792 209464072 299069417 ddbjhum11.seq 1945 213042633 299087112 ddbjhum12.seq 33098 170255311 299001745 ddbjhum13.seq 69143 102219069 299007631 ddbjhum14.seq 17356 168237850 299284477 ddbjhum15.seq 3613 208572968 299034862 ddbjhum16.seq 2111 216679228 299125768 ddbjhum17.seq 2488 217242121 299082419 ddbjhum18.seq 5143 218069873 299062091 ddbjhum19.seq 1748 222634585 299158574 ddbjhum20.seq 46396 100917107 299005334 ddbjhum21.seq 54416 125189930 299000138 ddbjhum22.seq 33200 59970007 143153324 ddbjinv1.seq 12798 206899000 299129607 ddbjinv2.seq 8911 181098773 299036984 ddbjinv3.seq 91372 89423931 299002344 ddbjinv4.seq 69225 104877952 299000764 ddbjinv5.seq 39400 112994269 261984794 ddbjmam.seq 59692 82715378 221195199 ddbjpat1.seq 255273 89006233 299000330 ddbjpat2.seq 217699 107791381 299002318 ddbjpat3.seq 167004 100563085 298999997 ddbjpat4.seq 129501 131960854 299000818 ddbjpat5.seq 166187 103810768 299001166 ddbjpat6.seq 147367 113680701 299000796 ddbjpat7.seq 183733 69310974 299002191 ddbjpat8.seq 123718 77219888 299001059 ddbjpat9.seq 132325 61397837 299000331 ddbjpat10.seq 174358 60801591 299001021 ddbjpat11.seq 169789 70720327 299001433 ddbjpat12.seq 136247 127707796 299003305 ddbjpat13.seq 160973 108915250 299003305 ddbjpat14.seq 112221 41374251 163064718 ddbjphg.seq 2509 12516768 31623786 ddbjpln1.seq 27019 163737530 299084309 ddbjpln2.seq 18720 188189446 299001974 ddbjpln3.seq 89969 92878618 299000970 ddbjpln4.seq 80924 75056775 299002467 ddbjpln5.seq 34212 84516033 299006389 ddbjpln6.seq 1788 204361604 299067631 ddbjpln7.seq 25376 173398271 299001223 ddbjpln8.seq 85656 86421812 299000662 ddbjpln9.seq 61001 112731499 299000374 ddbjpln10.seq 2580 7091194 16867752 ddbjpri.seq 27858 213124132 328193784 ddbjrod1.seq 7686 206839887 299026097 ddbjrod2.seq 1048 209483868 299219661 ddbjrod3.seq 1145 214144986 299258130 ddbjrod4.seq 1173 215660228 299112902 ddbjrod5.seq 1196 217567712 299199333 ddbjrod6.seq 33993 171751719 299044133 ddbjrod7.seq 1423 231875714 299096024 ddbjrod8.seq 1501 232055714 299122587 ddbjrod9.seq 33549 136001195 299009225 ddbjrod10.seq 42815 133355499 276306189 ddbjsts1.seq 104330 57812992 299001196 ddbjsts2.seq 85229 34062082 299002357 ddbjsts3.seq 104442 40031580 299000814 ddbjsts4.seq 37012 16749991 93539531 ddbjsyn.seq 11608 19899321 52291382 ddbjtpa.seq 4255 14310185 31544536 ddbjuna.seq 1270 556738 3115875 ddbjvrl1.seq 86313 77025254 299000266 ddbjvrl2.seq 87192 77396725 299001800 ddbjvrl3.seq 56781 57613694 202049830 ddbjvrt1.seq 73760 111950695 299121052 ddbjvrt2.seq 34052 173048037 299002679 ddbjvrt3.seq 17419 184206442 299094177 ddbjvrt4.seq 24648 195229870 299013721 ddbjvrt5.seq 13487 12765530 43651993 ddbjacc.idx 0 0 1361210782 (Accession number index file) ddbjgen.idx 0 0 60960637 (Gene name index file) ddbjjou.idx 0 0 1531820385 (Journal citation index file) ddbjkey.idx 0 0 1251624145 (Keyword phrase index file) ------------------------------------------------------- EST: expressed sequence tag CON: Contig sequences GSS: genome survey sequence HTC: high throughput cDNA HTG: high throughput genome sequence STS: sequence tagged site TPA: third party annotation