DNA Data Bank of Japan
DNA Database Release 59, September 2004,
including 37,926,117 entries, 42,245,956,937 bases

This database may be copied and redistributed without permission on the condition that all the statements in this release note are reproduced in each copy.

The present release contains the newest data prepared by the DNA Data Bank of Japan (DDBJ), GenBank, and the European Molecular Biology Laboratory/European Bioinformatics Institute (EMBL/EBI) as of August 26, 2004. This unified database was made possible by the international collaboration among the three data banks. All the entries have accordingly been annotated using the feature keys common to them. All the entries designated by accession numbers with the prefixes "C", "D", "E", "AB", "AG", "AK", "AP", "AT", "AU", "AV", "BA", "BB", "BD", "BJ", "BP", "BS", "BW" and "BY" have been collected and processed by DDBJ, and the rest have been prepared by GenBank and EMBL/EBI.

A number of genome projects are under way worldwide. Among them, the human genome projects have probably been the most productive, yielding a large number of ordinary sequences, huge amounts of ESTs and large quantities of genome sequences. We therefore have the human (HUM) division solely for human sequences and the primate (PRI) division for non-human primate sequences. The HUM division in this release is recorded in 22 files, each of which has a storage capacity of 300 MB. The BCT, HTC, INV, PLN, ROD, STS, VRL and VRT divisions are recorded in 8, 5, 6, 11, 11, 4, 3 and 5 files, respectively. Note that the EST division also contains human sequences.

The present release does not have the ORG division. Thus, if you are interested in human mitochondrial sequences, for example, you are advised to refer to the HUM division.

This release includes a division (PAT) for patent data.
The patent data are those which the Japanese Patent Office (JPO), the United States Patent and Trademark Office (USPTO), and the European Patent Office (EPO) collected and processed. The accession numbers of the patent data collected by the Japanese Patent Office start with the prefix "E" or "BD"; those collected and supplied by USPTO and GenBank, respectively, start with "I" and "AR"; and those collected and supplied by EPO and EMBL/EBI, respectively, start with "A", "AX" and "CQ". The entries with the prefixes "I", "AR", "A", "AX", "CQ", "E" and "BD" are allocated to 14 files (ddbjpat1.seq - ddbjpat14.seq) in the DDBJ format. Note also that unauthorized use of the patent data may cause legal issues for which we take no responsibility.

In the present release, the SOURCE line in the flat file was reviewed and, where necessary, revised in accordance with the unified taxonomy database common to the three data banks.

The number of ESTs has been increasing at an enormous rate and is expected to grow even more rapidly in the future. The EST data are therefore stored in 256 files, each of which has the same storage capacity as a file of the HUM division.

The present release includes the GSS division. GSS stands for Genome Survey Sequence, which is similar to EST except that GSS is genomic DNA whereas EST is cDNA. This division is recorded in 93 files in the same manner as the HUM division.

This release also includes the High Throughput Genomic Sequence (HTGS) data, which come mainly from genome project teams that deal with a clone as a sequencing unit. HTGS in this release are recorded in 52 files in the same manner as the HUM division.

The index files are not presented in this release except for ddbjacc.idx, ddbjgen.idx, ddbjjou.idx, and ddbjkey.idx. Instead, we have included a program with which to build the index files not presented in this release. For the use of the program, see the files seq2indexes.doc, seq2indexes.c, and seq2indexes.h in this release.
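The accession-number prefixes listed above lend themselves to a simple lookup. The following Python sketch is only an illustration and is not part of this release: the names DDBJ_PREFIXES, PATENT_PREFIXES, accession_prefix and classify_accession are hypothetical, and the prefix sets are copied from this release note, so they reflect Release 59 only.

```python
import re

# Prefixes copied from this release note (Release 59); later releases may differ.
DDBJ_PREFIXES = {
    "C", "D", "E", "AB", "AG", "AK", "AP", "AT", "AU", "AV",
    "BA", "BB", "BD", "BJ", "BP", "BS", "BW", "BY",
}

# Patent prefixes and the offices / data banks named for them in this release note.
PATENT_PREFIXES = {
    "E": "JPO", "BD": "JPO",
    "I": "USPTO/GenBank", "AR": "USPTO/GenBank",
    "A": "EPO/EMBL-EBI", "AX": "EPO/EMBL-EBI", "CQ": "EPO/EMBL-EBI",
}


def accession_prefix(accession):
    """Return the leading letters of an accession number ('AB' for 'AB123456.1')."""
    match = re.match(r"[A-Za-z]+", accession.strip())
    if match is None:
        raise ValueError(f"no alphabetic prefix in {accession!r}")
    return match.group(0).upper()


def classify_accession(accession):
    """Hypothetical helper: judge an entry's origin by its prefix alone."""
    prefix = accession_prefix(accession)
    return {
        "prefix": prefix,
        "processed_by_ddbj": prefix in DDBJ_PREFIXES,
        "patent_source": PATENT_PREFIXES.get(prefix),  # None if not a patent prefix
    }


if __name__ == "__main__":
    # "BD000001" is a made-up accession number used only to exercise the lookup.
    for acc in ("AB123456.1", "BD000001", "X04516"):
        print(acc, classify_accession(acc))
```

The lookup deliberately ignores the digits and any version suffix, since the release note assigns responsibility by prefix only.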
The present release contains amino acid sequences that were translated from the corresponding nucleotide sequences in our database. In the translation we took care to use the proper codon table, because some species and organelles use a genetic code different from the universal one. If you find an incorrect codon in a translated sequence, please let us know.

The three data banks include the item VERSION in the flat file, which indicates the version of a submitted nucleotide sequence (see Table 1). It is expressed like AB123456.1, in which the digit(s) after the period are the version number. VERSION was added because a released sequence is sometimes revised by the submitter, so that the accession number alone cannot specify the sequence in question, which causes trouble for the user. The number is increased by one every time a revised sequence is made public. Accordingly, the translated protein sequence is accompanied by a /protein_id, which is expressed like BAA12345.1, in which the digit(s) after the period are again a version number. This number is increased by one when the corresponding nucleotide sequence is revised, the protein sequence changes as a result, and the revised protein sequence is made public.

We terminated the RNA division. The RNA data have been redistributed according to the category of the organism. Therefore, you will find a human RNA sequence, for example, in the HUM division.

The present release includes a division, CON. The CON division shows the order of related sequences in a genome, expressed by join and the accession numbers of the sequences. The contents of the CON division are compiled by the three data banks, not by the data submitter. The current number of entries in this division is 249,166. The entries and bases in the CON division are not counted in the release totals given at the top of this release note.

The present release also includes the HTC (High Throughput cDNA) division.
This division holds unfinished high throughput cDNA sequences, each of which has a 5'UTR and a 3'UTR at its ends and part of a coding region. The sequence may also include introns. When the sequence is later finished, it moves to the corresponding taxonomic division. The sequence carries the keyword HTC (High Throughput cDNA), which is dropped when the sequence is finished and moved to a taxonomic division.

Since release 51, TPA (Third Party Annotation) data have been available. The entries and bases in TPA are not counted in the release totals given at the top of this release note.

Since release 54, the '/sequenced_mol' qualifier has been replaced by the '/mol_type' qualifier, and we have accordingly completed retrofitting the pertinent entries. This change was agreed at the INSD international collaborative meeting in 2002.

/mol_type qualifier
  Definition:   in vivo molecule type
  Value format: molecule type, where molecule type is limited to the following:
                "genomic DNA", "genomic RNA", "mRNA" (incl. EST), "tRNA", "rRNA",
                "snoRNA", "snRNA", "scRNA", "pre-mRNA",
                "other RNA" (incl. synthetic), "other DNA" (incl. synthetic),
                "unassigned DNA" (incl. unknown), "unassigned RNA" (incl. unknown)

The BASE COUNT line of the DDBJ flat file format has been changed since DDBJ release 56, following the relaxation of the maximum sequence length restriction (350,000 bp/entry) that had been practised by the DDBJ/EMBL/GenBank International Nucleotide Sequence Databases. Previously, 6 digits were allocated in the BASE COUNT line for each of the numbers of a, c, g, t and other bases in the sequence. In the new flat file format, 9 digits are allocated for each of the numbers of a, c, g and t, while the number of other bases is no longer given. In accordance with the relaxation of the sequence length limit, GenBank had already dropped the BASE COUNT line from its flat file format as of GenBank Release 138 (Oct. 2003).
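As an illustration of the new BASE COUNT layout, the following Python sketch reads the four counts and derives the GC content. It is not part of this release: the function names parse_base_count and gc_content are hypothetical, and the pattern matches on whitespace rather than on fixed column positions, which is an assumption made so that the sketch does not depend on the exact spacing of the line.

```python
import re

# Whitespace-tolerant pattern (an assumption): exact column positions are not used.
BASE_COUNT_RE = re.compile(
    r"^BASE COUNT\s+(\d+)\s+a\s+(\d+)\s+c\s+(\d+)\s+g\s+(\d+)\s+t\s*$"
)


def parse_base_count(line):
    """Return the a, c, g, t counts from a new-style BASE COUNT line."""
    match = BASE_COUNT_RE.match(line)
    if match is None:
        raise ValueError(f"not a new-style BASE COUNT line: {line!r}")
    a, c, g, t = (int(value) for value in match.groups())
    return {"a": a, "c": c, "g": g, "t": t}


def gc_content(counts):
    """Fraction of g and c among the counted a, c, g and t bases."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no bases counted")
    return (counts["g"] + counts["c"]) / total


if __name__ == "__main__":
    # Counts taken from the example entry in Table 1 of this release note.
    counts = parse_base_count("BASE COUNT 254 a 223 c 291 g 225 t")
    print(counts, round(gc_content(counts), 4))
```

For the entry shown in Table 1, the four counts add up to the 993 bases given on its LOCUS line. Because other bases are no longer counted, such a check would hold only for sequences consisting solely of a, c, g and t.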
DDBJ has decided to maintain the BASE COUNT line in its flat file format, on the view that GC content is still important information for characterizing a sequence. Prior to the publication of release 56 in December 2003, the new DDBJ flat file format was adopted for the daily data updates from Dec. 3. The following is an example of the new BASE COUNT line.

1    6    11   16   21   26   31   36   41   46   51   56   61   66   71
|----|----|----|----|----|----|----|----|----|----|----|----|----|----|
BASE COUNT 123456789 a 123456789 c 123456789 g 123456789 t

The three data banks have agreed that the maximum length limitation (350 kb) of a submitted sequence be relaxed. Following the agreement, we are now preparing for the relaxation.

This release is published by the following DDBJ staff.

T. Gojobori, Y. Tateno, K. Nishikawa, H. Sugawara, N. Saitou, K. Okubo, K. Ikeo,
Y. Suzuki, S. Fukuchi, A. Kinjo, K. Itoh, R. Barrero, T. Abe

H. Aono, T. Atsumi, M. Ejima, N. Endo, D. Fukuda, M. Gojobori, Y. Hikino,
T. Hirai, N. Hoshi, H. Ichikawa, K. Ichikawa, N. Ishizaka, T. Kato, T. Kawamoto,
J. Kohira, Ta. Koike, To. Koike, R. Kokubo, T. Konno, T. Kosuge, A. Kusakabe,
Y. Lin, H. Maesako, K. Mamiya, N. Maruyama, J. Mashima, K. Mimura, H-J. Min,
S. Miyamoto, S. Miyazawa, N. Murakata, S. Nagira, M. Nagura, N. Nishinomiya,
T. Okido, K. Sakai, Y. Shigemoto, H. Shiozawa, F. Sugiyama, M. Suzuki, T. Takaki,
H. Tsutsui, M. Tsuboi, M. Yamaguchi, Y. Yamamoto, E. Yokoyama

Center for Information Biology and DNA Data Bank of Japan
National Institute of Genetics
Research Organization of Information and Systems
Mishima 411-8540, Japan

Phone:  +81 55 981 6853
FAX:    +81 55 981 6849
E-mail: ddbj@ddbj.nig.ac.jp (for general inquiry)
        ddbjsub@ddbj.nig.ac.jp (for data submission)
        ddbjupdt@ddbj.nig.ac.jp (for updates and notification of publication)
WWW:    http://www.ddbj.nig.ac.jp/ (for DDBJ WWW server)
        http://sakura.ddbj.nig.ac.jp/ (for DDBJ sequence data submission system)

Acknowledgement:
We are grateful to NCBI and EMBL/EBI for their firm friendship and excellent collaboration with us. We also thank the Japanese Patent Office for its steady cooperation with us. The operation of DDBJ is supported by the Ministry of Education, Culture, Sports, Science and Technology, which we gratefully acknowledge here.

DDBJ Database Release History

Release  Date      Entries           Bases  Comments
     59  09/04  37,926,117  42,245,956,937
     58  06/04  34,917,581  39,812,635,108
     57  03/04  32,693,678  38,008,449,840
     56  12/03  30,405,173  36,079,046,032
     55  09/03  27,753,140  34,280,225,489
     54  06/03  25,149,821  32,162,041,177
     53  02/03  23,250,813  29,711,299,332
     52  12/02  20,354,812  26,931,456,316
     51  09/02  18,401,358  22,782,404,136  TPA started
     50  06/02  17,260,693  20,158,357,982
     49  04/02  16,503,157  18,579,627,226
     48  01/02  15,016,100  16,197,713,855
     47  10/01  13,266,610  14,145,671,645
     46  07/01  12,313,759  13,037,646,166
     45  04/01  11,434,113  12,207,092,905  HTC division started
     44  01/01  10,165,597  11,136,298,841
     43  10/00   8,666,551  10,034,532,698
     42  07/00   7,554,995   8,880,721,093
     41  04/00   5,962,608   6,409,581,885  CON division started
     40  01/00   5,388,125   4,762,696,173  RNA division terminated
     39  10/99   4,810,773   3,728,000,562  NID and PID discarded
     38  07/99   4,294,369   3,098,519,597
     37  03/99   3,311,627   2,375,261,951  VERSION, /protein_id started
     36  01/99   3,073,166   2,190,425,560
     35  10/98   2,759,261   1,957,341,169
     34  07/98   2,412,785   1,708,580,623
     33  04/98   2,174,769   1,479,303,279
     32  01/98   1,956,669   1,300,950,613
     31  10/97   1,731,532   1,139,869,464  Adoption of the unified taxonomy database
     30  07/97   1,534,115     992,788,339  NID and PID terminated
     29  04/97   1,270,194     841,415,232
     28  01/97   1,154,120     756,785,219  HTG division started
                                            ORG division terminated
     27  10/96     936,697     608,103,057  GSS division started
     26  07/96     835,552     551,932,448
     25  04/96     744,490     499,300,364  /translation started
     24  01/96     637,508     431,771,652
     23  10/95     569,757     390,694,350
     22  07/95     437,588     322,982,425  HUM division started
     21  04/95     274,596     250,875,023
     20  01/95     239,689     231,299,557
     19  10/94     204,332     205,274,131
     18  07/94     185,230     192,473,021
     17  04/94     169,957     179,942,209
     16  01/94     154,626     165,017,628
     15  10/93     131,649     147,224,690
     14  07/93     120,350     138,686,333
     13  04/93     112,067     129,784,445
     12  01/93      97,683     120,815,244  EST division started
     11  07/92      65,693      84,839,075
     10  01/92      59,317      77,805,556  GenBank/EMBL inclusion started
      9  07/91       1,130       2,002,124
      8  01/91         879       1,573,442
      7  07/90         681       1,154,211
      6  01/90         496         841,236
      5  07/89         395         679,378
      4  01/89         302         535,985
      3  07/88         230         345,850
      2  01/88         142         199,392
      1  07/87          66         108,970  Started with DDBJ only
------------------------------------------------------------------------

This release covers 20 categories of organisms and others as follows:
------------------------------------------------------------------------------
ddbjbct.***  Category for bacteria
ddbjest.***  Category for EST (expressed sequence tag)
ddbjcon.***  Category for CON (Contig sequences)
ddbjhtc.***  Category for HTC (high throughput cDNA)
ddbjhtg.***  Category for HTG (high throughput genomic sequence)
ddbjhum.***  Category for human
ddbjgss.***  Category for GSS (Genome Survey Sequence)
ddbjinv.***  Category for invertebrates
ddbjmam.***  Category for mammals other than primates and rodents
ddbjpat.***  Category for patents
ddbjphg.***  Category for phages
ddbjpln.***  Category for plants
ddbjpri.***  Category for primates other than human
ddbjrod.***  Category for rodents
ddbjsts.***  Category for STS (sequence tagged site)
ddbjsyn.***  Category for synthetic DNAs
ddbjtpa.***  Category for TPA (Third Party Annotation)
ddbjuna.***  Category for unannotated sequences
ddbjvrl.***  Category for viruses
ddbjvrt.***  Category for vertebrates other than mammals
------------------------------------------------------------------------------

Each category then has the following nine files. Note that all the files except for ddbj***.seq are to be created by the user with seq2indexes, as mentioned in this release note.
------------------------------------------------------------------------------
ddbj***.seq  List of entries in DDBJ format, see Table 1.
ddbj***.acc  List of the accession numbers, see Table 2.
ddbj***.aut  List of the authors, see Table 3.
ddbj***.dir  List of the short directory in DDBJ style, see Table 4.
ddbj***.idx  List of indices, see Table 5.
ddbj***.jou  List of the journals, see Table 6.
ddbj***.key  List of the key words, see Table 7.
ddbj***.org  List of the species names, see Table 8.
ddbj***.sdr  List of the short directory in DDBJ style, see Table 9.
------------------------------------------------------------------------------

From release 51, the format of the LOCUS line in the flat file was changed as shown below to conform to the GenBank format.
------------------------------------------------------------------------------
Old (-rel. 50):
LOCUS AB000001 660 bp DNA PLN 01-FEB-2001

Present (rel. 51-):
LOCUS AB000001 660 bp DNA linear PLN 01-FEB-2001

New format specification:
---------   --------
Positions   Contents
---------   --------
01-05       'LOCUS'
06-12       spaces
13-28       Locus name
29-29       space
30-40       Length of sequence, right-justified
41-41       space
42-43       bp
44-44       space
45-47       spaces, ss- (single-stranded), ds- (double-stranded), or
            ms- (mixed-stranded)
48-53       DNA, RNA, tRNA (transfer RNA), rRNA (ribosomal RNA),
            mRNA (messenger RNA), uRNA (small nuclear RNA), scRNA, snRNA,
            snoRNA. Left justified.
54-55       space
56-63       'linear' followed by two spaces, or 'circular'
64-64       space
65-67       The division code
68-68       space
69-79       Date, in the form dd-MMM-yyyy (e.g., 15-MAR-1991)
------------------------------------------------------------------------------

Table 1. Part of the contents in the file 'ddbjbct.seq'. This shows all pieces of information on one entry in DDBJ format.
------------------------------------------------------------------------------
LOCUS       D87069                   993 bp    mRNA    linear   BCT 14-APR-2000
DEFINITION  Escherichia coli mRNA for RNA polymerase sigma subunit, truncated
            form of sigma-38, complete cds.
ACCESSION   D87069
VERSION     D87069.1
KEYWORDS    RNA polymerase sigma subunit, truncated form of sigma-38.
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 1 to 993)
  AUTHORS   Jishage,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-AUG-1996) to the DDBJ/EMBL/GenBank databases.
            Miki Jishage, National Institute of Genetics, Molecular Genetics;
            Yata 1111, Mishima, Shizuoka 411, Japan
            (E-mail:mjishage@lab.nig.ac.jp, Tel:0559-81-6742, Fax:0559-81-6746)
REFERENCE   2  (bases 1 to 993)
  AUTHORS   Jishage,M. and Ishihama,A.
  TITLE     Variation in RNA polymerase sigma subunit composition within
            different stocks of Escherichia coli starin W3110
  JOURNAL   Unpublished (1996)
REFERENCE   3
  AUTHORS   Ivanova,A., Renshaw,M., Guntaka,R. and Eisenstark,A.
  TITLE     DNA base sequence variability in katF (putative sigma factor) gene
            Escherichia coli
  JOURNAL   Nucleic Acids Res. 20, 5479-5480 (1992)
REFERENCE   4
  AUTHORS   Takayanagi,Y., Tanaka,K. and Takahashi,H.
  TITLE     Structure of the 5' upstream region and the regulation of the rpoS
            gene of Escherichia coli
  JOURNAL   Mol. Gen. Genet. 243, 525-531 (1994)
COMMENT
FEATURES             Location/Qualifiers
     source          1..993
                     /mol_type="mRNA"
                     /organism="Escherichia coli"
                     /strain="W3110"
     CDS             1..810
                     /note="the gene has four single base changes, resulting in
                     two amino acid substitutions and an amber mutation"
                     /product="RNA polymerase sigma subunit, truncated form of
                     sigma-38"
                     /protein_id="BAA13238.1"
                     /transl_table=11
                     /translation="MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEYEPSDNDLAEEE
                     LLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLV
                     VKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMN
                     QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNER
                     ITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAK"
     variation       75
                     /citation=[3]
                     /replace="t"
     variation       97
                     /citation=[3]
                     /replace="t"
     variation       99
                     /citation=[3]
                     /replace="t"
     variation       808
                     /citation=[3]
                     /replace="t"
BASE COUNT 254 a 223 c 291 g 225 t
ORIGIN
        1 atgagtcaga atacgctgaa agttcatgat ttaaatgaag atgcggaatt tgatgagaac
       61 ggagttgagg tttttgacga aaaggcctta gtagaatatg aacccagtga taacgatttg
      121 gccgaagagg aactgttatc gcagggagcc acacagcgtg tgttggacgc gactcagctt
      181 taccttggtg agattggtta ttcaccactg ttaacggccg aagaagaagt ttattttgcg
      241 cgtcgcgcac tgcgtggaga tgtcgcctct cgccgccgga tgatcgagag taacttgcgt
      301 ctggtggtaa aaattgcccg ccgttatggc aatcgtggtc tggcgttgct ggaccttatc
      361 gaagagggca acctggggct gatccgcgcg gtagagaagt ttgacccgga acgtggtttc
      421 cgcttctcaa catacgcaac ctggtggatt cgccagacga ttgaacgggc gattatgaac
      481 caaacccgta ctattcgttt gccgattcac atcgtaaagg agctgaacgt ttacctgcga
      541 accgcacgtg agttgtccca taagctggac catgaaccaa gtgcggaaga gatcgcagag
      601 caactggata agccagttga tgacgtcagc cgtatgcttc gtcttaacga gcgcattacc
      661 tcggtagaca ccccgctggg tggtgattcc gaaaaagcgt tgctggacat cctggccgat
      721 gaaaaagaga acggtccgga agataccacg caagatgacg atatgaagca gagcatcgtc
      781 aaatggctgt tcgagctgaa cgccaaatag cgtgaagtgc tggcacgtcg attcggtttg
      841 ctggggtacg aagcggcaac actggaagat gtaggtcgtg aaattggcct cacccgtgaa
      901 cgtgttcgcc agattcaggt tgaaggcctg cgccgtttgc gcgaaatcct gcaaacgcag
      961 gggctgaata tcgaagcgct gttccgcgag taa
//
------------------------------------------------------------------------------

Table 2. Part of the contents in the file 'ddbjbct.acc'. The first column refers to the secondary accession number, the second column to the locus name, and the third to the primary accession number. The primary number may be the same as the secondary number. The entries are arranged in ascending order of the secondary accession numbers.
------------------------------------------------------------------------------
D00001 -> ECOPBPAA   X04516
D00002 -> ECOPYRH    X04469
D00006 -> PNS981TET  D00006
D00020 -> COLE2LYS   D00020
D00021 -> COLE31YS   D00021
D00038 -> BRLAM330   D00038
D00066 -> BAC139AC   D00066
D00067 -> ECONANA    M20207
D00069 -> ECOUVRD2   D00069
D00087 -> BACXYNAA   D00087
------------------------------------------------------------------------------

Table 3. Part of the contents in the file 'ddbjbct.aut'. For each author name given to the left of the arrow, the corresponding locus name and primary accession number are listed, in that order, on the right. The entries are arranged in alphabetical order of the author names.
------------------------------------------------------------------------------
Aan,F.              -> STYCRR     X05210
Aan,F.              -> STYENZI    M76176
Aaronson,W.         -> ECOKPSD    M64977
Aaronson,W.         -> ECONEUA    J05023
Abad-Lapuebla,M.A.  -> VIBTDHI    D90238
Abdel-Mawgood,A.L.  -> CYAPSBHA   X16394
Abdel-Meguid,S.S.   -> TRNGDRECM  J01843
Abdelal,A.          -> STYCARA    M36540
Abdelal,A.          -> STYCARAB   X13200
Abdelal,A.H.        -> PSENOSA    M60717
------------------------------------------------------------------------------

Table 4. Part of the short directory in DDBJ style in the file 'ddbjbct.dir'. For each locus name given in the first column, the corresponding primary accession number, molecular type, number of nucleotide pairs, and description of the locus are listed in that order. The entries are arranged in alphabetical order of the locus names.
------------------------------------------------------------------------------
ABCAARAA    M34830  ds-DNA   1624  A.aceti acetic acid resistance protein (aarA) gene, complete cds.
ABCADHCC    D00635  ds-DNA   4230  A. polyoxogenes alcohol dehydrogenase (EC 1.1.99.8) and cytochrome c genes.
ABCALDH     D00521  ds-DNA   2683  A.polyoxogenes membrane-bound aldehyde dehydrogenase gene, complete cds and flanks.
ABCBCSAA    M37202  ds-DNA   9540  A.xylinum bcs B, bcs C and bcs D genes, complete cds and bcs A gene, partial cds.
ABCCELA     M76548  ds-DNA   1165  Acetobacter xylinum UDP pyrophosphorylase (celA) gene, complete cds.
ABCCELSYN   X54676  ds-DNA   5363  A. xylinum gene for cellulose biosynthesis
ABCIS1380   D10043  ds-DNA   1665  A.pasteurianus insertion sequence IS1380.
ACAADH1     D90004  ds-DNA   2467  Acetobacter aceti(K6033) alcohol dehydrogenase subunit gene(adh1).
ACCAAC2     M62833  ds-DNA   1123  Acinetobacter baumannii aminoglycoside acetyltransferase (aac2) gene, complete cds.
ACCACEAA    M62822  ds-DNA   1874  A.baumannii chloramphenicol acetyltransferase (cat) gene, complete cds.
------------------------------------------------------------------------------

Table 5. Part of the contents in the file 'ddbjbct.idx'. The first column refers to the locus name, the second column to the starting position of the locus in bytes, and the third to its ending position in bytes. The entries are arranged in alphabetical order of the locus names.
------------------------------------------------------------------------------
%*****************************
#ABCAARAA        0    3211
#ABCADHCC     3212   10608
#ABCALDH     10609   15864
#ABCBCSAA    15865   29583
#ABCCELA     29584   32289
#ABCCELSYN   32290   40960
#ABCIS1380   40961   44711
#ACAADH1     44712   49357
#ACCAAC2     49358   52395
------------------------------------------------------------------------------

Table 6. Part of the contents in the file 'ddbjbct.jou'. This gives information on the journal in which sequence data were published.
------------------------------------------------------------------------------
(in) Chaloupka,J. and Krumphanzl,V. (Eds.); Extracellular Enzymes of Microorganisms: 129-137, Plenum Press, New York (1987) -> BACAMYABS M57457
(in) Ganesan,A.T., Chang,S. and Hoch,J.A. (Eds.); Molecular Cloning and Gene Regulation in Bacilli: 3-10, Academic Press, New York (1982) -> BACRG16S M55011
(in) Ganesan,A.T., Chang,S. and Hoch,J.A. (Eds.); Molecular Cloning and Gene Regulation in Bacilli: 3-10, Academic Press, New York (1982) -> BACRG16SA M55006
(in) Ganesan,A.T., Chang,S. and Hoch,J.A. (Eds.); Molecular Cloning and Gene Regulation in Bacilli: 3-10, Academic Press, New York (1982) -> BACRG16SB M55008
(in) Hoch,J.A. and Setlow,P. (Eds.); Molecular Biology of Microbial Differentiation: 85-94, American Society for Microbiology, Washington, DC (1985) -> BACSPOII M57606
(in) Holmgren,A. (Ed.); Thioredoxin and Glutaredoxin Systems: Structure and Function: 11-19, Unknown name, Unknown city (1986) -> ECOTRXA1 M54881
(in) Kjeldgaard,N.C. and Maaloe,O. (Eds.); Control of ribosome synthesis: 138-143, Academic Press, New York (1976) -> ECOLAC J01636
(in) Losick,R. and Chamberlin,M. (Eds.); RNA polymerase: 455-472, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1976) -> ECOTGY1 K01197
(in) Sikes,C.S. and Wheeler,A.P. (Eds.); Surface reactive peptides and polymers. Discovery and commercialization.: 186-200, American Chemical Society, Washington, D.C. (1991) -> ECOTGP J01714
(in) Sund,H. and Blauer,G. (Eds.); Protein-Ligand Interactions: 193-207, Walter de Gruyter, New York (1975) -> ECOLAC J01636
(in) Wu,R. and Grossman,L. (Eds.); Methods in Enzymology, Recombinant DNA, part E: In press, Academic Press, New York, N.Y. (1986) -> PLMCG M11320
Acta Microbiol. Pol. 35, 175-190 (1986) -> ECOTGG1 M54893
Actinomycetologica 5, 14-17 (1991) -> STMARGG D00799
Adv. Biophys. 21, 115-133 (1986) -> R10REP M26840
Adv. Biophys. 21, 175-192 (1986) -> ECONUSAA M26839
Adv. Enzyme Regul. 21, 225-237 (1983) -> ECOPURFA M26893
Adv. Exp. Med. Biol. 195, 239-246 (1986) -> ECOAPT M14040
Agric. Biol. Chem. 50, 2155-2158 (1986) -> ECONANA M20207
Agric. Biol. Chem. 50, 2771-2778 (1986) -> BRLAM330 D00038
Agric. Biol. Chem. 51, 2019-2022 (1987) -> BACCGT D00129
Agric. Biol. Chem. 51, 2641-2648 (1987) -> STRSAGP D00219
Agric. Biol. Chem. 51, 2807-2809 (1987) -> BACPGECR M35503
Agric. Biol. Chem. 51, 3133-3135 (1987) -> BACXYLAP D00312
Agric. Biol. Chem. 51, 455-463 (1987) -> BACHDCRY D00117
Agric. Biol. Chem. 51, 953-955 (1987) -> BACXYNAA D00087
Agric. Biol. Chem. 52, 1565-1573 (1988) -> BACIP135 D00348
Agric. Biol. Chem. 52, 1785-1789 (1988) -> BACTMR D00343
Agric. Biol. Chem. 52, 2243-2246 (1988) -> PSEGI D00342
Agric. Biol. Chem. 52, 399-406 (1988) -> BACAMYEB M35517
Agric. Biol. Chem. 52, 479-487 (1988) -> ECAPALI D00217
------------------------------------------------------------------------------

Table 7. Part of the contents in the file 'ddbjbct.key'. For the locus name and accession number given to the right of the arrow, the corresponding key words are listed on the left.
------------------------------------------------------------------------------
A.aceti acetic acid resistance protein (aarA) gene, complete cds. -> ABCAARAA M34830
acetic acid resistance protein. -> ABCAARAA M34830
Cloning of genes responsible for acetic acid resistance in acetobacter aceti -> ABCAARAA M34830
A. polyoxogenes alcohol dehydrogenase (EC 1.1.99.8) and cytochrome c genes. -> ABCADHCC D00635
alcohol dehydrogenase; cytochrome c. -> ABCADHCC D00635
Cloning and sequencing of the gene cluster encoding two subunits of membrane-bound alcohol dehydrogenase from Acetobacter polyoxogenes -> ABCADHCC D00635
These data kindly submitted in computer readable form by: Toshimi Tamaki Nakano Central Biochemical Institute 2-6 Nakamura-cho Handa-shi, Aichi-ken 475 Japan Phone: 0569-21-3331 Fax: 0569-23-8486 -> ABCADHCC D00635
A.polyoxogenes membrane-bound aldehyde dehydrogenase gene, complete cds and flanks. -> ABCALDH D00521
aldehyde dehydrogenase gene; ethanol oxidation; membrane-bound enzyme. -> ABCALDH D00521
Nucleotide sequence of the membrane-bound aldehyde dehydrogenase gene from Acetobacter polyoxogenes -> ABCALDH D00521
------------------------------------------------------------------------------

Table 8. Part of the contents in the file 'ddbjbct.org'. For the locus name and accession number given to the right of the arrow, the corresponding taxonomic names are listed on the left. The entries are arranged in alphabetical order of the species names.
------------------------------------------------------------------------------
A. nidulans 6301 DNA. Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> ANIRUBPS X00019
A. nidulans DNA, clone pAN4. Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> ANIRGGX X00343
A. nidulans DNA. Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> ANIRGG X00512
A. polyoxogenes genomic DNA. Acetobacter polyoxogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. -> ABCADHCC D00635
A. quadruplicatum (strain PR-6) DNA, clone pAQPR1. Agmenellum quadruplicatum Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> AQUPCAB K02660
A. quadruplicatum (strain PR6) DNA. Agmenellum quadruplicatum Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. -> AQUCPCAB K02659
A. vinelandii DNA. Azotobacter vinelandii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. -> AVINIFUSV M17349
A.aceti (strain 10-8) DNA, clone pAR1611. Acetobacter aceti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. -> ABCAARAA M34830
A.actinomycetemcomitans (strain JP2) DNA, clone lambda-OP8. Actinobacillus actinomycetemcomitans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. -> ACNLKTXN M27399
A.anitratum DNA, clone pLJD1. Acinetobacter anitratum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. -> ACCCITSYN M33037
------------------------------------------------------------------------------

Table 9. Part of the short directory file in DDBJ style in the file 'ddbjbct.sdr'. The short directory file contains brief descriptions of all of the sequence entries in DDBJ style.
------------------------------------------------------------------------------
ABCAARAA    A.aceti acetic acid resistance protein (aarA) gene, complete   1624bp
ABCADHCC    A. polyoxogenes alcohol dehydrogenase (EC 1.1.99.8) and        4230bp
ABCALDH     A.polyoxogenes membrane-bound aldehyde dehydrogenase gene,     2683bp
ABCBCSABCD  A.xylinum bcs A, B, C and D genes, complete cds's.             9540bp
ABCCELA     Acetobacter xylinum UDP pyrophosphorylase (celA) gene,         1165bp
ABCCELSYN   A. xylinum gene for cellulose biosynthesis                     5363bp
ABCIS1380   A.pasteurianus insertion sequence IS1380.                      1665bp
ACAADH1     Acetobacter aceti(K6033) alcohol dehydrogenase subunit         2467bp
ACCAAC2     Acinetobacter baumannii aminoglycoside acetyltransferase       1123bp
ACCACEAA    A.baumannii chloramphenicol acetyltransferase (cat) gene,      1874bp
ACCAPHA6    Acinetobacter baumannii aphA-6 gene.                           1170bp
ACCBENABCA  A.calcoaceticus BenA, BenB, BenC, BenD, and BenE proteins     15922bp
ACCCAT      Acinetobacter calcoaceticus cat operon.                       15922bp
ACCCATAM    A.calcoaceticus catA and catM genes, encoding catechol 1,      5537bp
ACCCHMO     Acinetobacter sp. cyclohexanone monooxygenase gene, complete   2128bp
ACCCITSYN   A.anitratum citrate synthase gene, complete cds.               1895bp
------------------------------------------------------------------------------

In addition to the files shown in the nine tables above, the following four index files are included in this release.
These files were prepared for the BCT, EST, GSS, HTC, HTG, HUM, INV, MAM, PAT, PHG, PLN, PRI, ROD, STS, SYN, UNA, VRL and VRT divisions.

    Accession number index file
    Keyword phrase index file
    Journal citation index file
    Gene name index file

A brief description is given for each file in the following.

Table 10. Part of the accession number index file 'ddbjacc.idx'. The following excerpt from the accession number index file illustrates the format of the index.
------------------------------------------------------------------------------
D00100
        PSEASPAA    BCT  D00100
D00101
        RABNP450R   MAM  D00101
D00102
        HUMLTX      HUM  D00102
D00103
        AFARRN5SA   BCT  D00103
        AFRRN5SA    BCT  X05517
D00104
        AFARRN5SB   BCT  D00104
        AFRRN5SB    BCT  X05518
D00105
        AFARRN5S    BCT  D00105
        ASRRN5S     BCT  X05524
D00106
        ACH5SRR     BCT  D00106
        AXRRN5S     BCT  X05522
        AXRRN5SA    BCT  X05523
D00107
        ACH5SRRX    BCT  D00107
        ACRRN5S     BCT  X05521
------------------------------------------------------------------------------

Table 11. Part of the keyword phrase index file 'ddbjkey.idx'. Keyword phrases consist of names for gene products and other characteristics of sequence entries.
------------------------------------------------------------------------------
A CHANNEL
        DROCHA      INV  M17155
A COMPONENT
        SQLCVEA     VRL  M38183
A LOCUS
        GORGOGOA3   PRI  X54375
        GORGOGOA4   PRI  X54376
A LOCUS ALLELE
        GORA0101    PRI  X60258
        GORA0201    PRI  X60259
        GORA0401    PRI  X60257
        GORA0501    PRI  X60256
A MULTI-GENE FAMILY
        RICGLUTE    PLN  D00584
A PROTEIN
        MS2AAR      PHG  M25187
        ST1APCS     PHG  M25396
A SEQUENCE
        HS5TOA30    VRL  D00148
        HS5TOA31    VRL  D00147
------------------------------------------------------------------------------

Table 12. Part of the journal citation index file 'ddbjjou.idx'. The journal citation index file lists all of the citations that appear in the references.
------------------------------------------------------------------------------
ACTA BIOCHIM. BIOPHYS. SIN. 23, 246-253 (1992)
        HUMPLASINS  HUM  M98056
ACTA BIOCHIM. BIOPHYS. SIN. 28, 233-239(1996)
        TKTII       PLN  X82230
ACTA BIOCHIM. POL. 24, 301-318 (1977)
        LUPTRFJ     PLN  K00345
        LUPTRFN     PLN  K00346
ACTA BIOCHIM. POL. 26, 369-381(1979)
        HVTRNPHE    PLN  X02683
ACTA BIOCHIM. POL. 29, 143-149 (1982)
        EMEMTA      PLN  M32572
        EMEMTB      PLN  M32573
        EMEMTC      PLN  M32574
        EMEMTD      PLN  M32575
        EMEMTE      PLN  M32576
ACTA BIOCHIM. POL. 34, 21-27 (1987)
        LUPNOSP     PLN  M32571
------------------------------------------------------------------------------

Table 13. Part of the gene name index file 'ddbjgen.idx'. This file lists all the gene names that appear in the feature table.
------------------------------------------------------------------------------
AACC8
        STMAACC8    BCT  M55426
AACC9
        MPUAACC9    BCT  M55427
AACT
        HUMA1ACM    PRI  K01500
        HUMA1ACMA   PRI  X00947
        HUMA1ACMB   PRI  M18035
        HUMAACT1    PRI  M18906
        HUMAACT2    PRI  M22533
        HUMAACTA    PRI  J05176
AAD
        INTINTORF   BCT  L06418
        LMOMO229D   BCT  X17478
AAD A1
        ENTAAC3VI   BCT  M88012
AAD9
        ENEAAD9A    BCT  M69221
AADA
        LMOMO229A   BCT  X17479
        S52249      BCT  S52249
        SYNAADA     SYN  M60473
        TRNTAAB     BCT  M55547
        TRNTN21CAS  BCT  M86913
------------------------------------------------------------------------------

The files in this release are arranged in the following order with non-labeled format.
file name number of entries number of bases file size ------------------------------------------------------------------------------ ddbjrel.txt (DDBJ release note) 68767 ddbjacc.idx (Accession number index file) 1478543803 ddbjgen.idx (Gene name index file) 63746360 ddbjjou.idx (Journal citation index file) 1620239416 ddbjkey.idx (Keyword phrase index file) 1398767053 ddbjbct1.seq 30770 120298107 299017288 ddbjbct2.seq 6856 130959266 299153573 ddbjbct3.seq 23437 125278784 299041636 ddbjbct4.seq 79098 101023969 299001878 ddbjbct5.seq 3613 136352623 299219679 ddbjbct6.seq 81359 100034180 299207872 ddbjbct7.seq 14021 127134305 299003095 ddbjbct8.seq 27269 60947852 161496317 ddbjcon.seq 249166 0 501585635 ddbjest1.seq 90092 33919201 299000961 ddbjest2.seq 94909 38175564 299002718 ddbjest3.seq 95456 37053779 299000508 ddbjest4.seq 89429 28106820 299000560 ddbjest5.seq 94290 36413936 299001554 ddbjest6.seq 98428 39008125 298999953 ddbjest7.seq 98537 38046950 299002713 ddbjest8.seq 97457 37753782 299001979 ddbjest9.seq 98174 38758905 299002778 ddbjest10.seq 99189 38789629 299003488 ddbjest11.seq 97711 39077740 299000695 ddbjest12.seq 96826 43282987 299001098 ddbjest13.seq 105361 42374156 299002526 ddbjest14.seq 101813 41098665 299001483 ddbjest15.seq 97438 40872846 299000483 ddbjest16.seq 94339 42129341 299002062 ddbjest17.seq 96981 39077889 299000810 ddbjest18.seq 97820 42617810 299000698 ddbjest19.seq 96065 41950717 299003220 ddbjest20.seq 95116 38943977 299003484 ddbjest21.seq 107205 51804475 299001220 ddbjest22.seq 131531 56161548 299001813 ddbjest23.seq 104244 53482641 299001020 ddbjest24.seq 97077 78823826 299002522 ddbjest25.seq 120271 64518173 299001365 ddbjest26.seq 118438 61464065 299002199 ddbjest27.seq 126868 60597814 299000434 ddbjest28.seq 123613 59779391 299001506 ddbjest29.seq 103333 36734549 299001935 ddbjest30.seq 92674 24547957 299000735 ddbjest31.seq 85538 25316152 299001816 ddbjest32.seq 60584 16404641 299003668 ddbjest33.seq 60616 16610417 299003592 
ddbjest34.seq 98047 39390337 299000544 ddbjest35.seq 118024 54289468 299001057 ddbjest36.seq 108596 56076704 299002470 ddbjest37.seq 108573 52068829 299000884 ddbjest38.seq 128134 65940089 299004310 ddbjest39.seq 96254 41459719 299002825 ddbjest40.seq 88833 39156401 299000450 ddbjest41.seq 91861 39562775 299003902 ddbjest42.seq 100737 40541852 299003864 ddbjest43.seq 94423 37268606 299001969 ddbjest44.seq 83098 37128390 299001123 ddbjest45.seq 95281 42199939 299001170 ddbjest46.seq 96951 43752081 299000176 ddbjest47.seq 96475 35140053 299002224 ddbjest48.seq 108735 48062890 299001223 ddbjest49.seq 65587 19763312 299002828 ddbjest50.seq 60366 17502972 299004592 ddbjest51.seq 60472 18606915 299003318 ddbjest52.seq 60450 18910662 299003996 ddbjest53.seq 60488 18898350 299001297 ddbjest54.seq 60421 18091934 299002959 ddbjest55.seq 61079 18627316 299003390 ddbjest56.seq 61807 18824105 299003100 ddbjest57.seq 61699 19487660 299001713 ddbjest58.seq 62464 17522636 299001814 ddbjest59.seq 58182 34793539 299004795 ddbjest60.seq 55077 24890099 299001047 ddbjest61.seq 54679 24166283 299001425 ddbjest62.seq 54172 22473493 299004337 ddbjest63.seq 60503 24579735 299001912 ddbjest64.seq 93306 39479817 299002820 ddbjest65.seq 95743 38671768 299003176 ddbjest66.seq 97153 54940452 299000877 ddbjest67.seq 98933 52831602 299002126 ddbjest68.seq 97833 47297531 298999969 ddbjest69.seq 94248 52756410 299000344 ddbjest70.seq 94509 45236921 299002587 ddbjest71.seq 95770 56058309 299002854 ddbjest72.seq 93305 47447786 299001945 ddbjest73.seq 89489 53147416 299002192 ddbjest74.seq 89933 44303772 299002154 ddbjest75.seq 95383 57668395 299002547 ddbjest76.seq 90433 54287825 299003775 ddbjest77.seq 94302 54360565 299002835 ddbjest78.seq 91578 38662408 299002558 ddbjest79.seq 85055 44953046 299000798 ddbjest80.seq 84196 46038878 299000810 ddbjest81.seq 91904 57103569 299001110 ddbjest82.seq 97167 42781092 299001460 ddbjest83.seq 95823 34477348 299002843 ddbjest84.seq 95115 42795251 299003070 
ddbjest85.seq 85469 43938741 299003079 ddbjest86.seq 101812 61416526 299000771 ddbjest87.seq 98755 59699456 299003141 ddbjest88.seq 86977 51915883 299000024 ddbjest89.seq 93146 62990057 299001920 ddbjest90.seq 89329 55851882 299000963 ddbjest91.seq 95409 54112236 299000938 ddbjest92.seq 94475 63420752 299001776 ddbjest93.seq 93660 63068329 299000111 ddbjest94.seq 99931 53201338 299001801 ddbjest95.seq 97397 36780569 299000997 ddbjest96.seq 103777 61184595 299001580 ddbjest97.seq 96250 57314803 299000761 ddbjest98.seq 83991 41203246 299001257 ddbjest99.seq 88211 48454941 299001629 ddbjest100.seq 84772 46385079 299002726 ddbjest101.seq 91752 55566161 299002671 ddbjest102.seq 91682 57073528 299003498 ddbjest103.seq 86050 49795321 299001512 ddbjest104.seq 95526 55621281 299001391 ddbjest105.seq 79817 48515252 299000040 ddbjest106.seq 107353 58177431 299002411 ddbjest107.seq 105944 60186810 299003089 ddbjest108.seq 108166 56313103 299001649 ddbjest109.seq 130637 69637846 299000299 ddbjest110.seq 120226 66984068 299002187 ddbjest111.seq 95753 56787052 299001234 ddbjest112.seq 117636 69493008 299001681 ddbjest113.seq 100049 61033637 299000949 ddbjest114.seq 85831 41416317 299003415 ddbjest115.seq 77459 35188854 299001454 ddbjest116.seq 78759 38252050 299001618 ddbjest117.seq 76119 38462029 299007026 ddbjest118.seq 92294 63360486 299001676 ddbjest119.seq 86843 56276201 299000375 ddbjest120.seq 101354 57831879 299004096 ddbjest121.seq 76415 39163741 299001821 ddbjest122.seq 83948 49617015 299000474 ddbjest123.seq 84125 43754190 299003616 ddbjest124.seq 80280 54951717 299003254 ddbjest125.seq 118468 53928602 299002301 ddbjest126.seq 100784 60870851 299002147 ddbjest127.seq 94780 44634244 299002624 ddbjest128.seq 97001 46332569 299002227 ddbjest129.seq 103922 53865242 299000382 ddbjest130.seq 87986 56182526 299002714 ddbjest131.seq 77709 44714741 299001710 ddbjest132.seq 88659 53904674 298999947 ddbjest133.seq 96196 37032066 299001471 ddbjest134.seq 91989 55924043 299000509 
ddbjest135.seq 93583 47151621 299001723 ddbjest136.seq 88571 51604993 299002336 ddbjest137.seq 89682 63301458 299001860 ddbjest138.seq 86855 45499846 299002598 ddbjest139.seq 88951 66590291 299001194 ddbjest140.seq 87487 52050390 299001303 ddbjest141.seq 92255 52923675 299002359 ddbjest142.seq 89834 74318630 299003154 ddbjest143.seq 82165 60436827 299003317 ddbjest144.seq 81659 60334800 299003024 ddbjest145.seq 83617 59057755 299002205 ddbjest146.seq 84780 63896887 299002394 ddbjest147.seq 79326 50372467 299001133 ddbjest148.seq 82093 44493661 299003894 ddbjest149.seq 87619 50107097 299001087 ddbjest150.seq 112995 67297288 299002501 ddbjest151.seq 91314 56400223 299001507 ddbjest152.seq 131919 82909491 299000342 ddbjest153.seq 136010 81446287 299001532 ddbjest154.seq 132815 81377826 299001452 ddbjest155.seq 133928 78151052 299001311 ddbjest156.seq 108923 55526744 299001753 ddbjest157.seq 80760 52624699 299002749 ddbjest158.seq 92666 80276118 299002741 ddbjest159.seq 100870 56297105 299001437 ddbjest160.seq 102267 60687655 299001926 ddbjest161.seq 95418 67833056 299003413 ddbjest162.seq 73959 40593818 299001244 ddbjest163.seq 58312 21887136 299001332 ddbjest164.seq 57328 20188362 299001420 ddbjest165.seq 57061 20627255 299004275 ddbjest166.seq 56464 22162921 299002721 ddbjest167.seq 56274 22978734 299000858 ddbjest168.seq 58364 19922687 299003691 ddbjest169.seq 58683 23112448 299004650 ddbjest170.seq 56439 24286800 299005200 ddbjest171.seq 55791 22972866 299002438 ddbjest172.seq 55897 23863417 299002547 ddbjest173.seq 56520 22653968 299001643 ddbjest174.seq 56152 23571775 299002903 ddbjest175.seq 53002 36342567 299004850 ddbjest176.seq 104889 37427940 299002036 ddbjest177.seq 105888 56058660 298999929 ddbjest178.seq 87516 56847942 299001487 ddbjest179.seq 88215 56439365 299000865 ddbjest180.seq 83899 51421884 299001154 ddbjest181.seq 96289 55778763 299001579 ddbjest182.seq 84467 60502218 299001345 ddbjest183.seq 81396 39794098 299000899 ddbjest184.seq 130413 
60804827 299000668 ddbjest185.seq 96688 53238530 299001918 ddbjest186.seq 83432 40701727 299001335 ddbjest187.seq 92247 53629938 299001738 ddbjest188.seq 93473 44443079 299002536 ddbjest189.seq 89790 48174259 299001597 ddbjest190.seq 86932 51303820 299001401 ddbjest191.seq 86823 48169092 299001203 ddbjest192.seq 107118 55889866 299001815 ddbjest193.seq 100904 65958729 299001738 ddbjest194.seq 106262 69093792 299000643 ddbjest195.seq 133739 60700584 299002236 ddbjest196.seq 99900 55862762 299002119 ddbjest197.seq 93229 54706420 299001869 ddbjest198.seq 93732 46704110 299000764 ddbjest199.seq 93858 41981555 299003285 ddbjest200.seq 83294 49214351 299004029 ddbjest201.seq 83159 52893753 299000478 ddbjest202.seq 88701 51089843 299002691 ddbjest203.seq 78288 50714048 299002791 ddbjest204.seq 103621 54254964 299002945 ddbjest205.seq 103174 56619372 299000703 ddbjest206.seq 120631 70351516 299001534 ddbjest207.seq 150199 61650804 299000725 ddbjest208.seq 98029 49266126 299002886 ddbjest209.seq 85838 49438150 299001491 ddbjest210.seq 102264 55170144 299002740 ddbjest211.seq 90298 52400765 299003718 ddbjest212.seq 90818 52799657 299002287 ddbjest213.seq 81537 49200085 299000678 ddbjest214.seq 67838 39137389 299000388 ddbjest215.seq 91341 60054791 299003379 ddbjest216.seq 96566 58643007 299002526 ddbjest217.seq 85579 47374784 299001893 ddbjest218.seq 93029 58109031 299004270 ddbjest219.seq 84107 68504461 299001441 ddbjest220.seq 79782 59890022 299001897 ddbjest221.seq 79121 43054066 298999927 ddbjest222.seq 94935 65898770 299002756 ddbjest223.seq 98315 49059356 299004546 ddbjest224.seq 86950 48859657 299003584 ddbjest225.seq 79724 40302593 299003217 ddbjest226.seq 81934 53300520 299002898 ddbjest227.seq 105526 54676420 299001920 ddbjest228.seq 88059 54365952 299002787 ddbjest229.seq 84481 48593257 299000409 ddbjest230.seq 96900 62197366 299000029 ddbjest231.seq 111774 64253066 299001874 ddbjest232.seq 99309 54003582 299001377 ddbjest233.seq 77844 46925968 299002620 
ddbjest234.seq 92109 47601607 299003637 ddbjest235.seq 61583 32132630 299000093 ddbjest236.seq 89273 52344749 299001238 ddbjest237.seq 121717 58340680 299000577 ddbjest238.seq 91255 60732947 299000183 ddbjest239.seq 98802 63964363 299001163 ddbjest240.seq 81260 45830567 299001971 ddbjest241.seq 106932 48215152 299000434 ddbjest242.seq 80413 49920243 299000326 ddbjest243.seq 78517 46508082 299000311 ddbjest244.seq 118479 69722950 299001730 ddbjest245.seq 96428 57413625 299000848 ddbjest246.seq 77908 51054266 299001125 ddbjest247.seq 45497 21351892 299000384 ddbjest248.seq 105498 62850332 299000580 ddbjest249.seq 102707 57429687 299002041 ddbjest250.seq 108242 50707377 299000666 ddbjest251.seq 99135 33425791 299000815 ddbjest252.seq 92899 34236038 299000423 ddbjest253.seq 94647 33762984 299002088 ddbjest254.seq 100827 34606483 299002273 ddbjest255.seq 92289 36692783 299002333 ddbjest256.seq 66231 24774822 208359629 ddbjgss1.seq 104916 76488705 299001094 ddbjgss2.seq 101576 70427648 299001710 ddbjgss3.seq 108404 61085518 299000316 ddbjgss4.seq 85351 74220888 299004032 ddbjgss5.seq 87027 71024913 299001403 ddbjgss6.seq 87351 70440504 299002381 ddbjgss7.seq 100323 60359928 299000981 ddbjgss8.seq 97327 75719136 299001554 ddbjgss9.seq 82076 68801152 299000545 ddbjgss10.seq 80234 72626524 299002742 ddbjgss11.seq 77211 63206033 299000977 ddbjgss12.seq 105365 43366292 299000365 ddbjgss13.seq 115602 48772626 299000501 ddbjgss14.seq 113844 51704785 299002383 ddbjgss15.seq 107384 53332723 299001581 ddbjgss16.seq 99790 52546396 299000170 ddbjgss17.seq 99747 49864773 299001264 ddbjgss18.seq 98928 50412661 299000407 ddbjgss19.seq 94644 47685342 299002736 ddbjgss20.seq 96478 54584734 299000078 ddbjgss21.seq 90193 47353992 299002687 ddbjgss22.seq 95376 49367630 299000591 ddbjgss23.seq 91012 60087239 299002024 ddbjgss24.seq 91469 44160640 299002216 ddbjgss25.seq 97410 54819907 299000366 ddbjgss26.seq 85662 39540731 299002780 ddbjgss27.seq 73592 37961508 299000054 ddbjgss28.seq 74956 
34288482 299001779 ddbjgss29.seq 82427 47429957 299002579 ddbjgss30.seq 77471 36204061 299003864 ddbjgss31.seq 84245 50683019 299002394 ddbjgss32.seq 84267 34652575 299002945 ddbjgss33.seq 76327 36485883 299000095 ddbjgss34.seq 88753 43107208 299000737 ddbjgss35.seq 88848 48119735 299000081 ddbjgss36.seq 93106 57479221 299000619 ddbjgss37.seq 99132 51519698 299000246 ddbjgss38.seq 98122 52011385 299000070 ddbjgss39.seq 103591 60031553 299000161 ddbjgss40.seq 119460 79371993 299002022 ddbjgss41.seq 114967 63971449 299001854 ddbjgss42.seq 110963 58538833 299002129 ddbjgss43.seq 110266 42440975 299000980 ddbjgss44.seq 106071 68885419 299001381 ddbjgss45.seq 121696 76917747 299000614 ddbjgss46.seq 109275 54979656 299000973 ddbjgss47.seq 99093 70155339 299002853 ddbjgss48.seq 93026 61458215 299000680 ddbjgss49.seq 98309 59306045 299001260 ddbjgss50.seq 100844 54655849 299000964 ddbjgss51.seq 113434 78993076 299002267 ddbjgss52.seq 104827 76008444 299001894 ddbjgss53.seq 119108 76246824 299000608 ddbjgss54.seq 107602 67663661 299000466 ddbjgss55.seq 93231 48938412 299000543 ddbjgss56.seq 116021 66461313 299001437 ddbjgss57.seq 110863 54000326 299000161 ddbjgss58.seq 109708 72341207 299000012 ddbjgss59.seq 92883 100390340 299002684 ddbjgss60.seq 107295 90526842 299001772 ddbjgss61.seq 105126 70220803 299003200 ddbjgss62.seq 82807 57662822 299002483 ddbjgss63.seq 106800 79262119 299000118 ddbjgss64.seq 110617 74089234 299001112 ddbjgss65.seq 112005 69473181 299001575 ddbjgss66.seq 117161 78053205 299001407 ddbjgss67.seq 128664 68313544 298999941 ddbjgss68.seq 128168 68962440 299001912 ddbjgss69.seq 129615 67075016 299001328 ddbjgss70.seq 130025 66541046 299001674 ddbjgss71.seq 130121 66416078 299002090 ddbjgss72.seq 129334 67441747 299002218 ddbjgss73.seq 119850 79967332 299001991 ddbjgss74.seq 113889 89680378 299000771 ddbjgss75.seq 112206 88673754 299000076 ddbjgss76.seq 112032 86459170 299000026 ddbjgss77.seq 117248 58249119 299000559 ddbjgss78.seq 120492 36353076 
299000890 ddbjgss79.seq 110301 59170489 299002743 ddbjgss80.seq 104375 60244760 299001184 ddbjgss81.seq 112978 80053442 299002842 ddbjgss82.seq 99138 87176290 299000846 ddbjgss83.seq 101067 88588485 299002764 ddbjgss84.seq 112813 67957124 299000103 ddbjgss85.seq 98380 53997681 299001002 ddbjgss86.seq 98959 76068532 298999974 ddbjgss87.seq 105621 65709450 299001305 ddbjgss88.seq 107258 76285470 299000147 ddbjgss89.seq 107764 73363651 299000563 ddbjgss90.seq 131753 82612321 299000768 ddbjgss91.seq 140542 88828616 299001724 ddbjgss92.seq 124381 71496484 299001053 ddbjgss93.seq 30551 15331122 67920712 ddbjhtc1.seq 37992 68278358 299005092 ddbjhtc2.seq 49896 81438301 299002844 ddbjhtc3.seq 96437 93657713 299004055 ddbjhtc4.seq 99217 114571777 299002653 ddbjhtc5.seq 36237 37070902 100009180 ddbjhtg1.seq 1583 227595888 299076232 ddbjhtg2.seq 3381 224539924 299211340 ddbjhtg3.seq 3091 226056317 299063109 ddbjhtg4.seq 1870 225997753 299157377 ddbjhtg5.seq 1543 224638032 299082670 ddbjhtg6.seq 1518 224675353 299073093 ddbjhtg7.seq 1556 224447724 299024823 ddbjhtg8.seq 1351 227811530 299042046 ddbjhtg9.seq 1762 223190050 299096062 ddbjhtg10.seq 1150 229656739 299101526 ddbjhtg11.seq 899 230205462 299139743 ddbjhtg12.seq 885 230256326 299274999 ddbjhtg13.seq 969 229887901 299004149 ddbjhtg14.seq 921 230165756 299159830 ddbjhtg15.seq 2014 219156028 299081626 ddbjhtg16.seq 1359 226422922 299237043 ddbjhtg17.seq 1388 226260570 299246220 ddbjhtg18.seq 962 229661654 299148710 ddbjhtg19.seq 1052 229065202 299053485 ddbjhtg20.seq 1063 229423498 299303578 ddbjhtg21.seq 1071 228834196 299013515 ddbjhtg22.seq 923 229914656 299015240 ddbjhtg23.seq 1029 229497322 299126560 ddbjhtg24.seq 1055 229542921 299309400 ddbjhtg25.seq 1147 228515987 299047022 ddbjhtg26.seq 1118 228809320 299228231 ddbjhtg27.seq 1123 228971435 299195443 ddbjhtg28.seq 1115 228915762 299246723 ddbjhtg29.seq 1048 229617930 299117061 ddbjhtg30.seq 1190 228396900 299027057 ddbjhtg31.seq 1045 229294540 299066050 
ddbjhtg32.seq 1026 229498101 299157110 ddbjhtg33.seq 1025 229351682 299099742 ddbjhtg34.seq 1113 228921951 299169431 ddbjhtg35.seq 1126 229443462 299052892 ddbjhtg36.seq 1151 228657122 299022343 ddbjhtg37.seq 1310 227602099 299055188 ddbjhtg38.seq 1374 226838959 299070860 ddbjhtg39.seq 1555 227209570 299157191 ddbjhtg40.seq 1349 228498794 299169313 ddbjhtg41.seq 1344 227460056 299134412 ddbjhtg42.seq 1413 228801169 299060804 ddbjhtg43.seq 1481 229664001 299152644 ddbjhtg44.seq 1414 229744725 299081867 ddbjhtg45.seq 1493 229572394 299030285 ddbjhtg46.seq 1466 228751581 299124436 ddbjhtg47.seq 1275 228911369 299236539 ddbjhtg48.seq 1340 231074725 299106857 ddbjhtg49.seq 1183 231872213 299113572 ddbjhtg50.seq 1258 230792398 299020991 ddbjhtg51.seq 1268 230998969 299166693 ddbjhtg52.seq 244 30429851 39533619 ddbjhum1.seq 13972 191816278 299026377 ddbjhum2.seq 1609 211905457 299026546 ddbjhum3.seq 1578 217576106 299014582 ddbjhum4.seq 1350 206444257 299024738 ddbjhum5.seq 1444 213270716 299083078 ddbjhum6.seq 1464 210738950 299042151 ddbjhum7.seq 1550 204159956 299205810 ddbjhum8.seq 1625 213692272 299097401 ddbjhum9.seq 1511 208259100 299196489 ddbjhum10.seq 1787 209456105 299192566 ddbjhum11.seq 1949 213025501 299007094 ddbjhum12.seq 32671 170306562 299001292 ddbjhum13.seq 69326 103375237 298999922 ddbjhum14.seq 17808 166318574 299111693 ddbjhum15.seq 3367 208766948 299087827 ddbjhum16.seq 2080 216689218 299216529 ddbjhum17.seq 2436 217215200 299056964 ddbjhum18.seq 4582 218990281 299092381 ddbjhum19.seq 1746 222630887 299146423 ddbjhum20.seq 47125 102620645 299002527 ddbjhum21.seq 52726 123036426 299002924 ddbjhum22.seq 38591 71389473 169907789 ddbjinv1.seq 13390 205985158 299190266 ddbjinv2.seq 7611 183015240 299000168 ddbjinv3.seq 91006 89428985 299001109 ddbjinv4.seq 70787 104262034 299001306 ddbjinv5.seq 53207 124087078 299014769 ddbjinv6.seq 790 4467618 8527259 ddbjmam.seq 62849 98156694 248071598 ddbjpat1.seq 255273 89006233 299000330 ddbjpat2.seq 217405 
107881248 299000727 ddbjpat3.seq 166751 100363848 299001148 ddbjpat4.seq 129516 131963558 299001487 ddbjpat5.seq 166160 103865016 299002587 ddbjpat6.seq 148216 113093306 299000597 ddbjpat7.seq 182767 69304423 299000886 ddbjpat8.seq 123815 76730268 299000093 ddbjpat9.seq 132682 61380223 299001168 ddbjpat10.seq 155221 63534851 299000929 ddbjpat11.seq 171711 62148303 299001285 ddbjpat12.seq 167955 102805290 299009215 ddbjpat13.seq 131431 132482251 299000011 ddbjpat14.seq 177128 90473164 292478515 ddbjphg.seq 2696 12907481 32776504 ddbjpln1.seq 28174 161914846 299077203 ddbjpln2.seq 3191 209828542 299000576 ddbjpln3.seq 89838 93206789 299000723 ddbjpln4.seq 89759 84344024 299000634 ddbjpln5.seq 38356 63325618 299044520 ddbjpln6.seq 8373 190577636 299045075 ddbjpln7.seq 1590 206085209 299012357 ddbjpln8.seq 70524 96585243 299001668 ddbjpln9.seq 70076 106951660 299555093 ddbjpln10.seq 51636 126599174 299032366 ddbjpln11.seq 318 1451983 3262543 ddbjpri.seq 29445 230647357 353701063 ddbjrod1.seq 8536 206239648 299083709 ddbjrod2.seq 1057 208498554 299239542 ddbjrod3.seq 1121 211157225 299062717 ddbjrod4.seq 1156 215424414 299019011 ddbjrod5.seq 1193 216471276 299218002 ddbjrod6.seq 1209 218267535 299020237 ddbjrod7.seq 34760 170927685 299009543 ddbjrod8.seq 1446 231685336 299030804 ddbjrod9.seq 1501 232082041 299053703 ddbjrod10.seq 36885 129042369 299008269 ddbjrod11.seq 42931 138100573 281327898 ddbjsts1.seq 104352 57804302 299002192 ddbjsts2.seq 85310 34128396 299000870 ddbjsts3.seq 118283 43911405 299001519 ddbjsts4.seq 67510 30160637 187913700 ddbjsyn.seq 14300 21997737 62795279 ddbjtpa.seq 4358 14572841 32179547 ddbjuna.seq 1271 557857 3118489 ddbjvrl1.seq 86595 76704535 299000043 ddbjvrl2.seq 87044 77279014 299001170 ddbjvrl3.seq 70155 70041461 248909342 ddbjvrt1.seq 75105 112087075 299084754 ddbjvrt2.seq 28135 179540703 299000817 ddbjvrt3.seq 37840 137365108 299055301 ddbjvrt4.seq 1498 232108736 299190813 ddbjvrt5.seq 39607 14963915 271288955 
----------------------------------------------------------------------
EST: expressed sequence tag
CON: contig sequences
GSS: genome survey sequence
HTC: high throughput cDNA
HTG: high throughput genome sequence
STS: sequence tagged site
TPA: third party annotation
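
The file list above can be checked against the per-division file counts quoted earlier in this note (for example, 5 files for HTC). The following Python sketch is one possible way to do so; it is not part of the release. The function name summarize, the regular expression, and the sample text are illustrative assumptions. It assumes each sequence file row has the form "ddbj<division><number>.seq entries bases size" and it skips rows for the release note and the index files, which list only a file size.

```python
import re
from collections import defaultdict

# One row of the file list: file name, number of entries, number of bases,
# file size. The serial number after the division name is optional
# (e.g. ddbjmam.seq has none).
ROW = re.compile(r"ddbj([a-z]+)(\d*)\.seq\s+(\d+)\s+(\d+)\s+(\d+)")

# A few rows copied from the list above; a real run would read the full list.
SAMPLE = """
ddbjgen.idx (Gene name index file) 63746360
ddbjhtc1.seq 37992 68278358 299005092
ddbjhtc2.seq 49896 81438301 299002844
ddbjhtc3.seq 96437 93657713 299004055
ddbjhtc4.seq 99217 114571777 299002653
ddbjhtc5.seq 36237 37070902 100009180
ddbjmam.seq 62849 98156694 248071598
"""


def summarize(listing):
    """Return per-division totals: file count, entries, bases, file size."""
    totals = defaultdict(lambda: {"files": 0, "entries": 0, "bases": 0, "size": 0})
    for division, _serial, entries, bases, size in ROW.findall(listing):
        t = totals[division.upper()]
        t["files"] += 1
        t["entries"] += int(entries)
        t["bases"] += int(bases)
        t["size"] += int(size)
    return dict(totals)


if __name__ == "__main__":
    for division, t in sorted(summarize(SAMPLE).items()):
        print(division, t["files"], t["entries"], t["bases"], t["size"])
```

Because the pattern matches on whitespace rather than on fixed columns, it should work whether the list is read one row per line or as continuous text.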