DDBJ Amino Acid Sequence Database (DAD) Release 1.4 April 21, 1997 This is a test release of DDBJ Amino Acid Sequence Database (DAD). This database was created by extracting all translated sequences from DDBJ entries. The release 1.4 was made from DDBJ release 29 (April 1997). 1. Format of DAD Entries DAD is divided into 16 files according to the organisms from which amino acid sequences are derived. The divisions are the same as those of DDBJ DNA Database. Please refer to the release note of DDBJ (filename: ddbjrel.txt). All the amino acid sequences are stored in a fasta-like format (ODEN format). The first line of an entry has a character '>' followed by an accession number of the entry, its PID (protein ID), and the product name if it is known. Accession number of DAD is comprised of DDBJ accession number and a consecutive integer that begins from 1. These two numbers are combined by a hyphen (-). For example, amino acid sequences extracted from DDBJ entry D12345 have accession numbers D12345-1, D12345-2, etc. An amino acid sequence begins from the next line of accession number. Up to sixty amino acids are written on one line. Following the amino acid sequence, there is a double slash (//), which means the end of the entry. Below is an example of DAD entry. >X65727-1 PID:g825605 glutathione S-transferase MAEKPKLHYSNTRGRMESIRWLLAAAGVEFEEKFIKSAEDLDKLRNDGYLMFQQVPMVEI DGMKLVQTRAILNYIASKYNLYGKDIKEKALIDMYIEGIADLGEMILLLPFTQPEEQDAK LALIQEKTKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPL LKALKTRISNLPTVKKFLQPGSPRKPPMDEKSLEESRKIFRF // 2. Statistics of DAD The following are statistics of this release of DAD. total number of entries 229,541 total length of sequences 70,817,559 aa average length 308.5 aa name of longest sequence X90568-1 PID:g1212992 length of longest sequence 26,926 aa (X90568-1) file entries peptides ------------------------------------- ddbjbct 57745 17222415 ddbjest 963 86525 ddbjhtg 84 30806 ddbjhum 22912 7000754 ddbjinv 25633 9775706 ddbjmam 6763 1872938 ddbjpat 1986 548223 ddbjphg 2572 490410 ddbjpln 34851 13366927 ddbjpri 1833 345364 ddbjrod 21804 6412942 ddbjsts 11 869 ddbjsyn 1775 386587 ddbjuna 954 316490 ddbjvrl 40683 10511018 ddbjvrt 8972 2449585 ------------------------------------- total 229541 70817559