DDBJ Amino Acid Sequence Database (DAD) Release 1.3 February 7, 1997 This is a test release of DDBJ Amino Acid Sequence Database (DAD). This database was created by extracting all translated sequences from DDBJ entries. The release 1.3 was made from DDBJ release 28 (Jan 1997). 1. Format of DAD Entries DAD is divided into 16 files according to the organisms from which amino acid sequences are derived. The divisions are the same as those of DDBJ DNA Database. Please refer to the release note of DDBJ (filename: ddbjrel.txt). All the amino acid sequences are stored in a fasta-like format (ODEN format). The first line of an entry has a character '>' followed by an accession number of the entry, its PID (protein ID), and the product name if it is known. Accession number of DAD is comprised of DDBJ accession number and a consecutive integer that begins from 1. These two numbers are combined by a hyphen (-). For example, amino acid sequences extracted from DDBJ entry D12345 have accession numbers D12345-1, D12345-2, etc. An amino acid sequence begins from the next line of accession number. Up to sixty amino acids are written on one line. Following the amino acid sequence, there is a double slash (//), which means the end of the entry. Below is an example of DAD entry. >X65727-1 PID:g825605 glutathione S-transferase MAEKPKLHYSNTRGRMESIRWLLAAAGVEFEEKFIKSAEDLDKLRNDGYLMFQQVPMVEI DGMKLVQTRAILNYIASKYNLYGKDIKEKALIDMYIEGIADLGEMILLLPFTQPEEQDAK LALIQEKTKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPL LKALKTRISNLPTVKKFLQPGSPRKPPMDEKSLEESRKIFRF // 2. Statistics of DAD The following are statistics of this release of DAD. total number of entries 214,066 total length of sequences 66,063,073 aa average length 308.6 aa name of longest sequence X90568-1 PID:g1212992 length of longest sequence 26,926 aa (X90568-1) file entries peptides ------------------------------------- ddbjbct 50384 14901442 ddbjest 946 85135 ddbjgss 1 53 ddbjhtg 39 15623 ddbjhum 21812 6639299 ddbjinv 24014 9206311 ddbjmam 6442 1790610 ddbjpat 1976 545619 ddbjphg 2560 487072 ddbjpln 33502 12930511 ddbjpri 1755 333164 ddbjrod 20895 6142851 ddbjsts 13 1283 ddbjsyn 1736 374837 ddbjuna 997 309199 ddbjvrl 38473 9956860 ddbjvrt 8521 2343204 ------------------------------------- total 214066 66063073