READSEQ
READSEQ reformats nucleic acid or protein sequence files into a different format. READSEQ is particularly useful as it automatically detects many sequence formats, and interconverts among them.
Formats which readseq currently understands:
- IG/Stanford, used by Intelliigenetics and others
- GenBank/GB, genbank flatfile format
- NBRF format
- EMBL, EMBL flatfile format
- GCG, single sequence format of GCG software
- DNAStrider, for common Mac program
- Fitch format, limited use
- Pearson/Fasta, a common format used by Fasta programs and others
- Zuker format, limited use. Input only.
- Olsen, format printed by Olsen VMS sequence editor. Input only.
- Phylip3.2, sequential format for Phylip programs
- Phylip, interleaved format for Phylip programs (v3.3, v3.4)
- Plain/Raw, sequence data only (no name, document, numbering)
- MSF multi sequence format used by GCG software
- PAUP's multiple sequence (NEXUS) format
- PIR/CODATA format used by PIR
- ASN.1 format used by NCBI
- Pretty print with various options for nice looking output. Output only.
Copyright 1990 by D. G. Gilbert, Biology dept., Indiana University,
Bloomington, In 47405
e-mail: gilbertd@bio.indiana.edu
Installed on the Opteron Cluster.
Version 1Feb93
READSEQ users will find it convenient to use the /biomed/lib/examples/.login.fixed login file. It adds to your default path directories where many sequence analysis packages are stored, sets up the environment for EMBOSS, and defines some environment variables for sequence analysis packages.
Usage:
/biomed/bin/readseq
Other READSEQ Documentation
- Tutorial on converting from one sequence file format to another.
- Documentation provided by the author.
See also:
- Other nucleic acid and protein sequence analysis software installed at the PSC.