-
Notifications
You must be signed in to change notification settings - Fork 10
/
Copy pathREADME
177 lines (133 loc) · 6.53 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
FAST --- FAST Analysis of Sequences Toolbox
Version 1.06. April 20, 2015, after acceptance in Frontiers in Genetics
Version 1.05. April, 2015
Version 1.00. February, 2015
INTRODUCTION
The FAST Analysis of Sequences Toolbox (FAST) is a set of Unix tools
(for example fasgrep, fascut, fashead and fastr) for sequence
bioinformatics modeled after the Unix textutils (such as grep, cut,
head, tr, etc). FAST workflows are designed for "inline" (serial)
processing of flatfile biological sequence record databases
per-sequence, rather than per-line, through Unix command
pipelines. The default data exchange format is multifasta
(specifically, a restriction of BioPerl FastA format). FAST tools
expose the power of Perl and BioPerl for sequence analysis to
non-programmers in an easy-to-learn command-line paradigm.
You do not need to know Perl or BioPerl to use FAST.
UTILITIES
FAST 1.0 contains the following utilities. Each has its own man
page. FAST utilities may be classifed as for annotation, selection,
transformation, and analysis.
annotation
----------
faslen -- annotate sequence lengths
fascodon -- tally/annotate codon usage
fascomp -- tally/annotate monomer frequencies
fasxl -- translate gapped and ungapped sequences and alignments
fasrc -- reverse complement nucleotide sequences and alignments
selection
---------
fasgrep -- select sequence records by perl regular expressions
fasfilter -- select sequence records by numerical values
fastax -- select sequence records by NCBI Taxonomy IDs or names
fascut -- select/reorder sequence record data by sequential ranges
fasuniq -- remove duplicate sequence records from sorted data
fashead -- select leading sequence records
fastail -- select trailing sequence records
alncut -- select sites based on variation and gap-content content
gbfcut -- select sequences by regex match on GenBank features
gbfalncut -- select sites by regex match on GenBank features
transformation
--------------
fasconvert -- convert sequences to or from from fasta format
fassort -- sort sequence records
fastaxsort -- sort sequence records by NCBI Taxonomy IDs or names
faspaste -- concatenate sequence record data
fastr -- transform sequence records by characters and alphabets
fassub -- transform sequence records by regex-based substitutions
analysis
--------
faswc -- tally sequences and characters
alnpi -- tally molecular population genetic statistics
PREREQUISITES
FAST has some Perl dependencies. Once posted to CPAN, the easiest way
to install FAST with its dependencies will be by executing this
command:
(sudo) perl -MCPAN -e 'install FAST'
You can also follow INSTALLATION instructions below, but you may need to
first run these commands to install FAST dependencies:
(sudo) perl -MPCAN -e 'install Sort::Key'
(sudo) perl -MCPAN -e 'install Sort::MergeSort'
(sudo) perl -MCPAN -e 'install Bit::Vector'
(sudo) perl -MCPAN -e 'install Pod::Usage'
INSTALLATION
To install this module, please run the following:
perl Makefile.PL
make
make test
(sudo) make install
If you haven't installed perl-based software on your system before, or
you are having problems, please see more detailed notes in INSTALL.
GETTING STARTED
perldoc fasgrep
man fasgrep
perldoc FAST
Some usage examples in the man pages refer to data that ships with
FAST, in the installation package under t/data. Run these examples
with these data to get started right away using FAST.
DOCUMENTATION
Installation generates a man page for each utility. From the installation
directory, additional resources are in ./doc, particularly the FAST_Cookbook
SUPPORT
You can also look for information at:
RT, CPAN's request tracker (report bugs here)
http://rt.cpan.org/NoAuth/Bugs.html?Dist=FAST
AnnoCPAN, Annotated CPAN documentation
http://annocpan.org/dist/FAST
CPAN Ratings
http://cpanratings.perl.org/d/FAST
Search CPAN
http://search.cpan.org/dist/FAST/
CITING
If you use FAST please, cite Lawrence et al. (2015). FAST: FAST Analysis of
Sequences Toolbox in Frontiers in Genetics, and Stajich et al. (2002).
BioPerl project. If you publish work that cites data provided in this FAST
installer, please cite the authors of this data accordingly. For example, if
you use the tRNAdb-CE data provided in this installer, please cite: Abe T,
Inokuchi H, Yamada Y, Muto A, Iwasaki Y and Ikemura T (2014) tRNADB-CE: tRNA
gene database well-timed in the era of big sequence data. Front. Genet.
5:114. doi: 10.3389/fgene.2014.00114
VERSION AND CHANGES
1.05 -- Current version. Please see file Changes.
1.0 -- First public release
LICENSE AND COPYRIGHT
Copyright (C) 2015 David H. Ardell
This program is free software; you can redistribute it and/or modify it
under the terms of the the Artistic License (2.0). You may obtain a
copy of the full license at:
L<http://www.perlfoundation.org/artistic_license_2_0>
Any use, modification, and distribution of the Standard or Modified
Versions is governed by this Artistic License. By using, modifying or
distributing the Package, you accept this license. Do not use, modify,
or distribute the Package, if you do not accept this license.
If your Modified Version has been derived from a Modified Version made
by someone other than you, you are nevertheless required to ensure that
your Modified Version complies with the requirements of this license.
This license does not grant you the right to use any trademark, service
mark, tradename, or logo of the Copyright Holder.
This license includes the non-exclusive, worldwide, free-of-charge
patent license to make, have made, use, offer to sell, sell, import and
otherwise transfer the Package with respect to any patent claims
licensable by the Copyright Holder that are necessarily infringed by the
Package. If you institute patent litigation (including a cross-claim or
counterclaim) against any party alleging that the Package constitutes
direct or contributory patent infringement, then this Artistic License
to you shall terminate on the date that such litigation is filed.
Disclaimer of Warranty: THE PACKAGE IS PROVIDED BY THE COPYRIGHT HOLDER
AND CONTRIBUTORS "AS IS' AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES.
THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR
PURPOSE, OR NON-INFRINGEMENT ARE DISCLAIMED TO THE EXTENT PERMITTED BY
YOUR LOCAL LAW. UNLESS REQUIRED BY LAW, NO COPYRIGHT HOLDER OR
CONTRIBUTOR WILL BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, OR
CONSEQUENTIAL DAMAGES ARISING IN ANY WAY OUT OF THE USE OF THE PACKAGE,
EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.