MassBlast was developed by André Veríssimo and Jean-Etienne Brassard and is released under GNU General Public License version 3
BLAST+ (see table in Install section)
Command line application to perform BLAST queries from multiple files against different databases at once.
A pre-print of the manuscript describing this application is available at bioRxiv and can be accessed here.
General description of the MassBlast workflow:
blastn -version
mass-blast
scriptMassBlast downloads | Windows | Mac OS X | Linux |
BLAST+ (pre-requirement) | v2.2.30 (32-bit) | v2.6.0 | 2.6.0 |
BLAST+ important notes
ncbi.ini
located at a subdirectory at the AppData
folder in the user directorynote: Ruby and all other requirements are included in the package files, it is not necessary to install when using packaged version.
db_and_queries/queries
folder.
db_and_queries/db
folder.
user.yml.example
for more information.Example of a nucleic-acid query file that could be placed in db_and_queries/queries
folder:
>Example01
attgggaatttactgcaactcaaggagaagaaaccctaccagacttttacaaggtgggct
gaggagt
>Example03
attgggaatttactgcaactcaaggagaagaaaccctaccagactttt
>Example02
attgggaatttactgcaactcaaggagaagaaaccctaccagacttttacaaggtgggct
gaggagtatttactgcaactcaaggagaagaaaccctaccagacttttacaaggtggtgg
gcaactcaagcaactcaagcaactcaagcaactcaa
The method is defined in the file user.yml
Using makeblastdb command that comes bundled with Blast+
Run makeblastdb command in that directory
nucleic-acid database
$ makeblastdb -in <filename> -dbtype nucl -out "<blast_db_new_name>" -title "<blast_db_new_name>"
protein database
$ makeblastdb -in <filename> -dbtype nucl -out "<blast_db_new_name>" -title "<blast_db_new_name>"
note: do to not use spaces in the <blast db new name>
Place the fasta files for the database in db_and_queries/import_dbs
directory and run the appropriate script.
You also need to say if it is a nucleic-acid or protein-based fasta file.
For Linux and Mac OS X run the import_fastas.sh
script
$ cd db_and_queries/import_dbs
# for nucleic-acid
$ sh import_fastas.sh nucl
# for protein
$ sh import_fastas.sh prot
For Windows run the import_fastas.bat
script
$ cd db_and_queries/import_dbs
# for nucleic-acid
$ import_fastas.bat nucl
# for protein
$ import_fastas.bat prot
MassBlast was developed primarily by André Veríssimo, Jean-Etienne Bassard and Susana Vinga
A pre-print of the manuscript is available at bioRxiv and can be accessed here
This work was supported by:
We would like to thank Cathie Martin and Philippe Vain for reading the manuscript and providing us with important comments and insights. We would also like to thank Aldo Ricardo Almeida Robles and Nuno Mira for testing MassBlast.