DOKK / manpages / debian 12 / bioperl / bp_dbsplit.1p.en
BP_DBSPLIT(1p) User Contributed Perl Documentation BP_DBSPLIT(1p)

bp_dbsplit - script to split an input set of database(s) into smaller pieces

  bp_dbsplit.PLS --size 50 [-i inputfile] [-if inputformat] [-of outputformat]
              [--prefix outputprefix] [ < file1 file 2  OR file1 file2]

This script will take as input a list of filenames or a single file or from STDIN a sequence database and split the database into separate files of X numbers of sequences. You specify X with the "--size/-s" parameter. The input and output sequence format is any that is supported by bioperl (fasta,embl,genbank,gcg, swissprot, etc).

You can specify the input data either as a single file with -i filename, or as a single file as an argument like

  % bp_dbsplit file1 file2

or as a list of sequence data with

  % cat file1 file2 file3 | bp_dbsplit

You'll want to use the "--prefix" to specify what the output prefix will be.

User feedback is an integral part of the evolution of this and other Bioperl modules. Send your comments and suggestions preferably to the Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web:

  https://github.com/bioperl/bioperl-live/issues

Jason Stajich, jason-at-bioperl-dot-org

2021-08-15 perl v5.32.1