RNAstructure logo

RNAstructure Command Line Help
probscan

ProbScan calculates the exact probability of features in a structure. Output is written to standard out.

USAGE: ProbScan <input file> [options]

Required parameters:

<input file> This is a sequence file (.seq or FASTA format) or partition function save file. The default is a partition function save file, the -s flag (below) is used to specify a sequence. The partition function will need to be calculated, so it is faster to use the partition function save file.

Options that do not require added values:

-a -A --hairpin Print probabilities for all possible hairpin loops.
-b -B --bulge
Print probabilities for all possible bulge loops.
-d -D --DNA

Specify that the sequence is DNA, and DNA parameters are to be used. Default is to use RNA parameters. This only has any effect if a sequence is read and -s is specified.

-h -H --help Display the usage details message.
--i -I --internal Print probabilities for all possible internal loops.
--scientific Display probabilities in scientific notation rather than the default of three digits to the right of the decimal point. This is helpful when the threshold is lowered (see --threshold).
-s -S --sequence
Provide a sequence file. Partition function will be calculated (may take a while); if you're going to query the same sequence repeatedly, you could save a lot of time by running from a partition function save file produced by the 'partition' program.
-v -V --version

Display version and copyright information for this interface.

Options that require added values:

--alphabet Specify the name of a folding alphabet and associated nearest neighbor parameters. The alphabet is the prefix for the thermodynamic parameter files, e.g. "rna" for RNA parameters or "dna" for DNA parameters or a custom extended/modified alphabet. The thermodynamic parameters need to reside in the at the location indicated by environment variable DATAPATH. The default is "rna" (i.e. use RNA parameters). This option overrides the --DNA flag.
-e -E --helix
Print probabilities for all possible helices with this number of base pair stacks. To get single base pair stacks, use -e 1.
-m -M --multibranch
Provide a file with a list multibranch loops, one multibranch loop per line. Each line should contain a tag followed by a list of pairs where the indices of the pair are separated by a dash and each pair is separated by a tab character; e.g. "loop1 20-40 22-30 32-39" These multibranch loops' probabilities will be checked.
-p -P --pairs Calculate probability for a user-specified loop. The loop must be provided as a set of pairs of nucleotide indices (1-indexed), where the nucs in the pair are delimited by dashes and each pair is delimited by a comma; eg '-e 5-20' will show the probability of a hairpin loop closed by a pair between nucleotides 5 and 20, and '-e 10-120,15-70,75-110' will give the probability of a three-way junction where the exiting helices are closed by pairs at 10-120, 15-70, and 75-110.
--stemloop Calculate the probability of a stem-loop. The closing pair and the first pair of the stem are specified. All pairs will be required between the closing pair and the first pair of the stem loop. For example '--stemloop 100-108,96-112' will calculate the probability of a hairpin loop closed by the pairs 100-108, 99-109, 98-110, 97-111, and 96-112, i.e. a stem loop with 6 pairs and 7 unpaired nucleotides.
-t -T --threshold The modes that find all possible loops of a given type (-a, -b, -i; above) require a loop to exceed a probability threshold to be included in the output. By default, the threshold is 0.01. This parameters is used to override this value. If this threshold is lowered, it is helpful to change the output probability notation to scientific with --scientific.

 

References:

  1. Sloma, M.F. and Mathews, D.H.
    "Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures."
    RNA. 22:1808-1818. (2016).
  2. Reuter, J.S. and Mathews, D.H.
    "RNAstructure: software for RNA secondary structure prediction and analysis."
    BMC Bioinformatics, 11:129. (2010).
  3. Mathews, D.H., Sabina, J., Zuker, M. and Turner, D.H.
    "Expanded sequence dependence of thermodynamic parameters provides improved prediction of RNA secondary structure."
    J. Mol. Biol., 288:911-940. (1999).