RNAstructure Command Line Help
|
||||||||||||||
DecoyFinder applies machine learning trained with features determined by TurboFold to detect sequences that are not homologous with the other sequences in the set. USAGE: python3 DecoyFinder.py <ct file> [options]example: python3 /home/username/RNAstructure/scripts/DecoyFinder /home/username/TurboFold.conf Required parameters:
Options that do not require added values:
Notes:DecoyFinder post-processes the result of a TurboFold calculation. It uses the configuration file from TurboFold. If all the TurboFold output is not available, DecoyFinder calls TurboFold with the configuration file first. The configuration file must include the option SaveFiles and must include the option StartingSaveFiles. These save the partition function results that are processed by DecoyFinder. The configuration file must also specify the OutAln, the alignment file name, which is also used by DecoyFinder. The output is to standard out. Identified decoys (non-homologous sequences) are listed. They are named according to the sequence name iside the sequence file. Required Libraries:DecoyFinder requires installation of the following Python libraries: joblib, numpy, and scikit-learn. We highly recommend the use of conda. A good reference for installing scikit-learn is available at: https://scikit-learn.org/stable/install.html. Set up RNAstructure: DecoyFinder calls components of the RNAstructure package. Make sure to add the RNAstructure executables to your global path. The following steps show how to do so in Linux: DecoyFinder Training:DecoyFinder also provides code for user to train their own model. DecoyFinder training will generate configure file for DecoyFinder to train and output the trained AdaBoost model. USAGE: python3 DecoyFinder_training.py <sequence file directory> <output file directory> <Model output> Required parameters:
References:
|
||||||||||||||
Visit The Mathews Lab RNAstructure Page for updates and latest information. |