RNAstructure Command Line Help |
||
Sequence File FormatThe native sequence format is the sequence file with a .SEQ extension. Lowercase nucleotides will be forced single stranded in structure prediction. The file must have the following format:
FASTA (Sequence File) FormatSequences can also be provided to RNAstructure in FASTA format. For FASTA, the first line, a title line, needs to start with ">". Subsequent lines should only contain sequence and whitespace, which is ignored. Lowercase nucleotides will be forced single stranded in structure prediction. CT File FormatA CT (Connectivity Table) file contains secondary structure information for a sequence. These files are saved with a CT extension. When entering a structure to calculate the free energy, the following format must be followed.
The CT file may hold multiple structures for a single sequence. This is done by repeating the format for each structure without any blank lines between structures.
Constraint File FormatFolding constraints are saved in plain text with a CON extension. These can be hand edited. For multiple entries of a specific type of constraint, entries are each listed on a separate line. Note that all specifiers, followed by "-1" or "-1 -1", are expected by RNAstructure. For all specifiers that take two arguments, it is assumed that the first argument is the 5'nucleotide. Nucleotides positions are specified from the 5' end, where the first nucleotide in the sequence is in position 1.The file format is as follows: DS: XA -1 SS: XB -1 Mod: XC -1 Pairs: XD1 XD2 -1 -1 FMN: XE -1 Forbids: XF1 XF2 -1 -1
SHAPE Data File FormatThe file format for SHAPE reactivity comprises two columns. The first column is the nucleotide number, and the second is the reactivity. Nucleotides for which there is no SHAPE data can either be left out of the file, or the reactivity can be entered as less than -500. Columns are separated by any white space. Note that there is no header information. Nucleotides 1 through 10 have no reactivity information. Nucleotide 11 has a normalized SHAPE reactivity of 0.042816. Nucleotide 12 has a normalized SHAPE reactivity of 0, which is NOT the same as having no reactivity when using the pseudo-energy constraints. By default, RNAstructure looks for SHAPE data files to have the file extension SHAPE, but any plain text file can be read. List File FormatList files have a LIS extension. This file contains any number of sequences of any length or nucleic acid, each on its own line. Offset File FormatOffset files are plain text. The files contain two colums: the nucleotide followed by the offset value in kcal. Experimental Pair Bonus File FormatBonus files are plain text. They are formatted as an nxn matrix of bonus values, where n is the length of the sequence. Alignment File FormatAlignment files are plain text. They are formatted as a nucleotide in the first sequence immediately followed by the nucleotide in the second sequence it's aligned to, separated by a space. Only one alignment pair can be on each line, and the last line of the file must be "-1 -1". Dot Bracket File FormatDot bracket files are plain text. The encode a sequence and secondary structure. The first line starts with a ">" character and a file title follows. The next line contains the sequence. The final line contains "." (unpaired nucleotide), "(" (nucleotide that is 5' in a pair), and ")" (nucleotide that is 3' in a pair. RNAstructure/examples/bmorivector.dot provides a sample file. |
||
Visit The Mathews Lab RNAstructure Page for updates and latest information. |