mzmatch.ipeak.sort.RelatedPeaksLC-MS experiments yield large amounts of peaks, many of which correspond to derivatives of peaks of interest (eg, isotope peaks, adducts, fragments, multiply charged molecules), termed here as related peaks. This tool clusters groups of related peaks together and attempts to identify their relationship. Each cluster is given a unique id, which is stored as an annotation with the name 'relation.id'. The relationship is stored as an annotation with the name 'relation.ship'.
Windows batch-file: SET JAVA=java -cp mzmatch.jar -da -dsa -Xmn1g -Xms1425m -Xmx1425m -Xss128k -XX:+UseParallelGC -XX:ParallelGCThreads=10 REM extract all the mass chromatograms %JAVA% mzmatch.ipeak.ExtractMassChromatograms -v -i raw\*.mzXML -o peaks\ -ppm 3 REM combine the individual timepoints %JAVA% mzmatch.ipeak.Combine -v -i peaks\24hr_*.peakml -o 24hr.peakml -ppm 3 -rtwindow 30 -combination biological %JAVA% mzmatch.ipeak.Combine -v -i peaks\28hr_*.peakml -o 28hr.peakml -ppm 3 -rtwindow 30 -combination biological %JAVA% mzmatch.ipeak.Combine -v -i peaks\32hr_*.peakml -o 32hr.peakml -ppm 3 -rtwindow 30 -combination biological REM combine all timepoints in a single file %JAVA% mzmatch.ipeak.Combine -v -i *hr.peakml -o timeseries.peakml -ppm 3 -rtwindow 30 -combination set REM detect the related peaks %JAVA% mzmatch.ipeak.sort.RelatedPeaks -v -i timeseries.peakml -o timeseries_related.peakml -ppm 3 -rtwindow 15References:
Commandline options* | |
---|---|
-i <filename> | Option for the input file. The only allowed format is PeakML and when it is not set the input is read from standard in. The contents of the file is enforced to be peaksets (result of Combine) as this tool utilizes the full information of peaksets in order to identify related peaks. |
-o <filename> | Optional filename where the output is written. If this is not set the output is written to the standard output. |
-basepeaks <filename> | Optional filename where the output is written. If this is not set the file with the basepeaks is not written. |
-ppm <double> | The mass tolerance in ppm. This value is used for the identification of the relations in the found sets of related peaks. |
-rtwindow <double> | The retention time window in seconds, defining the range where to look for matches. |
-minrt <double> | Denotes the minimum retention time in seconds peaks should occur before being taken into account for the relation process. When this value is not set all peaks are taken into account. |
-h | When this is set, the help is shown. |
-v | When this is set, the progress is shown on the standard output. |