The website introduces the procedures of preparing input data for our tools. The user guide is also avaliable in pdf format. Download

User Guide for Preparing Input Data

The quantitation tools developed by the Systems Proteomics Lab, Institute of Information Science, Academia Sinica require spectral data files (.mzXML) and search results as the input data files. In the following, we introduce conversion of raw spectral data to mzXML files and preparation of search results. The workflows for Mascot users and PeptideProphet/ProteinProphet users are shown in the following figures.


Figure. Data preparation for Mascot users



Figure.
 Data preparation for PeptideProphet/ProteinProphet users (Also for users using Mascot, SEQUEST, and X!tandem coupled with PeptideProphet/ProteinProphet)


I.    Convert Spectral data files to mzXML Files
All the spectral data files generated from instrument have to be converted into mzXML format. Here we list the mzXML converter for RAW, Wiff, and BAF files. For the detail information of mzXML data format, you can check the following website of Seattle Proteome Center: ttp://tools.proteomecenter.org/wiki/index.php?title=Formats:mzXML


a.    For Wiff data format
To convert Wiff files to mzXML format, we recommend use mzWiff, which is available at http://tools.proteomecenter.org/wiki/index.php?title=Software:mzWiff.


b.    For RAW (ThermoFinnigan) data format
To convert RAW files to mzXML format for ThermoFinnigan users , we recommend use ReAdW, which is available at http://tools.proteomecenter.org/wiki/index.php?title=Software:ReAdW.


c.    For RAW (Waters) data format
To convert RAW files to mzXML format for Waters users, we recommend use massWolf, which is available at http://tools.proteomecenter.org/wiki/index.php?title=Software:massWolf.


d.    For BAF data format
The detailed information of conversion BAF files to mzXML format by CompassXport can be found at http://tools.proteomecenter.org/wiki/index.php?title=Software:CompassXport.

II.    Prepare Search Results
The tools accept search results from Mascot, SEQUEST, X!Tandem, and PeptideProPhet /ProteinProphet.

a.    For Mascot Users
The tools accept XML format from Mascot output. When exporting search results from Mascot, please make sure that the option “Query title" is checked in “Peptide Match Information Section”, as shown below.


 


Troubleshooting: If you get any error message when executing our tools, please first make sure your search result files belong to the following four types that contain required information for subsequent quantitation.


For
Bruker instruments:

Note: Since the peptide query title provides insufficient information for quantitation, after completing Mascot search, please make sure that the name of search result file is consistent with that of mzXML file.

 

For Waters instruments:


For ThermoFinnigan instruments:


For SCIEX/ABI Q-STAR instrument:

  1. Click “Open Dir” button and select the directory where .WIFF files locate. Then all WIFF files in the folder are listed (as shown below).
  2. Check the WIFF files and click “GO” button to generate .table files. If a checked WIFF file contains spectral data from more than one LC-MS runs, please check the option “Convert to individual files”.
  3. When the process is completed, the “status” field will show “Done!” and .table files are shown in the folder.


b.    For SEQUEST, X!Tandem and PeptideProphet-ProteinProphet Users

SEQUEST and X!Tandem users need to go through PeptideProphet and ProteinProphet to generate PepXML and ProtXML files for peptide identification and protein identification results, respectively . Other PeptideProphet and ProteinProphet users also use the generated PepXML and ProtXML files as search result input in our quantitation tools.




Systems Proteomics Laboratory
Institute of Information Science
Academia Sinica, Taipei 115, TAIWAN
Last updated 10/24/2008

Website: http://ms.iis.sinica.edu.tw/
E-mail: bioinfo@iis.sinica.edu.tw