Testing new candidate exoprotease substrates TIMING ~2-14 h
1| Thaw an aliquot of human serum from Sigma (250 μL) on wet ice for an hour.
2| During this time, prepare fresh reagents (see REAGENTS SETUP).
3| Analyze substrate peptides before proteolysis (zero time point) using 100 pmol of the peptide which will be used in a particular assay (e.g., C3f_
13C) to verify the molecular weight (
m/z value) (
Box 1).
BOX 1 - Manual solid-phase extractionA `time-zero' data point is taken using 100 pmol of the peptide under study to verify the MH+ value.
1. Transfer 20 μL of thawed serum (or plasma) to a well of a 0.2-mL eight-tube strip.
2. To the sample, add 10 μL of 10 pmol / μL substrate (labeled / unlabeled) solution to yield 100 pmol.
▲ CRITICAL STEP Change tips in between each step to avoid contamination.
3. Immediately, add 5 μL of the pre-washed Dynabeads and mix it three times using your pipette tip.
4. Place the eight-tube strip on a magnetic plate to separate the peptide bound Dynabeads from the sample.
5. Remove and discard the supernatant.
6. Wash the Dynabeads by adding 200 μL of 0.1% TFA and mixing it by moving the eight-tube strip back and forth eight times.
7. Aspirate and discard the washing solution.
8. Repeat the wash step (steps 6 and 7) one more time.
9. After removing the wash solution, resuspend the beads with 6 μL of 50% acetonitrile by pipetting the beads up and down quickly ten times.
10. Transfer 5 μL of eluate into a new well.
11. Add 10 μL of matrix solution to the well containing 5 μL of the eluate.
12. Mix the matrix with the eluate by pipetting up and down.
13. Aspirate 1 μL of the mixture and spot on a MALDI plate.
14. After the spot has crystallized, insert the plate into a MALDI-TOF mass spectrometer.
15. Select the method `ExoproteaseAssay.par'
Box 4. (Reflectron mode method created in house) in the FlexControl program.
16. Acquire a spectrum and save it in a new created folder to be stored for future reference.
17. Open the FlexAnalysis program and choose your newly created file.
18. You should be able to see the parent peptide peak among the endogenous serum peptides. ! CAUTION If 100 pmol is insufficient, you must adjust the amount of peptide and repeat the above steps until the proper concentration is established. This may vary among peptides.
? TROUBLESHOOTING
4| Select preliminary incubation time points to test. Generally, two preliminary incubation time points are selected, for instance 15 minutes (min) and 1 hour (h).
? TROUBLESHOOTING
5|Prepare Samples for the preliminary test time points, as described in
Box 1, and evaluate the selected timepoints. Based on the results (extent of degradation or lack thereof) after these incubations, a decision should be made about which additional time points to select. For example, if many or all rungs of the expected ladder are observed after 15 min, incubations of the peptide in serum will also be done for 5 and 30 min. If there was degradation only after 1 h, additional time points at 30 min and 2 h time will be investigated. Lastly, if no degradation was observed after 1 h, incubation periods of 3 h, 6 h and overnight should be tested ().
▲ CRITICAL STEP Cover the samples in the eight-tube strip with its eight-dome cap strip to avoid evaporation and precipitation.
▲ CRITICAL STEP Incubate reactions at room temperature (RT: 19-24 °C)
6|After the best time point(s) are established, synthesize isotopically labeled versions of the peptides judged to be optimal substrates, as well as synthesize a series of doubly-labeled, non-degradable D-peptides (reference peptides consisting of D-amino acids) that correspond to each rung in the degradation ladder.
7|Quantitate newly synthesized peptides by amino acid analysis, as described in
Box 2)
Testing new labeled exoprotease substrates and reference peptides in serum pools TIMING ~ 2-15 h
8|Prepare pools of the clinical groups you will be studying. First transfer an equal amount from each individual sample type into a large vial. Then mix the new created pool by pipetting up and down couple of times. Make 20 μL aliquots of the pool and freeze samples in -80 °C.
▲ CRITICAL STEP Pools must be prepared on wet ice.
9|Thaw an aliquot of the pool on wet ice.
! CAUTION Leftover sample should not be frozen for further use.
10|Repeat steps 2 - 4 with the thawed aliquots (exactly as performed for the unlabeled peptide) to confirm that the same degradation time and concentration conditions apply to the labeled version.
Incorporating the corresponding reference D-peptides into your assay
11|Transfer 20 μL of thawed pool of reference D-peptides () to a well of a 0.2-mL eight-tube strip.
12|Add 10 μL of 0.2 pmol / μL reference peptide mix (see REAGENTS SETUP) to yield a total amount of 2 pmol.
! CAUTION Change tips in between each step to avoid contamination.
13|Add 10 μL of 10 pmol / μL substrate (degradable labeled peptide) (see REAGENTS SETUP) to yield a total amount of 100 pmol.
! CAUTION Change tips in between each step to avoid contamination.
14|Cap the eight-tube strip with its eight-dome cap strips and then start incubation at RT (19-24°C).
15|After incubation, follow
Box 1.
16|Check that MS results show the presence of the `exogenous' ladder with its corresponding reference D-peptides (see mass spectrometry section for details). If this is not the case, more time points should be added to the pilot studies and /or the quantitation of the D-peptides double-checked.
Blood collection and Serum preparation and storage TIMING ~ 21 h
17|Collect venous blood into one BD Vacutainer SST tube.
▲ CRITICAL STEP Fill blood collection tubes to the top (8.5 mL) to avoid hemolysis.
! CAUTION For all blood and blood derived samples, always observe precautions: handle all biological samples as a potential source of pathogens, use the appropriate protective attire (lab coats, safety glasses, latex gloves, etc.) and dispose properly of all biohazardous materials.
? TROUBLESHOOTING
18|Gently invert the tube five times to mix the clot activator with the blood. Allow blood to clot for 1 h at RT (19-24°C) with the tube vertical.
19|Keep the SST tubes on wet ice in vertical position until transport. Then place the tubes in an appropriately coded bag. Keep below 4°C during transport.
20|Spin SST tubes in a clinical centrifuge at 1,400-2,000g for 10 min at RT.
21|Label 4-mL cryovials.
22|Transfer the serum (upper phase) to the appropriately labeled 4-mL cryovials. The volume of serum per cryovial should be approximately 1 mL. These cryovials will be called `source vials' throughout this protocol.
23|Immediately store all serum samples at -80°C. Avoid freeze-thawing cycles. Transport serum samples to the MS laboratory on dry ice.
24|Enter all sample information into the LIMS before the samples arrive at the lab. Fill out the necessary information for each specimen (see EQUIPMENT SETUP). Bar-code labels for the original source vial tubes should be created automatically every time sample information is submitted. Keep labels organized in the same order as the samples to make the process easier. Generate a source vial list for verification while labeling the source vials.
25|Label the source vials while they are frozen on dry ice. Note that the barcode labels are designed for small 0.5-mL microcentrifuge tubes and not for bigger vials. Labels will not overlap on the back of larger tubes, so it is difficult for the labels to stay attached, especially when the frozen vials have water condensation on the surface. For those reasons, it may be necessary to wipe the tubes dry with a piece of paper tissue, prior to attaching the label and adding a piece of tape on top to secure the label effectively.
▲ CRITICAL STEP Always double-check that sample IDs on source vials match the sample information associated with their bar-code labels in the database.
26|Store the labeled source vials in the -80 °C freezer in cardboard freezer boxes with temporary labels indicating the project and date, and marked `source vials to be aliquoted.' If more than one source vial is received for each specimen, then use only one of them for aliquoting. Place the remaining source vials in separate boxes temporarily labeled with tape with the project name, date and marked `source vials for permanent storage.' These vials will be entered later into bar-coded boxes and those boxes will be added to the inventory.
▲ PAUSE POINT Serum source vials can be kept frozen for at least 5 years at -80 °C.
27|Take the boxes containing the source vials to be aliquoted out of the freezer and place the vials on dry ice.
▲CRITICAL STEP The samples should remain frozen while the bar-codes for the aliquots are being generated.
28|Scan the bar-code of the first source vial to be aliquoted. A `Specimen ID' will be generated if the database recognizes the sample correctly. Select the volume of serum in the source vial. Indicate the volume of the aliquots (50 μL) and select the number of aliquots to be created based on the volume of sample available. New labels will be printed for all of the aliquots created. If sufficient sample is available, generate nine aliquots of 50 μL each per specimen (source vial). Keep adding specimens until the list has been filled.
29|Label clean 0.5-mL microcentrifuge tubes with the aliquot bar-code labels and organize them according to the order of the original source vials. If available, use 12×8-hole plastic racks for 0.5-mL tubes or place the labeled tubes directly in the aliquot racks of the Tecan liquid handler
12. Generally, tubes have to be organized in eight rows of nine tubes. From each set of tubes, eight will be placed in temporary `aliquots for permanent storage' boxes and one aliquot (normally, aliquot no.1) will be placed in a separate box marked `to be run with Tecan' and dated.
30|Place the source vials on wet ice once all the aliquot tubes have been labeled, organized and checked. Use big-flat ice buckets to place the samples in the same order they will be aliquoted and let thaw for 60-90 min.
▲ CRITICAL STEP Verify that the source vials and their associated aliquots match before proceeding with the aliquoting process to avoid thawing the serum of source vials that were not planned to be aliquoted.
31|Load the clean, newly bar-coded aliquot tubes in the aliquot racks (16 rows of nine aliquots of each sample). Tubes have to be placed so the lids rest against the back lip of the rack, allowing the tubes to be closed easily once they have been aliquoted with serum. Place the aluminum aliquoting racks in the robot carriers in the layout as indicated in and then load the rack with the source vial samples in the same order as the aliquot tubes. Keep the source vial racks on wet ice to maintain the serum as cold as possible before the aliquoting process begins.
32|Open the Gemini program by clicking on the icon located on the PC Desktop. Run the Gemini aliquot program or manually perform the aliquoting using microcentrifuge micropipettes. Aliquoting process can be done automatically or manually, as long as all the tubes are carefully organized.
33|Close the tubes and put them in their corresponding foam inserts and boxes on dry ice, to be temporarily stored at -80°C in the prelabeled temporary boxes. The boxes' numbers and dates will allow the samples to be correctly identified during later steps. This process must be kept as short as possible to avoid exposure of the samples to room temperature.
▲ PAUSE POINT. Serum aliquots can be kept frozen for at least 5 years at -80 °C.
34|Create labels for the boxes for permanent storage of the aliquots. Calculate how many boxes are needed to store all the created aliquots, dividing the total number of aliquots by 128 (maximum number of aliquots per box) and rounding up to the next whole number. For this purpose new `Containers' will be created in the `Inventory' section of the LIMS. During this process, bar-code labels for the upper and lower foam inserts, for the entire freezer box, and for the freezer racks will be created. Place the label on the center of the outer part of the box. The label for the upper foam insert goes in the upper left corner of the inner part of the box and that for the lower foam insert at the lower right corner of the inner part.
35|Scan the aliquots to be placed in their permanent locations in the prelabeled boxes, using one temporary box at a time. Enter the aliquot's permanent location in the inventory section of the LIMS. Scan the samples and place them in the physical location assigned by the database. (A1 is the top left spot in the foam inserts.) In a similar manner, fill the other containers: place aliquots inside foam inserts, foam inserts inside cardboard boxes, boxes inside freezer racks, racks inside freezer shelves, shelves inside freezer and freezer inside room. Keep all aliquots frozen on dry ice at all times.
▲ CRITICAL STEP Make sure that all aliquots go to their proper permanent storage box. From this time on, information on where the specimens are will be stored only in the LIMS system.
▲ PAUSE POINT Serum aliquots can be kept frozen for at least 5 years at -80 °C.
Set up of the robot run TIMING ~ 4 h
36|Scan the bar-codes of the aliquots to be run and make a list of their bar-code identifiers. Save the list with the project name and date as a plain text file (.txt in NotePad or TextEdit). Keep all aliquots frozen on dry ice at all times. These aliquots (normally aliquot no. 1 of each sample) should be in temporary `to be run' boxes after the aliquoting step.
37|Open the randomization software, MALDI autoexecute generator v 3.0.exe
12, and indicate the number of samples to be run. The maximum number of serum aliquots that can be run for each MALDI plate is 96. If more than 96 samples are added, the program will create two or more MALDI plates.
38) Press the `begin' button and the program will ask to choose the file containing the file names (the recently created plain text file). Once you have selected the file, a layout of the samples' positions on the MALDI plate in the randomized order will be generated, together with the calibrant locations. The layout generated by the randomization software does not represent the 96-well plate where the serum samples will be placed for the solid-phase extraction. Instead, it represents the 384 spots corresponding to the MALDI target plate, showing where the 96 serum samples plus the calibrants will be spotted for mass spectrometry analysis.
39) Next, the program will ask where to save the output file (Autoexecute file), which contains the set of instructions for the FlexControl program to run the mass spectrometer automatically.
40) Print the target layouts (generated in step 38), as they will be needed during the organization of the randomized serum aliquots and will later be filed in the corresponding record's folder.
41| Locate and organize the aliquots in 96-well plastic racks (precooled on dry ice for 10 min) according to the randomized order shown in the target layouts. Always verify the sample list and the layouts to make sure that listed samples, information in the layout and tube identifications match. Take the boxes with the aliquots already organized out of the -80°C freezer and put them in big-flat ice buckets containing dry ice. Then, take aliquots from the boxes and put them in 96-well plastic racks following the randomized order. Keep all aliquots frozen on dry ice at all times.
▲ CRITICAL STEP This is a very tedious but important step. Any error in properly organizing the samples in the randomized order will affect the whole experiment.
▲ PAUSE POINT Serum aliquots can be kept frozen at least 5 years at -80°C.
42| Transfer output files (Autoexecute files) to the MALDI-TOF PC (for instance: C:\data\methods\autoexecute files).
▲CRITICAL STEP Print Autoexecute files to keep them in the records.
43| Schedule the runs of clinical samples. Check the following: Have you randomized all the aliquots to be run, organized them in plastic racks, double-checked to ensure each aliquot is in the correct position on the correct plate and placed them in the -80°C freezer?; Have you generated Autoexecute files and printouts and transferred them to the appropriate folders?; Are all the reagents and materials available for the assay?
44| Start setting up the robot during the morning of the day of the runs by cleaning the Tecan liquid handler, degassing the water that the robot will use for the runs, ensuring the waste container has enough room to receive what the assay will discard. Also, flush the system twice.
▲CRITICAL STEP There should be no visible bubbles traveling the lines throughout the flushes. Flush until no bubbles remain.
? TROUBLESHOOTING
45|Perform an system performance verification experiment (see
BOX 5) on the morning of, or the day before, analysis of each sample set submitted for activity assaying.
BOX 5 | EXOPEPTIDASE ASSAY PERFORMANCE VERIFICATION1. Thaw an aliquot of frozen human serum (Sigma) on wet ice for 1 h.
2. Prepare the Tecan robot (step 44) and reagents (enough for the run; 10 wells) (see REAGENTS SETUP) while the samples thaw on wet ice. Prepare extra solution in case needed.
3. After the human serum (Sigma) has thawed, transfer 20 μL to the designated wells of 0.2-mL polypropylene “Template III PCR” half-skirted, 96-well microtiter plate. To each serum sample, add 10 μL of 0.2 pmol / μL reference peptide mix (final amount = 2 pmol). And then add 10 μL of 10 pmol / μL C3f_13C (final amount = 100 pmol)
▲ CRITICAL STEP Sample preparation is done on wet ice.
4. Add 40 μL PBS to remainder of empty wells.
5. Cover the 96-well plate with aluminum foil and incubate at RT (19-24 °C) for 3 h.
6. Place plate in the `plate holder 1' position ().
7. Run the Gemini program (identical to the program that is run for the first half of the plate during the runs with clinical samples) (
Box 3).
8. Remove the target and take it to the MALDI-TOF mass spectrometer.
9. Insert the MALDI target to be analyzed and select the method `ExoproteaseAssay.par.'
10. Open the quality control Autoexecute template and edit the file (change the date and directory folder for saving the spectra). After editing the file, save a newly named file in the quality control Autoexecute file directory.
11. Open the edited Autoexecute file in the FlexControl program and run it.
12. Open the FlexAnalysis program after the run has finished. Open all the just-created quality control spectra using the `open multiple spectra' function.
13. Overlay the ten mass spectra and make sure that the overlay has a similar peptide complexity to that shown in . In addition, zoom in on the areas to see the degradation ladder ().
14. Calculate the ratios of the ion intensities of selected `exogenous' degradation products (i.e., those derived from labeled, exogenous substrate) over the ion intensities of the corresponding reference peptides (sequences listed in ) for each replicate:
C3f_13C_8 / C3f_D_8;
C3f_13C_10 / C3f_D_10;
C3f_13C_11 / C3f_D_11;
C3f_13C_12 / C3f_D_12;
C3f_13C_16 / C3f_D_16.
Coefficients of variation (C.V.'s) of these ratios, achieved during a quality control analysis of 10 technical replicates, should be <15% for each of the 5 peptide pairs
11.
Automated solid-phase extraction TIMING ~ 6 h
46| Thaw the 48 serum samples, taking the first plastic rack (containing aliquots in the randomized order) from the -80°C freezer and transferring the samples to wet ice for about 60 min, carefully keeping the same order of the samples as in the plastic racks.
47| Prepare reagents and the calibrants (see REAGENTS SETUP).
▲CRITICAL STEP Calibrants and some of the reagents must be prepared the day of the assay.
48| Mix the Dynabeads gently until completely dispersed. Using an 8-well strip of 0.2-mL thin-wall tubes, put 75 μL of magnetic beads in each of the 8 wells and place in the proper holder on the robot deck ().
? TROUBLESHOOTING
49| Add fresh 0.1 % TFA (See REAGENTS) to the TFA trough on the robot ().
50| Prepare a 96-well half-skirted microtiter plate: the first column should contain 100 μL 50% acetonitrile in each well; the second column should contain 100 μL of matrix.
▲ CRITICAL STEP 50% acetonitrile should be made fresh on the day of experiment.
51|Tightly seal the wells of the 96-well half-skirted microtiter plate with a piece of self-adhesive foil using a rubber roller. Attach the plate to the cooler rack with a piece of Parafilm wrapped around the microtiter plate and the inner part of the cooler rack, so the plate will remain in place when the robot tips pierce the foil. Also, use 4 mini binder clips attaching the cooling rack to the robot to prevent it from falling out of position. The cooler rack should be kept at -20°C for several hours before it can be used for a clinical run. Note that the rack changes color with temperature making it easier to know whether it can be used for a run.
? TROUBLESHOOTING
52| Place a clean MALDI target plate in position ().
53| Transfer 20 μL from each aliquot (50 μL) to the proper well of a half-skirted 96-well microtiter plate. Note that only the first half of the microtiter plate contains serum samples. The second half will be used during the SPE automatic process. If prepared properly, the aliquots should have thawed in the same positions as they will be placed in the microtiter plate. Check corresponding randomization layout to make sure
▲CRITICAL STEP Reactions must be prepared on wet ice.
54| Add 10 μL of the non-degradable reference peptide mix (see REAGENTS SETUP) to yield total of 2 pmol per reaction.
▲CRITICAL STEP Change pipette tips in between each sample to avoid cross contamination.
55| Add 10 μL of your substrate (for example, C3f-13C) (see REAGENTS SETUP) to yield a total of 100 pmol per reaction.
▲CRITICAL STEP Change pipette tips in between each sample to avoid cross contamination.
56| Cover the 96-well plate with aluminum foil and incubate at RT (19-24°C) for an appropriate time which has already been determined in steps 4 and 5 (for example, C3f-13C requires 3 h incubation time).
? TROUBLESHOOTING
57| Place plate in the `plate holder 1' position ().
58| After the incubation of reactions is complete, flush the robot one more time.
59| Run the TECAN program. The robot will carry out the steps described in
Box 3.
? TROUBLESHOTTING
▲ CRITICAL STEP This protocol can also be done manually using 96-well plate holders, magnet racks and eight-channel multipipettes. However, the manual procedure is not recommended because of a lack of reproducibility.
Mass Spectrometry TIMING ~ 2 h
60| Insert the target plate to be analyzed in the MALDI-TOF mass spectrometer.
61| Go to FlexControl, load the correct Autoexecute file and start the automatic run, this will perform the steps described in
Box 4. The method referred to in this Autoexecute file should be updated to use the proper laser energy. The method is a Reflectron mode.
BOX 4 | `ExoproteaseAssay.par' FILE CREATION STEPS1. Go to the first spot in the MALDI plate.
2. Start collecting the spectrum for the 0.7- 4 kDa m/z: average 400 laser shots, delivered in four sets of 100 shots (at 50 Hz) to each of four different locations on the surface of the matrix spot (performing a spiral movement from one set to the next one).
3. Acquire spectra in reflectron mode geometry under 20 kV (16.45 kV during delayed extraction) of ion accelerating and -1.4 kV multiplier potentials and with gating of mass ions set to m/z 500. Delayed extraction is maintained for 80 ns to give time lag focusing after each laser shot. The effective laser energy delivered to the target is carefully controlled to be 16 μJ (± 10%) per shot. The entire irradiation program is controlled using the instrument's `Autoexecute' function, which automates the following steps: loading the correct spectrum acquisition method (with the optimized instrument settings), going to the right spot on the MALDI plate, and delivering the four sets of 100 shots.
4. Keep collecting mass spectra for all of the samples and calibrants.
? TROUBLESHOOTING
▲ CRITICAL STEP The spectra can also be collected manually, going spot-to-spot and delivering sets of laser shots using the acquisition method optimized for the mass range (0.7 - 4 kDa). However, the manual procedure is very tedious and subject to bias as the operator aims the laser beam at an approximate location of choice.
62| Open the FlexAnalysis program after the run has been completed. Open all the spectra that have just been generated using the `open multiple spectra' function and check that all the spectra have been created (by comparing the spectra with the Autoexecute file of the run).
63| Perform spectral analysis. This can be done through the `Process' functions in FlexAnalysis. Alternatively, spectra can be translated to tab-separated text files that contains both x and y coordinates for each spectrum as separate columns using a custom-written macro (see EQUIPMENT). Once the spectra are in text format, they can be analyzed using different software.
Signal Processing and Quantitation
64| Convert mass spectra from binary format to ASCII files, containing two columns of data (x: m/z; y: intensity), by a custom-written macro in FlexAnalysis (Bruker Daltonics, Billerica, MA).
65| Transfer data to MATLAB. Do additional data processing with a custom script, “qcealign” invoking qpeaks, a commercial program, to do smoothing, base-line subtraction, and peak labeling (see EQUIPMENT)
13.
66| Use Signal Processing & Preview (SPP), a custom-built graphical viewer for spectra in ASCII format, to plot raw and processed spectra side by side to review the outcome of signal processing and to optimize parameters for Qpeaks
13. The singlet width parameter should be set to 1500, thereby specifying the resolution for processing. After processing, a peak table with normalized intensities, smoothed curve, and baseline is created for each spectrum before alignment.
67| Use the custom algorithm, “Entropycal”, to align sample data files to a reference file (a spectrum sum of all the sample files) using a minimum entropy algorithm by taking unsmoothed (“raw”), baseline-corrected data
13. All peaks in the rows within (
m/z) of the strongest peak at a given
m/z value are binned together, and a spreadsheet containing the normalized aligned data is created for further data analysis.
68| Analyze this spreadsheet in conjunction with a custom visual interface for processed spectra, “Mass Spectra Viewer” (see EQUIPMENT), to select only those peaks that correspond to the peptide ladder(s) resulting from peptide substrate degradation, and to the spiked reference peptides
12.
69| Calculate the ratios of the normalized ion intensities of the DEGR peaks over the normalized ion intensities of the corresponding REF peptide peaks for each rung of the ladder (this can be completed in Excel). The spreadsheet resulting from the replicate analyses should be subjected to the same process, and the DEGR/REF ratios for each of the rungs of the same peptide ladder from the replicate experiments averaged.
Importing data and creating experiments in GeneSpring
70| In the same location as the parameter file, Qcealign will have created a “processed01.out” file; check this is present This is the binned, aligned peaklist for all samples.
71| Launch GeneSpring. From the File Menu, select “Import Data”. In the resulting dialog window, navigate to and select the “processed01.out” file. This is the binned, aligned peaklist for all samples of the project; it is in the same location as the parameter file and it is created by
Qcealign 13. A dialog will appear asking for the name. Click “Create a New Genome” and enter a name for the dataset. For this example, we will use “CancerStudy” which is the name of the parent folder where the spectra are stored. Then click “Next”. Another window appears. The first column should be set to “Gene Identifier”. The remaining columns should be set to Signal. Then, click next to dismiss this window. Another window will appear asking to import more data. Since we have none, click next again. A window will appear, saying how many samples are created. Click “Yes” to continue. Then GeneSpring will create an experiment. When you hit “Next” to continue to create an experiment, a warning may appear saying that a column lacks a title. This means that there is an empty column. Scroll through and locate the empty column (usually the last column) and change the column designation from “Signal” to “Unused”.
72| To create an experiment, first type a name for the Experiment and hit “Save.” We will call it “CancerStudy” for our data set. Next, a window appears to set the different statistical properties of the experiment. Click on “Normalizations.” Since Qcealign already normalized the data, we need no normalizations. Remove all default normalizations using the “delete” button and press “OK.” Then, click on the “Parameters” button. In the appearing window, enter the parameters you want to study. Click on “New Parameter” and enter the clinical information for each sample. Enter “Parameter1” as the name for the Column, accepting the default column properties. If the Parameter is Numeric (i.e. body weight), then set “Numeric” to “Yes”, otherwise leave it as “No.” Similarly, set the value for “Logarithmic” to Yes or No, depending on whether the data is in log scale. One can enter many parameters with which to analyze the data. Copy the parameter information from the spreadsheet obtained in step 8. Press “Save” when done.
73| Next, click on “Experiment Interpretation”. In the resulting dialog, for the Interpretation named “Default Interpretation,” set Mode to “Ratio (signal/control)”. Also set the clinical Parameters. Make sure “Use Cross-Gene Error Model in this Interpretation” is unchecked. No conditions should be excluded. Press “Save” to continue. Since we don't use the Error Model, skip the “Error Model” button and press “Close.”
Statistical Analysis
74|Depending on the number of samples available two different strategies can be used to do the statistical analysis. In option A, training and test set are generated if enough samples are available. The training set is used to build a statistical model that later can be validated using the test set. In option B, the statistical analysis is done on the entire dataset using a cross-validation approach.
Option A Setting up Training and Test Sets (i) From the Experiment Menu, select “Create New Experiments”. Select the “Filter on Parameter” tab. One of the added Parameters contains information specifying whether samples were part of the “Training” or the “Test” set. In that parameter, the training set samples are labeled as “Training” while the additional test set samples are marked as “test.”
(ii) From the “Filter On Parameter,” we select the Parameter Value “Training” and click on “Add All.” This adds all the samples from the training set to a new experiment. Click “Next”. A window appears. Click “Import Parameter.” Select the previous experiment. All the parameters from that experiment will appear. Click “Select All” and then “OK.” The values for the training samples will be moved into this experiment. Press “Next.” A window asking about Normalizations appears. Remove all normalizations as before and press “Next.” A window asking for a name for the Experiment appears. We name this set “Training.” All dialogs disappear and the main window reappears.
(iii) With the new “Training” experiment selected, from the Experiment window, select “Experiment Interpretation”. Change Mode to “Ratio (signal/control)”.
(iv) Repeat Steps (i) to (iii) to create a new Experiment called “Test” where the samples labeled “Test” are moved in with their corresponding parameter information.
(v) Differential expression: From the Tools Menu, select “Statistical Analysis (ANOVA)”. Make sure it indicates “all genes” next to the “Choose Gene list” button. Choose the default Interpretation for the Training experiment just created. Then click “Choose Experiment” and the name should appear next to the button. Mode should be “Ratio (signal/control)”. Make sure that Cross-Gene Error Model is inactive (Click next to dismiss this window). Click on the “Parameter to Test” pull-down menu and select the desired parameter. The settings should be the following:
-Test Type should be set to “Non-parametric” test.
-False Discover Rate should be set to .05
-Multiple Testing Correction should be set to “Benjamini and Hochberg False Discovery Rate.”
- There should be no Post Hoc Tests?
▲ CRITICAL STEP If the Experiment and Cross-Gene Error Model values are not correct, click “Close”. Then go back to the Experiment Interpretation window and set the proper values. One can do this be selecting “Experiment Interpretation” from the “Experiment” Menu.
(vi) Click “Start” and Save your results (called a gene list) with an appropriate name. We will call ours “CancerStudy_p05” and hit “Save” to record the results. The p-value can be changed to a more stringent value as needed. We generally save a gene list from the results of p<0.00001.
(vii) Class prediction: From the “Tools” Menu, select “Class Prediction.” Set the Training Experiment to be the Training Set. Working in the K-Nearest Neighbors tab, select the “Parameter to Predict.” We select “Parameter 1.” Set Gene Selection Method to be “All Genes from Selected List.” Select the Gene List saved from step v. Set the decision cutoff for p-value ratio to 1. Press “Start.” To save the results from the pop-up window, copy and paste into an Excel spreadsheet. To optimize the results with the training set, repeat by varying the number of neighbor from 3 to 9. Once optimal conditions (as judged by the lowest prediction errors) are found through iterative cross-validation using the training set, select the “Test” experiment and set it as the “Test Set”. Change Function from “Crossvalidate Training Set” to “Predict Test Set.” Keeping the values for optimal conditions found from the Training Set, press “Start.” Save the Results in an Excel spreadsheet.
(viii) For Support Vector Machine, select the second tab in the “Class Prediction” window. Keeping the same training set, change function to “Crossvalidate Training set.” Set “Parameter to Predict” to the appropriate parameter. We select “Parameter 1”. The Gene list created in step v should be the one used as before. Gene Selection Method remains “All genes from Selected List,” unchanged. Press “Start.” Save the results in an Excel spreadsheet by copying and pasting. Optimize the conditions for the training set by varying the kernel function. Occasionally, it might be necessary to change the scaling factor from 0 to 1 or 2 (that depends on how balanced the number of samples are in the groups tested). Once optimal values are found, then set the Test to the “Test” Experiment. Change the Function to “Predict Test Set.” Save the results in an Excel Spreadsheet. Press “Close” to exit Class Prediction.
Option B Class prediction with Leave-one-out-crossvalidation (LOOCV) (i) From the “Tools” Menu, select “Class Prediction.” Set the Training Experiment to be the Training Set. Working in the K-Nearest Neighbors tab, select the “Parameter to Predict.” We select “Parameter 1.” Set Gene Selection Method to be “All Genes from Selected List.” Select the Gene List saved. Set the decision cutoff for p-value ratio to 1. Press “Start.” To save the results from the pop-up window, copy and paste into an Excel spreadsheet. To optimize the results with the training set, repeat by varying the number of neighbor from 3 to 9. Once optimal conditions (as judged by the lowest prediction errors) are found though iterative cross-validation using the training set, select the “Test” experiment and set it as the “Test Set”. Change Function from “Crossvalidate Training Set” to “Predict Test Set.” Keeping the values for optimal conditions found from the Training Set, press “Start.” Save the Results in an Excel spreadsheet.
(ii) For Support Vector Machine, select the second tab in the “Class Prediction” window. Keeping the same training set, change function to “Crossvalidate Training set.” Set “Parameter to Predict” to the appropriate parameter. We select “Parameter 1”. The Gene list created should be the one used as before. Gene Selection Method remains “All genes from Selected List,” unchanged. Press “Start.” Save the results in an Excel spreadsheet by copying and pasting. Optimize the conditions for the training set by varying the kernel function. Occasionally, it might be necessary to change the scaling factor from 0 to 1 or 2 (that depends on how balanced the number of samples are in the groups tested). Once optimal values are found, then set the Test to the “Test” Experiment. Change the Function to “Predict Test Set.” Save the results in an Excel Spreadsheet. Press “Close” to exit Class Prediction.
Visual inspection of processed spectra (MassSpectra Viewer -- MSV) 75| Create a viewer definition file. This tells the viewer which samples are in which clinical group (see Supplementary material 4). The definition file needs a minimum of three columns and looks like the example below. The basefile name column does not use “_1” or “_2” - just the name. The MSV automatically adds the suffix when looking for the ASCII files. The next columns (zero to an unlimited number) list various demographic or clinical parameters, such as gender, age, etc… No spaces are to be used anywhere. The columns are tab-separated:
| # | Basefile name | Group | Parameter 1 (gender) | Parameter 2 (age) | Parameter 3 (xyz) |
| 1 | 000ZG60005DVA | Cancer1 | Female | 56 | |
| 2 | 000ZG70007DCK | Cancer1 | Male | 61 | |
| 3 | 000ZG80003VKD | Control | Male | 58 | |
| 4 | 000ZG90002EGO | Control | Female | 52 | |
76| Go to MATLAB and type “masspectraviewer” in the command window
12. Click on “Folder Setting”. A window appears. Press “Browse” next to the “Folder of ASCII files” and select the directory holding the processed ASCII data files. It should be called “Final_ASCII_Spectra” according to our data structure and located inside the Processed01 folder. Press the second “Browse” button and select the “Matrix” folder, also in the Processed01 folder. Press the last “Browse” button and select the viewer definition file, which was created in the previous step. Then press “Save & Return.” Then press “Make Matrix.” Once that is done, press “Group Color.” A dialog box will appear. The clinical subgroups will appear in the left box. Select each item. Pick a color by scrolling up and down the color bar. Once the appropriate color is picked, press the “Select” button to assign that color to that subgroup. Repeat for each subgroup. Once all the groups are given a color, press “Save and Return.” Once the main screen returns, press Update. Zoom in on peaks selected for the clinical parameter and verify the results.
▲ TROUBLESHOOTING
77| Repeat steps 75-76 for each additional clinical parameter that was used during the statistical analysis.
TIMELINE
Testing new candidate exoprotease substrates (Steps 1-5), ~2-14 h
Synthesizing labeled/unlabeled exopeptidase substrates and reference peptides (Step 6). Timing depends on the turnaround times of the peptide synthesis facility; typically ~2 to 6 weeks
Quantitate newly synthesized peptides by amino acid analysis (Step 7).
Timing depends on the turnaround times of the amino acid analysis facility; typically ~1 to 4 weeks
Testing new labeled exoprotease substrates and reference peptides in serum pools (Steps 8-16), ~2-15 h
Blood collection and Serum preparation and storage (Steps 17-35), ~21 h
Set up of the robot run (Steps 36-45), ~4 h
Automated solid-phase extraction (Steps 46-59), ~6 h
Mass Spectrometry (Steps 60-63), ~2 h
Data Analysis (Steps 64-77), ~8 h
TROUBLESHOOTING : Troubleshooting Table
ANTICIPATED RESULTS Although protease assays that use peptide substrates and a MALDI-TOF MS read-out may be employed to answer a variety of questions in the health and life sciences (see Introduction), the only applications so far of the unique platform/method combination described here have been to screen for proteolytic activities in serum and plasma
8, 11, and to monitor purification of individual exopeptidases from cancer cell secretomes (unpublished). In contrast to C3f, a surprisingly large number of `other' serum peptides that have been tested as candidate assay substrates didn't degrade in serum or plasma, not even after prolonged incubations (unpublished observations). Only fibrinopeptide A (FPA: ADSGEGDFLAEGGGVR), FPA without the N-terminal Ala (FPA(-A): DSGEGDFLAEGGGVR), a peptide mapping to the C-terminus of the clusterin beta-chain (Clus2: RPHFFFPKSRIV), and a peptide derived from alpha-1-antitrypsin (A1AT: LMIDQNTKSPLFMGKVVNPTQK) have proven utility as assay substrates at this time
11. Substantial degradation of 100 pmoles of any of the five aforementioned substrates can be anticipated within 15 min (FPA) to 15 h (FPA(-A)) after addition to 20 μL human control serum or plasma
11. More specifically, the C3f-based system performance verification test described in
Box 5 should yield the indicated results after 1-3 h incubation, with C.V.'s of normalized MS peptide-ion intensities of 5 selected metabolites (see
Box 5) all below 15% over 10 technical replicates at fixed incubation times. This also applies to replicate assays using Clus2, FPA(-A) and A1AT peptides as substrates (2 h incubation for Clus2; 6-15 h for the others). FPA degrades within minutes after addition to serum, which makes accurate timing and reproducibility exceedingly difficult, especially when dealing with large sample sets, typically resulting in C.V.'s >30%.
The current protocol has also been used to assay serum samples obtained from 48 patients with metastatic thyroid cancer and 48 gender-and age-matched healthy controls, and utilizing 3 different peptide substrates (C3f; FPA(-A); Clus2) in separate analyses. The full array of peptide metabolites (a
de facto 26-member `mini-peptidome') was then quantitated and subjected to multivariate statistical analysis and machine learning methods to ultimately yield class predictions with 94% sensitivity and 90% specificity
11. It should be noted, however, that none of these 26 individual peptides comprising the assay degradome could, by itself, be used as a quantitative `biomarker' (in the form of a MALDI-TOF MS-generated ion intensity signal) to completely distinguish all the cancer patients from healthy controls. At best, the ion intensities of certain peptide peaks, as for instance for the three C3f degradation products in the color-coded spectral overlays shown in , are on average higher in one particular group than in the other, but without being completely separated. In our hands, no such perfect separation has been observed until now between any two cancer patient / control groups, for any of the degradation products of any substrate peptide tested (unpublished observations). Considering the complexity of blood protease panels, and the unique concentrations and activities of each of the individual members acting in concert, and in view of the biological variability, it is perhaps unrealistic to anticipate discovery of a single peptide metabolite that will have
bona fide diagnostic capacity. Instead, the metabolites must either be interpreted as diagnostic `patterns' (as is common practice at the moment) and/or specific assays should be developed for the individual enzymes.