R/filter_mscore_fdr.R
filter_mscore_fdr.Rd
This function controls the protein FDR over a multi-run OpenSWATH/pyProphet output table and filters all quantitative values to a desired overall/global peptide FDR level. It first finds a suitable m-score cutoff to minimally achieve a desired global FDR quality on a protein master list based on the function mscore4protfdr. It then finds a suitable m-score cutoff to minimally achieve a desired global FDR quality on peptide level based on the function mscore4pepfdr. Finally, it reports all the peptide quantities derived based on the peptide level cutoff for only those peptides mapping to the protein master list. It further summarizes the protein and peptide numbers remaining after the filtering and evaluates the individual run FDR qualities of the peptides (and quantitation events) selected.
filter_mscore_fdr(
data,
FFT = 1,
overall_protein_fdr_target = 0.02,
upper_overall_peptide_fdr_limit = 0.05,
rm_decoy = TRUE,
score_col = "m_score"
)
Annotated OpenSWATH/pyProphet data table.
Ratio of false positives to true negatives, q-values from [Injection_name]_full_stat.csv in pyProphet stats output. As an approximation, the q-values of multiple runs are averaged and supplied as argument FFT. Numeric from 0 to 1. Defaults to 1, the most conservative value (1 Decoy indicates 1 False target). For further details see the Vignette Section 1.3 and 4.1.
FDR target for the protein master list for which quantitative values down to the less strict peptide_fdr criterion will be kept/reported. Defaults to 0.02.
Option to relax or tighten the false discovery rate limit.
Logical T/F, whether decoy entries should be removed after the analysis. Defaults to TRUE. Can be useful to disable to track the influence on decoy fraction by further filtering steps such as requiring 2 peptides per protein.
Defines the column from which to retrieve the m_score. If you use JPP (Rosenberger, Bludau et al. 2017) this can be used to select between Protein and transition_group m_score.
Returns a data frame with the filtered data.
data("OpenSWATH_data", package="SWATH2stats")
data("Study_design", package="SWATH2stats")
data <- sample_annotation(OpenSWATH_data, Study_design)
data.fdr.filtered<-filter_mscore_fdr(data, FFT=0.7,
overall_protein_fdr_target=0.02,
upper_overall_peptide_fdr_limit=0.1)
#> Target protein FDR:0.02
#> Required overall m-score cutoff:0.00017783
#> achieving protein FDR =0
#> filter_mscore_fdr is filtering the data...
#> -----------------------------------------------------------
#> finding m-score cutoff to achieve desired protein FDR in protein master list..
#> finding m-score cutoff to achieve desired global peptide FDR..
#> Target peptide FDR:0.1
#> Required overall m-score cutoff:0.01
#> achieving peptide FDR =0.00864
#> -------------------------------------------------------------
#> Proteins selected:
#> Total proteins selected: 10
#> Thereof target proteins: 10
#> Thereof decoy proteins: 0
#> Peptides mapping to these protein entries selected:
#> Total mapping peptides: 243
#> Thereof target peptides: 243
#> Thereof decoy peptides: 0
#> Total peptides selected from:
#> Total peptides: 246
#> Thereof target peptides: 246
#> Thereof decoy peptides: 0
#> -------------------------------------------------------------
#> Individual run FDR quality of the peptides was not calculated
#> as not every run contains a decoy.
#> The decoys have been removed from the returned data