Assessment of the lipid–array data quality. (A, B) Comparison with sets of literature curated and genetic interactions. (A) Summary of the two reference data sets and strategies used to assess quality. Genetic coverage is defined as the percentage of physical protein–lipid interactions covered by the data set of genetic interactions. (B) Estimation of accuracy based on PtdInsPs metabolic pathway. The left column shows the number of proteins in the literature-derived reference set that are covered (dark blue) or not (magenta) by the data set of genetic interactions. The right column shows the fraction of proteins devoid of LBD and that did not bind PtdInsPs in the lipid–array that interact genetically with enzymes involved in the synthesis of these lipids (light blue, background coverage by genetics). The central column shows the estimation of accuracy (fraction of true positives) in the lipid–array data set assuming that genetic coverage results from the combination of true positives (that will have the same genetic coverage as the literature-derived reference data set, in dark blue) and false positive (with a background coverage, in light blue). The estimated fraction of true positive that is not covered by genetics is shown in magenta. Numbers on top of the columns are overall genetic coverage for each set of proteins (*P=0.015 and **P=0.035).
Articles from Molecular Systems Biology are provided here courtesy of
The European Molecular Biology Organization and Nature Publishing Group