This is an open-access article distributed under the terms of the Creative Commons Attribution Noncommercial Share Alike 3.0 Unported License, which allows readers to alter, transform, or build upon the article and then distribute the resulting work under the same or similar license to this one. The work must be attributed back to the original author and commercial use is not permitted without specific permission.
Assessment of the lipid–array data quality. (A, B) Comparison with sets of literature curated and genetic interactions. (A) Summary of the two reference data sets and strategies used to assess quality. Genetic coverage is defined as the percentage of physical protein–lipid interactions covered by the data set of genetic interactions. (B) Estimation of accuracy based on PtdInsPs metabolic pathway. The left column shows the number of proteins in the literature-derived reference set that are covered (dark blue) or not (magenta) by the data set of genetic interactions. The right column shows the fraction of proteins devoid of LBD and that did not bind PtdInsPs in the lipid–array that interact genetically with enzymes involved in the synthesis of these lipids (light blue, background coverage by genetics). The central column shows the estimation of accuracy (fraction of true positives) in the lipid–array data set assuming that genetic coverage results from the combination of true positives (that will have the same genetic coverage as the literature-derived reference data set, in dark blue) and false positive (with a background coverage, in light blue). The estimated fraction of true positive that is not covered by genetics is shown in magenta. Numbers on top of the columns are overall genetic coverage for each set of proteins (*P=0.015 and **P=0.035).