Reproducibility studies are useful in determining the degree of agreement among pathologists in morphological diagnoses, and in the assessment of the universal applicability of histological criteria in classification schemes. The grading of invasive breast cancer,
6,7,8,9,10,11 the evaluation of different DCIS classifications,
12–16 and the assessment of diagnostic agreement between general and expert breast pathologists in core biopsy interpretation
17,18 and among community based surgical pathologists
19 have been the subject of several published reports. Results from these studies have provided insight into problem issues in diagnostic breast pathology and identified areas that require further refinement in classification schemes, particularly in honing objective histological criteria to allow improved reproducibility and correlation with clinical outcome.
Rosai in 1991 defined borderline epithelial lesions of the breast as “a type of proliferative process placed somewhere between the usual type of hyperplasia and carcinoma in situ, both in terms of morphological features and propensity for the development of invasive carcinoma”, and found an “unacceptably high” degree of interobserver variability among a group of experienced pathologists in the interpretation of such lesions.
20 Although most CCLs do not fall into the “borderline” category, those with cytological atypia, also referred to as flat epithelial atypia,
21 present similar diagnostic problems, with a suggestion that these may either be precursors of or the earliest morphological manifestation of DCIS. It has also been recommended that the identification of CCL with cytological atypia on core biopsies should lead to an open excision, because a more advanced lesion is seen in about a third of cases.
21Therefore, it is of paramount importance that the spectrum of CCL should be recognised by practising surgical pathologists who interpret breast specimens, specifically in identifying CCL with cytological atypia, so that proper management can be instituted. This is particularly relevant for Singapore, because we have a National Breast Screening Programme (BreastScreen Singapore) and a proportion of core biopsies carried out for mammographic calcifications contain CCL.
In our study, interobserver agreement with the tutor’s diagnoses immediately after a didactic session on CCL varied from moderate to substantial, indicating that an acceptable degree of agreement can be achieved through an educational session incorporating digitised images. We believe that the results from this first assessment provide an unbiased reflection of interobserver variability, because all answers proffered by the participating pathologists were independently concluded, and an equal amount of time was given to the presentation of each image for their analysis. In the second exercise, the interobserver agreement ranged from fair to substantial, with the κ values of four of the pathologists being lower than in the first exercise, confirming that the recollection of histological criteria and diagnostic reproducibility are better when criteria have been just expounded. These results also imply that recurrent educational sessions may be needed to maintain reproducible diagnoses, particularly for the relatively new and emerging entity of CCL. The intraobserver agreement varied from fair to perfect, which can also be explained by the diminishing consistency of application of histological criteria with increasing time since the educational session.
When we assessed the degree of agreement among all pathologists for individual images, there was complete diagnostic agreement in only six of the 39 images during the first exercise, which improved to 10 in the second exercise. This improvement was mainly because of agreement among all pathologists with regard to DCIS images during the second exercise, suggesting that for the extreme end of the spectrum, where the impact on management is greatest, pathologists learn and retain diagnostic criteria more effectively, especially after a post exercise discussion. It could also be that the severe degree of cytological atypia and the characteristic architectural abnormalities of the presented images were more readily appreciated after they had been specifically pointed out during the discussion after the first exercise. Overall, however, agreement among pathologists decreased during the second exercise, with at least seven of the eight pathologists agreeing in 12 and 17 images in the second and first exercises, respectively. In fact, in the second exercise, there were two images in which common answers were given by only two pathologists.
“Closer scrutiny of all columnar cell lesions at medium to high magnification is necessary to recognise any accompanying cytological atypia that may have implications for management”
When stratified according to the type of CCL, the lowest numbers of complete agreement for individual images were in the categories of CCC with cytological atypia and CCC, in both exercises. This underscores the fact that cytological atypia is subjective, and the threshold between pure CCC and that with cytological atypia is difficult to delineate. A concerted effort should be made to define what constitutes cytological atypia in CCL in a semiquantitative manner, similar to the way that nuclear grading of DCIS is taught, applied,
22,23 and validated.
24 This is so that CCL with cytological atypia can be diagnosed more reproducibly and consistently, especially because this lesion has management implications for breast core biopsies. Interestingly, among images/lesions that were repeated during the sequential presentation of all 39 images, the greatest internal variation in pathologists’ diagnoses occurred when the same lesion was presented at different magnifications, with cytological atypia being noticed only at medium to high magnification. This corroborates the opinion expressed in a review by Schnitt that cytological atypia in these lesions becomes evident only at high magnification,
21 so that all CCLs should not just be accorded a cursory view, but should be subjected to a high magnification examination.
Take home messages- We assessed inter/intraobserver variability in the interpretation by seven staff pathologists of a series of digitised images of columnar cell lesions (CCL) of the breast after a tutorial on breast CCL
- Interobserver and intraobserver agreement was good for ductal carcinoma in situ, but more effort is needed to improve diagnostic consistency in the category of columnar cell changes with cytological atypia
- Continued awareness and study of these lesions are necessary to enhance recognition and understanding
The limitations of our study include the fact that digitised images were used instead of histological sections on glass slides, which represent the real practical situation. However, it can be argued that for teaching, focusing on a specific image may be advantageous to the learning process. Even in past studies using circulated glass slides, an area of interest is circled or marked on the slide for participating pathologists.
12,13,20 Other potentially contentious issues that may have affected our findings are: the use of the same series of images for the second evaluation (such that the inter/intraobserver reproducibility results from this second exercise may be that of pure memory rather than true assimilation of learnt criteria); no time limitations imposed on the second evaluation (with possible review option by pathologists, thereby allowing comparison of and realisation that there were repeated images/lesions that were interspersed within the series of images); an unequal number of images from each category; and differing levels of interest in breast pathology and motivation among participating pathologists.
Nevertheless, our study shows that moderate to substantial interobserver reproducibility for breast CCL can be achieved after a tutorial, although it is recommended that follow up educational sessions should be conducted to maintain satisfactory consistency in the evaluation of these lesions. At the DCIS end of the spectrum, a diagnostic reproducibility of 100% can be achieved. The category that requires more attention in terms of refining diagnostic criteria is that of CCL with cytological atypia, where more objective guidelines as to what constitutes cytological atypia should be reached. Closer scrutiny of all CCLs at medium to high magnification is necessary to recognise any accompanying cytological atypia that may have implications for management. This difficulty is similar to that seen in studies of reproducibility in the diagnosis of atypical ductal hyperplasia which, unfortunately, has remained refractory to efforts to improve diagnostic consistency.
25 Indeed the most (as yet unpublished) recent version of the UK breast screening pathology guidelines recommends that CCL with atypia should be categorised as equivalent to atypical ductal hyperplasia—that is, B3.
In conclusion, further reproducibility and follow up studies will assist in arriving at a clinically relevant consensus classification for CCL of the breast. Nuclear morphometry can help define thresholds of nuclear atypia, and molecular studies on microdissected lesions may provide important biological clues beyond morphology. From a practical standpoint, multiple step sections and consultation with a breast pathologist may be prudent when there is uncertainty about the presence or otherwise of cytological atypia in CCL, particularly in core biopsies.