|Home | About | Journals | Submit | Contact Us | Français|
Involved with collection and analysis of data, manuscript writing, critical revision and approval of the final manuscript (MCCC), involved with conception of study objectives and design, interpretation of data, manuscript writing and critical revision and approval of the final manuscript (AJM), involved with conception of study objectives and design, interpretation of data, manuscript writing and critical revision and approval of the final manuscript (GBBG).
Non-alcoholic fatty liver disease (NAFLD) is a burgeoning global health concern. In the subset of NAFLD patients with non-alcoholic steatohepatitis (NASH), the presence of significant fibrosis at index assessment is associated with poor prognosis and increased mortality. Hence, there is a growing need to accurately assess and stage fibrosis. Liver biopsy, the current gold standard, has limitations with sampling error and is invasive, with associated inherent risk. This has led to a host of non-invasive means of assessing fibrosis, which has garnered relevance in a disease that requires serial assessment of fibrosis longitudinally over time. This review discusses, comprehensively, the various tools available to the clinician for the assessment of fibrosis, including the various scoring systems used in liver biopsy, the non-invasive means of serum biomarkers, such as the highly-validated NAFLD fibrosis score, and the imaging-based modalities, such as transient elastography and magnetic resonance elastography.
Non-alcoholic fatty liver disease (NAFLD) is the most common chronic liver disease worldwide, with a global prevalence estimated at 25% of the world’s population, but with geographical variability; the highest prevalence has been noted amongst western countries.1,2 NAFLD is closely associated with obesity, type 2 diabetes, dyslipidemia and other metabolic risk factors, and is commonly regarded as the hepatic component of metabolic syndrome.1,3–5 Studies have consistently demonstrated the increasing clinical burden of NAFLD, both as a function of increasing disease prevalence in parallel to the obesity and metabolic syndrome epidemic and also corroborated by continued emergence of natural history data.3,6,7
In the United States, NASH-related cirrhosis is the fastest growing indication for liver transplantation. Separately, another study demonstrated a 10-fold increase in NASH-associated hepatocellular carcinoma (HCC) from 2000 to 2010, accounting for 34.8% of all HCC cases.8 In addition, relative to the general population, patients with NAFLD have been reported to have increased mortality, with the top three causes of death attributed to cardiovascular-, malignancy- and liver-related causes.9–12 From the public health perspective, the direct medical cost of caring for a patient with NAFLD has been estimated to be about $1,613 or €1,163 per patient annually in the United States and Europe respectively, representing a substantial economic burden.6 This sets a worrying precedent for the future landscape of liver disease.
NAFLD is defined by the presence of excessive hepatic steatosis (5% or more) in the absence of other liver disease etiologies.13–15 It is a heterogeneous disease, manifesting a spectrum of phenotypes, ranging from simple steatosis, which is traditionally considered relatively benign, to the more aggressive non-alcoholic steatohepatitis (NASH), seen in 10 to 25% of the cases. Up to one-third of the patients with NASH may progress onwards to liver cirrhosis and other liver-related complications, such as HCC. In particular, significant fibrosis at the time of diagnosis is the most important histological feature associated with mortality in NASH, regardless of the presence or severity of other histologic features.16–18 In addition, with the advent of potential emergence of potent anti-fibrotic agents for the treatment of NASH, the accurate assessment of liver fibrosis becomes even more pertinent, not only for identifying suitable patients for treatment, but also in the evaluation of treatment efficacy.
In this review, we explore the different modalities of fibrosis assessment in NAFLD, ranging from histology and non-invasive biomarkers to predictive scores and imaging measures.
The liver biopsy, albeit imperfect, remains the gold standard for the diagnosis, assessment of disease activity and response to treatment in NAFLD. In an early landmark study, Skelly et al.19 highlighted the importance of liver biopsy in the context of NAFLD; in their cohort of 354 patients who underwent liver biopsy for otherwise unexplained abnormal liver tests, 66% had fatty liver, 50% of which had NASH, while a further 19% had other treatable causes, such as autoimmune hepatitis, hemochromatosis and primary biliary cirrhosis.
An adequate histology specimen would depend on obtaining a sufficient core of at least 2 cm long, comprising 10 or more portal tracts.15–20 Specimens should be processed with hematoxylin and eosin stain and specifically with Masson’s Trichrome stain to assess fibrosis. Other stains may also be used as deemed necessary by the pathologist in the interrogation of the specimen. Finally, the review of specimens should be carried out by at least two pathologists.
Fibrosis in NAFLD evolves in a distinctive pattern, usually beginning in the pericentral zone 3 region. This fibrosis can increase in density and complexity, eventually progressing to bridging fibrosis and cirrhosis.21 There are three established scoring assessment systems currently in regular use for NAFLD: the Brunt system, NASH Clinical Research Network (CRN) system and the Steatosis, Activity, Fibrosis (SAF) system, all of which incorporate the assessment of fibrosis.14,22,23 According to the Brunt system, the severity of fibrosis is defined by four stages, with stage 1 representing zone 3 perisinusoidal fibrosis, stage 2 including portal fibrosis with the zone 3 perisinusoidal fibrosis, stage 3 representing bridging fibrosis and stage 4 representing cirrhosis.22 The Brunt system was subsequently modified in the NASH CRN system, which differed from the Brunt staging by further subdividing stage 1 fibrosis into three sub-stages, whereby stage 1a and 1b represents zone 3 perisinusoidal fibrosis, differentiated by the character of collagen deposition (delicate or dense), while stage 1c represents portal or periportal fibrosis, which is more commonly seen in pediatric patients.14 The SAF system is the most recently proposed scoring system, developed from a study of 679 obese patients undergoing liver biopsy prior to bariatric surgery. Fibrosis stage (F) was assessed as Stage 0 (F0) no activity, stage 1 (F1) 1a or 1b as perisinusoidal zone 3 of 1c portal fibrosis, stage 2 (F2) as perisinusoidal and periportal fibrosis without bridging, stage 3 (F3) as bridging fibrosis and stage 4 (F4) as cirrhosis. The SAF system differs from the NASH CRN system in that it incorporates fibrosis score into the final SAF score, while for the NASH CRN system, fibrosis stage is reported separately from the NALFD activity score (known as NAS). Besides fibrosis, the liver biopsy reveals a treasure trove of other information integral to the assessment of NAFLD, including grading of steatosis, inflammatory activity and damage, which can be perused in the many excellent reviews on this subject.21,24,25
Liver biopsies, however, are not without issue. Firstly, inherent to its invasive nature are the procedural risks of bleeding, visceral injury and even death.26 Secondly, the biopsy assesses only a small fraction (1/50,000th) of the entire liver parenchyma. Whilst steatosis itself is diffuse, the distribution and degree of steatohepatitis/fibrosis within the liver can be heterogeneous; hence, inaccurate sampling and staging may result.27–30 Thirdly, considering the semi-quantitative nature of histologic assessment, there remains the potential for intra- and inter-observer variability.31,32 Furthermore, bearing in mind the huge number of patients with NAFLD and NASH potentially, liver biopsy cannot be applied as a screening tool or for first-line investigation in the approach to NAFLD.
Hence, there is a growing impetus advocating for non-invasive means of assessing fibrosis that are reliable, reproducible, easy to perform and inexpensive. The optimal test should not only allow for effective grading and staging, but also allow for the monitoring of disease progression and response to treatment.
Patients with NAFLD or NASH are largely asymptomatic at presentation. At index evaluation, routine laboratory tests, such as a serum panel of liver enzymes (albumin, bilirubin, alanine aminotransaminase (ALT), aspartate aminotransaminase (AST), gamma-glutamyl transpeptidase (GGT)), full blood count and prothrombin time/international normalized ratio may be performed. Elevated ALT and AST levels are the predominant finding in patients with NAFLD, with levels typically not more than 5 times the upper limit of normal. Individually, the liver enzymes do not correlate very well with severity of NAFLD. Indeed, in one study, normal levels of ALT and AST were seen in more than two-thirds of patients with NASH, with the entire range of histological findings of NAFLD, including active necroinflammation, fibrosis and cirrhosis found in patients with normal ALT values.33 A separate study reported that there was no difference in the rate of advanced fibrosis between patients with normal or elevated ALT and there was no optimal ALT level to predict advanced fibrosis.34 Rather it may be the pattern of AST to ALT ratio that may provide clues on the stage of fibrosis, and this concept is incorporated into several of the composite risk scores for advanced fibrosis.
The NFS is arguably the most studied scoring system and is recommended by the American Association for the Study of Liver Diseases (2012)35 and European Association for the Study of the Liver (2015)36 in the assessment of patients for advanced fibrosis. In comparison to other composite scores for advanced fibrosis, the NFS was found to perform favorably.37 The NFS was developed in a multicenter study of 733 patients with biopsy-proven NAFLD.38 Four-hundred-and-eighty patients were used to develop the scoring system, with the remaining two-hundred-and-fifty-three patients used for validation. Six variables including age, hyperglycemia, body mass index (BMI), platelet count, albumin and the AST/ALT ratio were identified to be independent indicators of advanced fibrosis. Using these variables, a regression formula was applied to create the NFS score. Two optimal cut-offs were identified, one to exclude advanced fibrosis (<−1.455) and the other to indicate the presence of advanced fibrosis (>0.676). Using these cut-offs, the NFS score was able to discriminate patients with advanced fibrosis (stage ≥3) from patients without (stage 0–2), with an area under the receiver operating characteristic curve (AUROC) of 0.82 (95% CI 0.76–0.88) in the validation cohort.
The NFS score was validated in another study comparing various non-invasive measures of advanced fibrosis, with an AUROC of 0.81 (95% CI, 0.71–0.91), performing well when compared with the FIB-4, BARD score, and AST/ALT ratio.37 In a meta-analysis of 13 studies consisting of 3,064 patients,12 the NFS had an AUROC of 0.85 for predicting advanced fibrosis (F ≥ 3). A score of <−1.455 had 90% sensitivity and 60% specificity for excluding advanced fibrosis, whereas a score of >0.67 had 67% sensitivity and 97% specificity for identifying the presence of advanced fibrosis. While the NFS represents an easily accessible tool, incorporating routine clinical parameters, and has good diagnostic performance, a considerable proportion of patients (between 20–58%) do fall into the indeterminate “grey zone”.12
Nevertheless, and perhaps most importantly, the NFS has been shown to have prognostic value for hepatic complications and mortality in patients with NAFLD.39 In a cohort study of 302 patients with 20 years of follow-up,40 the NFS cut-off of >0.67 was used to identify patients with a high probability of advanced fibrosis. During the follow-up, all-cause death occurred in 46.2%, with the composite endpoint of all-cause mortality, cardiac and/or liver complications occurring in 61.5% of patients in this group. The NFS has also been validated in an Asian (Chinese) cohort;41 a similar high negative predictive value of 91% was corroborated in this study, when the low cut-off of <−1.455 was applied. However, only two patients in that study exceeded the high cut-off of 0.676, limiting the ability of this study to validate the high cut-off. This underscores the potential differences in an Asian cohort where the development of visceral obesity and metabolic complications may occur at a lower BMI.
FibroTest® (FT) or Fibrosure represent the same test, marketed commercially under different names in the United States and Europe, respectively. It is a panel comprising total bilirubin, GGT, α2-macroglobulin, haptoglobin and apolipoprotein A1 corrected for age and sex. Besides the previous validation as a marker for advanced fibrosis in hepatitis B and C populations,37,42 it has also been evaluated in the context of NAFLD.43 Ratziu et al.43 evaluated the utility of FT in two cohort groups with biopsy-proven NAFLD (group 1, n = 170, from a single reference center) and (group 2, n = 97, from a multicenter cohort) compared with 954 blood donors as controls. The study demonstrated that FT had an AUROC of 0.86 and 0.92 for predicting F2–4 fibrosis and F3–4 fibrosis, respectively. A lower FT cut-off of 0.30 had a 90% negative predictive value for advanced fibrosis (sensitivity of 77%) while a higher FT cut-off of 0.70 had a 73% positive predictive value for advanced fibrosis (specificity of 98%).43 The caveat would be that this test may not be easily accessible, as several of the variables required (α2-macroglobulin, haptoglobin and apolipoprotein A1) are not readily measureable in most routine clinical laboratories.
The APRI score is a simple index that was initially developed and validated by Wai et al.44 in the chronic hepatitis C virus-infected population for identifying patients with advanced fibrosis (≥F3). In a retrospective cohort study of 358 patients with biopsy-proven NAFLD, an APRI score of >1 had a likelihood ratio for significant fibrosis of 4.2, with sensitivity and specificity of 30% and 92.8% respectively.46 Other studies that compared various composite predictive scores suggested that APRI performed less favorably in detecting advanced fibrosis, with AUROCs ranging from 0.67 to 0.78.37,46,47 As such, there is less enthusiasm for the use of APRI in the context of NAFLD.
The AST/ALT ratio has been shown to be reasonably accurate in detecting cirrhosis in patients with chronic hepatitis C.48 Data validating the use of the AST/ALT ratio as a stand-alone scoring system in NASH is scarce. In one NASH cohort, the AST/ALT ratio was shown to be an independent predictor for advanced fibrosis (F ≥ 3)/cirrhosis in the multivariate analysis.49 Separately, McPherson et al50 reported on the positive performance of AST to ALT ratio to detect advanced fibrosis in a cohort of 145 patients with biopsy-proven NAFLD, with an AUROC of 0.83, sensitivity of 74% and specificity of 78%. With a high negative predictive value of 93% and a modest positive predictive value of 44%, the AST to ALT ratio can be useful in ruling out advanced fibrosis as an initial screening tool.50 One caveat is the interpretation of AST/ALT ratio with age; with advancing age, ALT levels progressively decline, while AST levels remain stable. This can result in increasing AST/ALT ratios in both patients with and without advanced fibrosis. Hence, cautious interpretation of ALT and AST/ALT ratio is warranted in the context of advancing age.51 Nonetheless, the importance of the AST/ALT ratio to detect advanced fibrosis is often acknowledged and even incorporated in various other more complex scoring systems, such as the NFS and BARD.
The FIB-4 was originally developed for use in assessing fibrosis in the hepatitis C virus/human immunodeficiency virus co-infected population.52 The score comprises readily available clinical data of: age, AST, ALT and platelet counts. Shah et al.47 validated the FIB-4 for use in NAFLD, in their comparative study against other non-invasive scoring systems (FT, Fibroscan, BARD, NFS and NASH score), using a cohort of 541 patients from the NASH CRN network. In their study, the AUROC for FIB-4 for identifying patients with advanced fibrosis (F3–F4) was 0.802 (95% CI: 0.758–0847), which was comparable to the NFS but higher than the other scoring systems it was compared against. A FIB-4 score of ≥2.67, had an 80% positive predictive value and a FIB-4 score of ≤1.30 had a 90% negative predictive value for advanced fibrosis. These results were corroborated in other similar studies comparing the performance of non-invasive markers of advanced fibrosis in NASH.37,46,53,54 Interestingly, FIB-4 has also shown good performance in the prediction of mild to moderate fibrosis in NAFLD. Comparing several noninvasive models for predicting hepatic fibrosis, a FIB-4 cut-off of 1.43 had the best AUROC (0.821; 95% CI: 0.75–0.891) to detect stage 1 fibrosis or higher.55
The BAAT Score was one of the early predictive scores developed to assess risk of fibrosis in overweight patients with NAFLD. It is a composite score utilizing four variables (BMI, Age, ALT and Triglycerides) that were found to independently correlate with septal fibrosis on liver biopsy, based on multivariate analysis of a retrospective cohort of 93 patients; that finding was further validated in a prospective cohort of 66 patients.56 The BAAT score is the weighted sum of BMI (≥28 = 1, <28 = 0), age at liver biopsy (≥50 years = 1; <50 = 0), ALT (≥2N = 1, ≤2N = 0) and serum triglycerides (≥1.7 mmol/L = 1, <1.7 = 0), with a score ranging from 0 to 4. A score of 0 had 100% negative predictive value for diagnosis of septal fibrosis, with a BAAT score cut-off of 2.86 having an AUROC of 0.84 for identifying patients with septal fibrosis. Data on the utility of BAAT score remains scarce. In a more recent study comparing various scores to predict fibrosis, the performance of BAAT demonstrated an AUROC of 0.676 (95% CI: 0.577–0.774) and 0.615 (95% CI: 0.520–0.710) to detect stage 1 fibrosis or more and advanced fibrosis respectively.55
The BARD score developed by Harrison et al.57 was generated from logistic regression analysis of a retrospective cohort of 827 patients with NAFLD. Three variables were identified and used in a weighted sum score: B: BMI ≥ 28 = 1 point, AAR: AST/ALT ratio ≥ 0.8 = 2, DM = 1 point. In the validation study, a score of 2–4 had an AUROC of 0.81 (for patients with advanced fibrosis (≥F3) vs patients with simple steatosis or F0–F2 fibrosis). The BARD score was associated with an odds ratio of 17 (95% CI: 9.2 to 31.9) for advanced fibrosis (F3–F4), with a negative predictive value of 96% for ruling out advanced fibrosis if the score as 0–1. In other studies, the BARD consistently reported a high negative predictive value, ranging 81 to 97%.58–60 However, in the study by Ruffilo et al.,59 the BARD score only had a sensitivity of 51.4% and specificity of 77.2% for advanced fibrosis. The variable performance of the BARD score may be contributed to some extent by the overestimation of BMI and presence of diabetes.61
The ELF test is a simplified algorithm of the Original European Liver Fibrosis test62,63 and has been validated in a variety of chronic liver diseases. It comprises a panel of amino-terminal propeptide of type III procollagen (PIIINP), tissue inhibitor of metalloproteinase 1 (TIMP-1) and hyaluronic acid. In the initial validation cohort of 196 patients, the ELF performed well, with an AUROC of 0.90 (95% CI: 0.84–0.96) for distinguishing advanced fibrosis (≥F3). A threshold of 0.3576 had a sensitivity of 80%, specificity of 90%, positive predictive value of 71% and negative predictive value of 94% for severe fibrosis. Similar results have also been shown for the pediatric population.64 Furthermore, some data suggest that the ELF test to be a good predictor of liver-related morbidity and mortality in chronic liver disease, including NAFLD.65
Recently, the ELF score has been endorsed by the National Institute of Clinical Excellence (United Kingdom) in their guidelines for management of NAFLD, citing the high accuracy of ELF for identifying patients with advanced fibrosis. Further, the ELF score proved most cost-effective in their analysis when compared to other serum based scoring systems or imaging modalities such as vibration controlled transient elastography (VCTE) or magnetic resonance elastography (MRE).66 In adults, the guidelines recommend screening for advanced fibrosis using ELF every 3 years. As with the FT, the ELF test may have limitations in usage as it is based on markers not routinely available.
FibroMeters are panels of serum markers and clinical parameters that were developed to evaluate hepatic fibrosis depending on the underlying etiology. FibroMeters for viral hepatitis,67 alcoholic67 and NAFLD have been developed. The FibroMeter for NAFLD comprises age, body weight, glycemia, platelets, AST, ALT and ferritin. In a comparative study68 of FibroMeter for NAFLD against the NFS and APRI, the FibroMeter for NAFLD had the highest AUROC for significant fibrosis (≥F2) of 0.943 (95% CI: 0.91–0.98) compared to the NFS 0.885 (95% CI: 0.83–0.93) versus the APRI 0.866 (95% CI: 0.81–0.92). The FibroMeter for NAFLD also demonstrated excellent accuracy when three intervals of fibrosis were applied: F0/F1 (95% accuracy), F0/F1/F2 (75% accuracy), F2/F3/F4 (87.9% accuracy) with an overall accuracy of 91.9%. Overall, the diagnostic performance of Fibrometer for NAFLD was better for intermediate fibrosis stages (F1/2) and less robust for advanced fibrosis (F3/4).
Amongst the array of composite scores available to the clinician, the NFS and FIB-4 standout in having a high AUROC for advanced fibrosis, as compared to the other scoring systems,37,46,47,53–55 and have the advantage of using readily available clinical and biochemical parameters. Importantly, the NFS has been shown to have prognostic value for hepatic decompensation and mortality.39 Other composite scores, such as the ELF and Fibrometer, show promise with their excellent AUROC for advanced fibrosis;62,63 Fibrometer, in particular, showed better performance than NFS in one comparative study.68 However, the proprietary nature, and potential costs of these newer tests have perhaps limited the enthusiasm and accessibility for their mainstream use. A summary of the serum biomarker/composite scores are available in Table 1.
VCTE by Fibroscan® measures shear wave velocity of a low frequency shear wave (50 Hz) transmitted via an ultrasound probe in the liver as an estimate of liver stiffness. Conceptually, with increased liver stiffness, shear wave velocity correspondingly increases. Besides the non-invasive nature of evaluation, VCTE is advantageous in having excellent inter- and intra-operator reproducibility and is arguably able to assess a larger area of liver parenchyma (approximately 1 cm × 4 cm), the volume of which is about 100 times that of a liver biopsy, which samples only 1/50,000 of the liver volume.69,70 The liver stiffness measurement (LSM) of a normal liver is <5.5 kPa, with values ranging up to 75 kPa in disease.71 Myriad studies have explored the use of VCTE in patients with NAFLD, with data derived from both Asian and Western series in addition to adult and pediatric cohorts. Based on these studies, variable LSM cut-off values for each stage of fibrosis have been reported, with readings of 6.6–7.8, 7.1–10.4 and 10.3–22.3 kPa corresponding to stages F2, F3 and F4 respectively.72 However, there remains no universal consensus on which cut-off value to use.
In NAFLD, LSM by VCTE has been shown to have good diagnostic accuracy for the presence of fibrosis, with an AUROC of 0.927 for ≥F1 fibrosis.73,74 An excellent AUROC of 0.93 (95% CI 0.89–0.096) has been reported for advanced fibrosis (≥F3) and cirrhosis, with a negative predictive value of 90% in ruling out cirrhosis when using a cut-off of LSM 7.9 kPa.75 In a meta-analysis of 1047 patients, the pooled sensitivities and specificities for VCTE to diagnosis ≥F2, ≥F3 and F4 disease were 79% and 75%, 85% and 85%, 92% and 92% respectively.76 However, the utility of differentiating between F2 and F3 fibrosis seems less robust.75 The strength of VCTE, therefore, lies in its use as a good screening tool to: i) rule out the presence of fibrosis, and ii) differentiate advanced fibrosis (≥F3) and cirrhosis from earlier stages of fibrosis.
However, VCTE is not without shortcomings. Steatosis and hepatocellular necroinflammation has been reported to cause falsely elevated readings of LSM.77–80 One study reported the correlation of hepatic steatosis as quantified by controlled attenuation parameter (CAP), with increased LSM-graded fibrosis that may be attributed to steatosis resulting in discordance with fibrosis graded by paired liver biopsy samples.81 The authors recommend using CAP-graded steatosis to determine the likelihood of a falsely elevated LSM fibrosis grade that may be attributed to steatosis, and if the likelihood is high, to consider using a higher LSM cutoff. The impact of increased hepatic steatosis on falsely elevated LSM remains to be clarified. Further, cholestasis, increased central venous pressure and recent food intake (within 60 mins) have all been reported to falsely raise LSM.82–84 Importantly, increased BMI > 30 kg/m2 has been reported to be independent predictors of failed LSM in up to 25% of attempted studies with the conventional M probe.85,86
The XL probe attempts to circumvent issues with the M probe with the intent of improving the rate of successful LSM studies. The XL probe has a greater depth of measurement below the skin surface and features a lower frequency, more sensitive transducer with a larger vibration amplitude and deeper focal length.87 In one study, the use of the XL probe allowed reliable LSM measurements in 92% of patients compared with 80%, when using the M probe alone.88 In a study of 193 consecutive patients, evaluating the performance of the XL probe and M probe with paired liver biopsies, the XL probe was more likely than the M probe to achieve success (95% vs 81% P < 0.001). At a cut-off of 7.2 kPa for advanced fibrosis (≥F3), sensitivity was 78%, specificity was 78%, positive predictive value 60% and negative predictive value of 89%, with an AUROC of 0.85, was reported.89 However, discordance by up to two stages between the XL probe and histology in 9% of patients was also noted in this study. Other studies have also reported that the median LSM by the XL probe was 1–1.2 kPa lower than that measured by the M probe, for the same stage of fibrosis.88,89
MRE is a highly accurate modality for estimating liver stiffness that can be easily supplemented to routine magnetic resonance imaging (MRI) protocols. Acoustic drivers are used to generate continuous vibrations at 60 Hz, which are transmitted through the liver. The resulting propagating shear waves in the liver are then captured using a MRI sequence with motion-encoding gradient.90 MRE readings are uniquely advantaged in not being affected by obesity, simple steatosis or the presence of ascites.90–92 In a retrospective cohort study of 325 patients with NAFLD, MRE with a cut-off of 4.15 kPa was able to identify patients with advanced fibrosis (≥F3) with an AUROC of 0.954, sensitivity of 0.85, and specificity of 0.929.93 In a systematic review evaluating nine studies (232 patients with NAFLD), Singh et al.94 demonstrated the high diagnostic accuracy of MRE for detection of fibrosis in NAFLD, with the mean AUROC for detecting any (≥F1), significant (≥F2), advanced fibrosis (≥F3) and cirrhosis (F4) of 0.86 (0.82–0.90), 0.87 (0.82–0.93), 0.90 (0.84–0.94) and 0.91 (0.76–0.95) respectively. The performance of MRE was not affected by sex, obesity or degree of inflammation.94 In addition, MRE was found to have higher diagnostic performance compared to transient elastography in two recent separate cross-sectional studies from Japan and the United States.95,96 Along similar lines, MRE was also more accurate than acoustic resonance force impulse (ARFI) for diagnosing any fibrosis (F ≥ 1) in NAFLD patients.97
To further optimize performance, 3-dimensional (3-D) MRE, a novel variation of the standard, commercially available MRE, which utilized 2-dimensional (2-D) technology, was explored. Using a cohort of patients with biopsy-proven NAFLD, Loomba and colleagues98 demonstrated that 3-D MRE (shear wave frequency of 40 Hz) had even better diagnostic accuracy for identifying advanced fibrosis compared to 2-D MRE (shear wave frequency of 60 Hz) (AUROC 0.981 vs 0.921, p < 0.05).
MRE represents an excellent and highly accurate modality for assessing LSM in NAFLD that is operator-independent, not affected by obesity or ascites, and is also complemented by the ability to obtain anatomical MR imaging at the same time. However, it is dependent on patient factors such as inability to perform breath-hold, and signal degradation in patients with severe iron overload. Similarly, it may not be feasible as a screening tool with considerations for widespread use in view of limitations of facility constraints, high cost and longer examination time, as opposed to VCTE, which can provide the convenience of a rapid bedside procedure that can provide immediate results to the physician.
ARFI is an elastography technique developed by Siemens using modified commercially available ultrasound machines, integrating both elastography and conventional B-mode ultrasonography. By using conventional ultrasound, the operator is able to avoid large blood vessels and other anatomic confounders of LSM. After targeting a 5 mm × 10 mm area of interest, the AFRI probe is used to produce short 262 ms pulses at 2.67 MHz to generate shear waves, propagation of which is tracked by ultrasound to produce the metric of shear wave speed as a surrogate of fibrosis.
In a cohort of 250 consecutive patients with paired liver biopsies, ARFI was able to predict ≥F2 fibrosis and cirrhosis (F4) with an AUROC of 0.74 (95% CI: 0.64–0.86) and 0.79 (95% CI: 0.67–0.91) respectively.99 A meta-analysis of seven studies involving 723 patients reported a pooled sensitivity of 80.2% and specificity of 85.2% in detecting significant fibrosis as defined by stage 2 or more fibrosis, for which the authors concluded that ARFI was modestly accurate in detecting significant fibrosis in NAFLD.99 In a comparative study of Fibroscan® versus AFRI in 291 NAFLD who underwent liver biopsies, similarly good accuracies were reported for AUROC for advanced fibrosis (≥F3), reported as 0.87 and 0.85 for Fibroscan® and AFRI respectively.101 Similarly, in a meta-analysis comparing the performance of ARFI and VCTE, no difference in performance was found in the detection of significant fibrosis or cirrhosis.102
One advantage of ARFI is that it is not affected by morbid obesity (BMI >40). However, the role of ARFI in NAFLD remains to be clarified.103
SSI, otherwise known as shearwave elastography, was developed by SuperSonic Imagine (S.A, Aix-en-Provence, France). It allows acquisition of liver stiffness in real-time, whilst guided by a B-mode image.104 It is similar to ARFI in being ultrasound based, without the need of an external vibrator to produce the shear wave. However, whilst ARFI measures a single shear wave emitted at a single frequency, the transducer in SSI emits a spectrum of pulsed wave beams of varied depths using a frequency band ranging from 60 to 600 Hz. This allows for a near simultaneous evaluation of several shear wave fronts over a wide frequency range.105–107 Cassinoto et al.108 conducted a comparative study of SSI, ARFI and Fibroscan® for assessment of liver fibrosis in a cohort of mixed etiology patients undergoing liver biopsy; SSI performed well, with an AUROC of 0.92 (95% CI: 0.89–0.95) with a specificity and sensitivity of 90% and 81% for advanced fibrosis (≥F3). In this study, SSI compared favorably to Fibroscan® for the diagnosis of advanced fibrosis with an accuracy of 84% vs 78% (p = 0.0016). In another comparative study of SSI, AFRI and Fibroscan® in patients with NAFLD (n = 291) undergoing liver biopsy, no difference was seen in accuracy between the modalities for detecting advanced fibrosis (≥F3), with AUROC for SSI, Fibroscan® and ARFI being 0.89, 0.86 and 0.84 respectively.101 However, with regards to significant fibrosis (≥F2), SSI, while comparable to Fibroscan®, had a higher accuracy than ARFI for diagnosis of significant fibrosis (p = 0.004).
VCTE has been widely adopted as a reliable means of assessing NAFLD patients for advanced fibrosis75,76 and has been extensively studied in both Asian and Western cohorts.72 Some shortcomings are noted, however, especially with steatosis potentially falsely elevating LSM reading,81 in addition to the XL probe having a potential discordance in LSM readings when compared to the M probe.88,89 Nonetheless, VCTE remains a relatively robust and reliable tool that has already been widely used in clinical practice for the assessment of fibrosis.
Newer ultrasound-based modalities that capitalize on inherent B-mode ultrasonography, such as ARFI and SSI, represent an opportunity to serendipitously assess fibrosis in NAFLD patients undergoing ultrasonography, and shows good performance in comparative studies with VCTE.101,102,108 More data needs to be demonstrated, particularly in the context of correlation to clinically relevant outcomes.
Separately, MRE, whilst having numerous advantages, including being unaffected by obesity, simple steatosis, inflammation or the presence of ascites,90–92,94 is largely limited by high cost and accessibility. MRE will probably remain as an opportunistic adjunct for assessment of fibrosis in NAFLD patients undergoing MRI for other reasons. A summary of the elastography-based assessments of fibrosis is available in Table 2.
In day-to-day clinical practice, there have not been any universally accepted algorithms in diagnosis and approach to fibrosis in the context of NAFLD, mainly because no non-invasive modality, either alone or in combination, has been shown to be consistently robust. Among the major society guidelines, the EASL guidelines do suggest that a combination of modality may perform better than a single modality alone. However, there is no robust longitudinal data to correlate the utility of combined modality and clinically relevant outcomes. In our own practice, we commonly utilize a combination of NFS and VCTE in the assessment of advanced fibrosis for patients with NAFLD, as shown in Fig. 1. This has been highlighted before in several other publications advocating its use.109 Moreover, this approach was demonstrated by Petta et al.,110 who compared various bedside scoring systems, such as the APRI, AST/ALT, BARD, FIB-4 and NFS scores, in combination with VCTE. In the validation cohort, the combination of NFS and VCTE had an excellent AUC of 0.844, a specificity of 100%, a positive predictive value of 100% and a negative predictive value of 92.7% for advanced fibrosis (≥F3). In addition, this combination not only provides excellent performance but also represents a cost-effective strategy as demonstrated by Tapper et al.111
Whilst liver biopsy remains the gold standard in assessing necroinflammation and fibrosis to prognosticate patients with NAFLD, it remains far from ideal, with inherent issues of safety and sampling error. Recent years have seen the proliferation of various methods that are able to accurately assess necroinflammation and fibrosis, and allow for viable alternatives to liver biopsy. This is especially important when considering that serial assessments are required over the course of a patient’s lifetime to gauge progression of disease, decide on intervention and track response to treatment. Bedsides, scoring systems such as NFS, FIB-4 and APRI are easily calculated using commonly available parameters and are helpful tools to allow the clinician to identify patients with advanced fibrosis. Methods to assess LSM, such as Fibroscan®, have gained popularity in being highly accurate and reproducible, to assess the presence of fibrosis and detect advanced disease. However, a majority of the non-invasive tests are less nuanced and not as reliable to differentiate between early stages of fibrosis (F1 to F2). The role, therefore, of liver biopsy in this current era is to obtain definitive proof of NASH, especially in patients with competing concomitant diagnoses, such as chronic hepatitis B, and to allow assessment of early fibrosis when non-invasive methods are unable to provide adequate resolution.