Noninvasive “tubeless” pancreatic function tests
In an effort to discover a sensitive and specific function test for CP that avoids risk and invasive procedures and that can be performed on outpatient basis, several tests have been developed, all of which suffer several severe shortcomings, but may be useful in diagnosing CP in a patient with a long alcohol history or with equivocal imaging findings. Generally, these tests only detect advanced CP with steatorrhea, but are fairly cheap and reliable.
Seventy-two hour fecal fat: The 72 h fecal fat collection was once a routine part of the workup for malabsorption, and it remains the gold standard for quantification of steatorrhea. However, it suffers from many drawbacks, including its nonspecificity for pancreatic disease. For example, bacterial overgrowth, short bowel syndrome, and small bowel mucosal disease (e.g. celiac disease and Crohn’s disease) can present with steatorrhea. However, the diarrhea of CP tends to be less voluminous yet fattier than other diarrheal illnesses. In addition, the 72 h fecal fat is inaccurate when performed in the outpatient setting for several reasons. First, it is unrealistic to expect the patient to refrigerate 72 h worth of stool. Second, adherence to a standardized 100 g/24 h fat diet per day for a total of 6 d (the 3 d preceding the test and then the test itself) is difficult. Achieving at least 100 g/d, typical for a large fast food lunch of double cheeseburger and French fries with milkshake, is relatively easy (of course, false negatives can occur in the patient unable to consume that much fat due to pain, though, these are typically early CP patients, who do not yet have steatorrhea). However, quantification of daily fat intake with food diaries as an outpatient is unreliable, making calculation of the coefficient of fat absorption similarly unreliable. Third, for this test, the patient must be off of oral pancreatic enzymes supplements for about a week prior to collection. As a result, some patients have bloating, abdominal discomfort, and gas from malabsorption or are otherwise unwilling to stop the enzymes.
In our institution, for the above reasons, we reserve the 72 h stool collection for research purposes, during which time the patient is admitted to a metabolic ward with a dietician familiar with the protocol to monitor consumption and adjust later meals to account for what has not been consumed. A 72 h stool collection during a high fat diet showing more than 7 g/d fat in the stool is abnormal[23]. The levels of steatorrhea seen in CP tend to be much higher (often > 20 g/d). For practicality, most pancreatologists have abandoned this test. However, a modified 24 h protocol can be used for clinical purposes to monitor response to enzyme therapy in patients experiencing an unexplained increase in steatorrhea, especially in growing children with cystic fibrosis, despite alleged compliance with enzymes.
Spot fecal fat: Sudan staining of a random stool sample for fecal fat is relatively insensitive for fat malabsorption. Generally, it detects steatorrhea only at 25 g/d or more. As a stool collection, it suffers many of the drawbacks of the 72 h fecal fat, including patient embarrassment, need to stop pancreatic enzyme supplements, need to be on a high fat diet for several days before the collection, etc. Greater than 6 droplets of fat per high power field are indicative of steatorrhea. As in the case of the 72 h fecal fat analysis, fat substitutes in foods such as Olestra©/Olean© or drugs such as orlistat or ezetimibe can give false positive results.
Fecal chymotrypsin: In advanced CP, lower concentra-tions of pancreatic proteases reach the stool than in controls. Trypsin is the principal protease secreted by the pancreas, however, it undergoes degradation in the distal small bowel so is not a good fecal marker for pancreas enzyme output[24]. On the other hand, several other proteases made by the pancreas, such as chymotrypsin are useful stool markers. As with all fecal protease assays, the fecal chymotrypsin should be thought of as a surrogate for the 72 h fecal fat rather than for the conventional, “tube,” pancreatic function tests. Chymotrypsin evades degradation in stool by binding to insoluble debris in stool and is stable for several days at room temperature, enabling a sample to be shipped to a reference lab. A fecal chymotrypsin below 3 U/g of stool suggests advanced CP. This test is altered by exogenous pancreatic enzyme supplementation so is useful to monitor for compliance, but is not available in the United States[25]. The fecal chymotrypsin assay is of little clinical value to detect early stage CP, but it has a reasonable sensitivity for advanced disease of from 50% to 80%, increasing to 80%-90% in cystic fibrosis[26], with a specificity of 50%-100%[27–29]. As in all fecal protease assays, watery diarrhea, such as from short bowel syndrome, can give false positive results (low fecal chymotrypsin) by diluting the sample.
Fecal elastase (FE): Pancreatic elastase-1 is a pancreas-specific protease that is minimally degraded during intestinal transit. In fact, it is concentrated 6-fold in stool compared to duodenal juice[3031]. The concentration of fecal elastase in stool measured by Enzyme Linked Immunosorbant Assay (ELISA) correlates well with duodenal amylase, lipase, and trypsin in both CP patients and controls[32]. Typically, a fecal elastase less than 100 mcg/g of stool indicates severe pancreatic insufficiency. A value between 100-200 mcg is indeter-minant, but in the face of other evidence, is suggestive of CP. Values over 200 mcg are normal.
FE suffers from many of the same limitations of the fecal chymotrypsin assay, notably that it only detects patients with steatorrhea and severe CP that likely could have been detected by other means. In various studies, compared to conventional pancreatic function testing and ERCP, the sensitivity of FE varies from between 0%-65% for mild disease to 33%-100% for severe CP, with generally good specificity (from 29% to 95%)[33–37]. FE may be superior to fecal chymotrypsin. For example, in one small study the FE had a sensitivity of 64% for detecting CP compared to 25% for fecal chymotrypsin[38]. Also, like fecal chymotrypsin only a spot stool sample is required rather than a 24 h or 72 h collection. FE also does not cross react to exogenous porcine enzymes so patients can remain on therapy for the test. However, FE is more expensive than fecal chymotrypsin.
Serum trypsin: The serum trypsinogen (a.k.a. trypsin) assay is unique among pancreatic function tests in being a serum sample, making it convenient and relatively cheap. Low levels, less than 20 ng/mL, are specific for CP, but are sensitive only for advanced disease. Levels from 20-29 are indeterminant, but sometimes represent early CP[39]. Sensitivities for mild to severe CP patients combined range from 33%-65%, but specificity is quite high[40]. Sensitivity for exocrine dysfunction is quite high, at about 95%[39]. One added benefit is that trypsin levels over 150 ng/mL are indicative of pancreatic inflammation. For example, the trypsin can be positive for a relapse of CP even when amylase and lipase levels are normal. Conversely, it can help differentiate benign, chronically elevated amylase and lipase from pancreatic inflammation[41]. The test used in our institution is a Radio-Immune Assay (RIA), so it has the disadvantage of requiring several days to obtain a result. We typically obtain this test along with the fecal elastase and pancreatic protocol CT as an initial battery in all patients suspected of having CP referred to our clinic. However, like the fecal assays, it is basically a marker of advanced disease and steatorrhea.
Invasive, traditional, “tube” pancreatic function tests
Since first described in the 1930s and 1940s, several techniques have been developed to measure pancreatic function after physiologic or supraphysiologic stimuli[4243]. The central theme of these tests is to collect and quantitate the quality of pancreatic secretions to determine pancreatic secretory capacity.
Secretin stimulation test (SST): In a technique more widely publicized by Dreiling[44], a double lumen, 26 Fr, oro-duodenal tube with both gastric and duodenal ports is introduced fluoroscopically, stiffened with a guidewire, with only topical anesthesia (benzocaine spray and viscous lidocaine) applied to the posterior pharynx. The weighted tip should be advanced close to the ligament of Treitz and the tapered radiopaque portion of the tube should be positioned at the pylorus. Placement can be hampered by multiple factors including patient discomfort, nausea, gastroparesis, and pyloric stenosis.
We place both the gastric and duodenal ports to low constant suction by an electric flywheel pump whose gauge measures 2-5 inches Hg (51-127 mmHg). However, the suction produced by these pumps may be lower than the gauge suggests: our lab has found that standard wall units are too strong and inconsistent and may result in adherence of the tube to the duodenal wall with clogging of the ports. Constant vigilance is required to prevent clogging of ports which decreases yield of duodenal fluid. During experiments with Polyethylene Glycol (PEG) labeled with carbon 14 (14C), 85% or more of duodenal fluid can be collected with this double lumen “Dreiling” tube with relatively little reflux of duodenal contents into the stomach[4546]. We then measure basal duodenal and basal gastric pH and volume over 15 min. Next, we give a bolus of intravenous (IV) secretin, because bolus administration has been shown to be equivalent[47] or superior[48] to continuous infusion. The typical dose of porcine secretin is a 1 U/kg IV bolus. This is a supraphysiologic dose, but is usually well tolerated other than some flushing. However, the cost of secretin is fairly high. One study showed that an even higher dose of secretin (4-5 U/kg) might be more sensitive[49]. We now use synthetic human secretin at dose 0.2 mg/kg which has been shown to be equivalent to porcine secretin[50]. We then measure three parameters of the duodenal fluid collected over one hour in four
15 min aliquots: volume, pH, and bicarbonate concentration in mEq/L measured by back titration with hydrochloric acid. Others have found that automated analyzers are almost as good as the standard labor intense back titration[51]. The gastric pH and volume at the end of the study are also recorded. The highest concentration of bicarbonate obtained among the four 15 min aliquots is the peak bicarbonate concentration. For completeness, a microscopic exam is performed on the duodenal aspirate for Giardia, Gram stain, and Crystals. Then, the bicarbonate output (the product of bicarbonate concentration and volume) for that hour long post-stimulation period is calculated. The tube is then removed and the patient can resume normal activities and can drive home. Standardized ranges are 80-130 mEq/L for the peak bicarbonate, 1.5-5.7 mL/kg for the volume/kg of patient weight, and 10.1 to 37.0 mEq/h for the bicarbonate output. If the peak bicarbonate is less than 80 mEq/L, the patient is very likely to have CP. If the volume is low and proper position of the collecting tube is reconfirmed, we typically state that the patient should be evaluated further for a pancreatic duct obstruction.
The SST is arguably the most sensitive test for CP. Classically, bicarbonate is thought to be produced by small pancreatic ducts[2]. Consequently, one might anticipate that the SST would be the most sensitive test to diagnose small duct, minimal change CP. This hypothesis was upheld in several studies. The SST, when compared with histology, is 75% sensitive in detecting early stage CP, and up to 97% for late stage disease with a specificity of 90%[5253]. Compared to SST, ERCP has about a 66% sensitivity for early disease, though it comes close to SST for late stage disease[454].
In addition, several histologic studies suggest bicarbonate production may be the best way to diagnose early CP. A study in dogs indicates that the maximal bicarbonate output is closely related to functional pancreatic mass[55]. In addition, an early study by Dr. Dreiling found an excellent correlation between findings on histology and findings of the SST. The SST picked up 20/24 patients (83%) who had CP by pathology whereas ERCP was only 17/24 (71%) sensitive. All underwent SST first, followed by ERCP, and 24 went on to exploratory laparotomy[54].
However, the SST does have some shortcomings, notably difficulty with tube placement and that false positives can be seen for several months after an attack of acute pancreatitis. This is the reason we delay EUS, S-MRCP, fecal elastase testing, fecal fat testing, and SST for several months after an attack of acute pancreatitis.
CCK stimulation testing: In use almost as long as the SST, the classical CCK stimulation test is a useful test, developed and used primarily at the Mayo Clinic in Rochester, Minnesota. Because this test measures ecbolic (enzyme) output, it is, in theory, a measure of the processes that lead to steatorrhea, and could be less sensitive than SST. However, it is still one of the most sensitive tests for the presence of CP. One study of normal controls in Japan found no differences between the SST and the CCK stimulation test[56]. The CCK stimulation test has a number of drawbacks including the need for placement of two specialized 2-lumen tubes with simultaneous gastric and duodenal aspiration and duodenal perfusion of a solution containing mannitol and PEG. CCK is also administered under constant infusion at 40 ng/kg per hour, but it can be given as a bolus[5758]. Caerulein, which is found on the skin of tree frogs and can be produced synthetically, can substitute for CCK. Caerulein is, in fact, many times more potent a secretogogue than CCK[59]. Bombesin can also substitute for CCK[60].
In the classical CCK stimulation test, as in most tube tests, the basal 20 min aspiration of duodenal and gastric contents is discarded. The gastric and duodenal ports are continually withdrawn under low intermittent suctioning and duodenal fluid is collected over 80 min into four 20 min aliquots. Also during the first 20-40 min, the contraction of the gallbladder by CCK (and resultant flow of bile into the duodenum) affects the measurement of pancreatic output. In addition, as CCK can delay gastric emptying[61], and is thought to cross the blood brain barrier and mediate central pain mechanisms[62], symptoms of nausea and vomiting are common during infusion and more common than symptoms from secretin infusion[63]. The classic CCK stimulation test also requires measurement, and constant intestinal perfusion, of a nonabsorbable marker, and recovery rates vary significantly[64]. If the illustration in the Gastroenterology article which first described it is still in use today, it has fewer aspiration ports in the duodenum than the conventional Dreiling tube, and uses pressure suctioning of 40 mmHg[65] which, as mentioned above, may be somewhat different than the suction used at University of Florida with the conventional Dreiling tube.
A modified version of this test using a conventional Dreiling tube, placed under light sedation, and measuring only lipase by a hospital based lab assay was found to be very sensitive in patients with both early (Cambridge 2) and late stage CP by ERCP (Cambridge 3 and 4)[66]. However, no one has compared this test directly to the SST. In addition, as we shall discuss later in the section on endoscopic secretin stimulation testing, use of sedation may affect recovery of secretions and cost.
Combined secretin-CCK (secretin-pancreozymin) stimulation testing: This test is used mostly in Europe and Japan and allows measurement of both bicarbonate and enzyme production by the pancreas. In theory, the simultaneous administration of Secretin with CCK has the potential to dilute the measurement of enzyme activity by watery, bicarbonate solution. However, CCK can also be given before[67] or after[57] Secretin. It also shares one of the drawbacks of the CCK stimulation test: increased bile secretion into the duodenum.
In one study of the Secretin-CCK test, the peak bicarbonate - rather than CCK-related parameters - was correlated nearly linearly to the severity of histologic changes in CP. Also in this study, the second and third best measures of histologic damage were the amylase activity and the total volume, respectively. In that study, the secretin-CCK test was 67% sensitive for various stages of CP, which is somewhat less than other studies of the SST. However, this study used stringent requirements for the diagnosis of CP. All 3 parameters (peak bicarbonate, volume of duodenal secretions, and amylase output) had to be decreased in order to qualify as CP. Applying our cutoffs for peak bicarbonate, only, to this data would give greater sensitivity with only some loss of specificity[468].
Another study found that the trypsin activity in pancreatic fluid was not as sensitive a measure of CP as the peak bicarbonate during Secretin-CCK testing[69]. A recent, and probably the largest, study of Secretin-CCK stimulation testing supported this finding, mostly in cystic fibrosis patients. In this study, 336 CCK-Secretin tests were reviewed. Using enzyme (trypsin) activity alone (cutoff < 50 U/kg per hour) would have had 25% false positives if enzyme recovery were not corrected for losses (if a marker had not been used); i.e. 25% of patients with good enzyme activity would have been falsely classified as pancreas-insufficient[70].
A third study of 19 alcoholic CP patients and 6 patients with idiopathic CP who underwent CCK-secretin testing and went on to surgery for refractory symptoms, 18/18 of whom had an abnormal ERCP, found that the peak bicarbonate concentration and output were the best measures of small duct dilation seen on histology. In addition, peak bicarbonate output was the best measure of acinar atrophy with a Spearman correlation coefficient of -0.71 (P between 0.001 and 0.01) and the chymotrypsin output was also significantly correlated (Spearman, -0.57), but with a higher P of between 0.01 and 0.02. Peak volume also correlated fairly well with acinar atrophy (Spearman -0.44, P between 0.02 and 0.05), but peak bicarbonate concentration was weaker (Spearman -0.17, P > 0.05). In summary, this study found that the hydraulic parameters (volume, peak bicarbonate concentration, peak bicarbonate output) were overall better predictors of abnormal histology than the ecbolic parameters (chymotrypsin)[2].
These studies indicate that the CCK portion of the Secretin-CCK stimulation test adds little information in the diagnosis of CP that the secretin stimulation portion alone (or perhaps the classic SST) already provides. However, the Secretin-CCK test is certainly a more sensitive measure of pancreatic enzyme production than the bentiromide test, which tests primarily protease production by the pancreas[71].
Perfusion testing: Researchers in the Gastroenterology Division at the University of Florida in Gainesville over the last 25 years have developed and implemented a method of measuring endogenous and exogenous pancreatic enzyme activity in the duodenum of patients with CP analogous to the Mayo clinic CCK methodology. This “perfusion test” enables quantification of delivery of exogenous pancreatic enzymes to the duodenum. Some notable differences between this perfusion test and the CCK stimulation test include use of a standardized meal rather than CCK to stimulate the pancreas, use of a modified Dreiling tube attached to a 7 Fr Dobhoff tube, placed without sedation under fluoroscopy, and the perfusion of radiolabeled Carbon-14 Polyethylene Glycol (PEG) to enhance assessments of recovery. This perfusion test measures endogenous enzyme production in the fasted and fed states with a standardized Ensure® meal. Volume of both gastric and duodenal collections, pH, and enzyme activity are recorded over a 3 h period. The test is then repeated immediately after intake of an exogenous pancreatic enzyme[72]. The inconvenience and time required for this test render it useful only for the research setting.
Intraductal SST: In the intraductal secretin stimulation test, typically the main pancreatic duct is cannulated using ERCP techniques and then pancreatic fluid is collected, after the administration of secretin alone, or secretin followed by CCK. The patient is sedated without anticholinergic medications such as diphenhydramine (Benadryl®) or opiates, usually with benzodiazepines. Typically, the pancreatic fluid collected in this manner has a higher bicarbonate concentration than in the classical SST, around 130 mEq/L for controls, and less then 105 mEq/L for CP patients, owing to lack of contamination by bile and duodenal content. Some of the disadvantages of this test include the complication rate of ERCP, the need for sedation, and the relatively short time periods of collection (usually 15 min, as limited by sedation and fluoroscopy room time). An advantage of the intraductal test is that pure pancreatic juice is collected without contamination with bile or duodeno-gastric contents and that it can be used in patients with Billroth I and II gastric resections.
One group showed that the intraductal test could not reliably differentiate between 19 CP patients, 14 “early CP” patients, and 14 controls[73], despite a long intraductal collection period of 60 min. The investigators used extra CCK with secretin after the initial secretin boluses in 15 patients. They used only 70 U maximum of secretin and did not adjust for weight of the patients. In addition, their aspiration catheter was prefilled with a dye to assist in identifying the start of the collection, which may have been problematic due to mixing. Also their “early CP” patients had only acute relapsing pancreatitis with no evidence of chronicity by imaging or conventional pancreatic function testing.
For the analyses, it appears they combined the patients with “early CP” and those with CP. They found that this combined group of CP patients produced significantly less volume of pancreatic secretions than controls after stimulation with 1 CU of secretin and 70 CU of secretin but not after 4 CU of secretin. CP patients also had significantly less bicarbonate concentrations only after 4 CU of secretin compared to controls. Bicarbonate output was decreased significantly at all time points for CP patients compared to controls. Interestingly, at only the first minute time point, in patients with CP, the protein content of fluid was higher but not significantly so, than controls, perhaps due to concretions of inspissated enzymes in this group from PD stasis. However, after 70 CU of secretin, the protein output of CP patients was significantly less than controls.
A second, larger study of 12 patients with CP and 33 controls (22 normals and 11 with other nonpancreatic GI disorders), which used only a 20 min collection time, found that the sensitivity of the intraductal test peak bicarbonate compared to SST was 100% with a specificity of only 66%. Volume had an 88% sensitivity and a 91% specificity for CP by SST[74].
The most recent study of the intraductal secretin stimulation test was less favorable. In this comparison of the intraductal secretin stimulation test and SST, in which 19 patients served as their own control, the sensitivity of the intraductal secretin stimulation test compared to the conventional SST was only 80%, with a very poor specificity of 20%. Against pancreatogram, the intraductal test was 100% sensitive but only 55% specific[75]. This group used three 5-min collections (as is customary for most intraductal secretin stimulation tests) and the first was discarded. Based on these results, we do not recommend the use of the intraductal secretin stimulation test for routine diagnostic or research purposes.
Endoscopic secretin stimulation testing (eSST): An alternative to traditional pancreatic function testing is to sedate the patient, and collect duodenal juice under endoscopic guidance from a polyethylene tube passed through the biopsy channel of a standard upper endoscope after stimulation with secretin[5176] or the combination of secretin-CCK[77]. This offers the advantage of patient comfort and sedation. The eSST has been extensively studied by a group of investigators from the Cleveland Clinic. The overall impression is that the eSST has the potential to yield results similar to the conventional SST. This comes to no surprise since the two tests are very similar with the main differences being the use of sedation in the eSST and the use of the endoscope to collect duodenal secretions rather than a Dreiling tube for the conventional SST. However, it should be noted that the eSST and the SST have only been directly compared in one small cross over study of healthy controls only, without any CP patients, in which the SST group also received sedation, which we do not do and could have confounded the results in favor of the eSST[78].
Unfortunately, the eSST has several disadvantages. Although the eSST is technically easy to perform, it is impractical, and to date it has not gained acceptance. The main problem appears to be that occupying an endoscopy room and keeping the patient sedated for more than one hour are cost-prohibitive. Although the Cleveland Clinic group has shown that a 45 min endoscopic collection is reasonable with good sensitivity with some loss of specificity[79], we have shown that a full 60 min is necessary for full sensitivity and specificity of the classical secretin stimulation test[80]. Furthermore, patients and their escorts will also have to miss a whole day of work. In addition, medications used for sedation may have effects on pancreatic secretions[81]. Opiates may constrict the sphincter of Oddi, and propofol contains 5% triglyceride which may have effects on pancreatic secretion. Although one small study of normal subjects did not find an effect of light sedation on secretion during endoscopic secretin stimulation testing, it used fairly low doses - 2.5 mg of midazolam and 62.5 mg of meperidine[82]. However, we have found that greater amounts of sedatives are required in most patients with chronic abdominal pain who are referred for evaluation of possible CP.
“Enhanced imaging” pancreatic function tests (S-MRCP): Because of some of the shortcomings of conventional MRCP in the diagnosis of pancreatic disorders, some have investigated the use of MRI with secretin stimulation to increase the flow and volume in the pancreatic duct. The filling of the duodenum can be semi-quantitated to assess for CP. One possible problem with this technique is that it measures volume of pancreatic flow rather than bicarbonate concentration. In theory, obstructive lesions, or sphincter of Oddi spasm could give positive results in the absence of true CP[83]. In addition, MR images are acquired over at most 30 min, which is often an insufficient length of time during secretin stimulation and which may lead to reduced sensitivity.
One German study of 18 CP patients, defined by ERCP, many of whom had previously undergone pancreatic duct stenting and removal, and 5 diseased controls exemplified some of these issues with S-MRCP. This study, even on these patients with obviously advanced CP, showed a 69% sensitivity of S-MRCP with 1 CU/kg of secretin and 90% specificity as compared to relatively insensitive pancreatic function tests, such as the fecal elastase and 13C Mixed Chain Triglyceride Breath Test (MCT-BT)[83].
Another method that S-MRCP uses to assess for CP is parenchymal enhancement during gadolinium infusion (also used during conventional MRI, but which is not used during conventional MRCP). To assess for parenchymal enhancement, T1-weighted sequences with fat suppression are crucial. Also important is the pattern of gadolinium enhancement of the parenchyma: CP patients show decreased enhancement in the arterial phase and increased enhancement in the early venous phase, which are thought to be due to decreased pancreatic blood flow. On T2 imaging, enhancement is seen in CP patients compared to controls, indicating fatty or fibrous replacement of the parenchyma. After 0.5 IU/kg of secretin, the reduced T2 signal changes showed a good correlation with the Lundh test, a pancreatic function test using meal based stimulation. This study also showed a good correlation between duodenal diameter after S-MRCP and the Lundh test. Patients with severe CP had an average increase in duodenal diameter of 1.7 mm. In mild CP the increase was 4.7 mm, and in controls, 14 mm. However, in this study, the patient population was not well defined. They did have a cohort with mild pancreatitis but again we do not know the criteria used to establish this[84].
Another group, this time from Japan, has distinguished S-MRCP, which they reserve to look for duct changes, from “Secretin-Stimulated, Diffusion Weighted MRI” which focuses on secretin-induced changes within the parenchyma of the gland. This new type of MRI calculates the Apparent Diffusion Coefficient (ADC) which measures diffusion of water molecules in the microcirculation. They claim that this type of MRI is even more sensitive than S-MRCP and that it evaluates local and regional pancreatic exocrine function. They also measured changes in alcoholic patients, known not to have structural pancreatic disease by conventional CT. Notably these patients did not undergo pancreatic function testing or ERCP[11].