Effects of Reusing Baseline Volumes of Interest by Applying (Non-)Rigid Image Registration on Positron Emission Tomography Response Assessments

Objectives Reusing baseline volumes of interest (VOI) by applying non-rigid and to some extent (local) rigid image registration showed good test-retest variability similar to delineating VOI on both scans individually. The aim of the present study was to compare response assessments and classifications based on various types of image registration with those based on (semi)-automatic tumour delineation. Methods Baseline (n = 13), early (n = 12) and late (n = 9) response (after one and three cycles of treatment, respectively) whole body [18F]fluoro-2-deoxy-D-glucose positron emission tomography/computed tomography (PET/CT) scans were acquired in subjects with advanced gastrointestinal malignancies. Lesions were identified for early and late response scans. VOI were drawn independently on all scans using an adaptive 50% threshold method (A50). In addition, various types of (non-)rigid image registration were applied to PET and/or CT images, after which baseline VOI were projected onto response scans. Response was classified using PET Response Criteria in Solid Tumors for maximum standardized uptake value (SUVmax), average SUV (SUVmean), peak SUV (SUVpeak), metabolically active tumour volume (MATV), total lesion glycolysis (TLG) and the area under a cumulative SUV-volume histogram curve (AUC). Results Non-rigid PET-based registration and non-rigid CT-based registration followed by non-rigid PET-based registration (CTPET) did not show differences in response classifications compared to A50 for SUVmax and SUVpeak,, however, differences were observed for MATV, SUVmean, TLG and AUC. For the latter, these registrations demonstrated a poorer performance for small lung lesions (<2.8 ml), whereas A50 showed a poorer performance when another area with high uptake was close to the target lesion. All methods were affected by lesions with very heterogeneous tracer uptake. Conclusions Non-rigid PET- and CTPET-based image registrations may be used to classify response based on SUVmax and SUVpeak. For other quantitative measures future studies should assess which method is valid for response evaluations by correlating with survival data.


Introduction
Positron emission tomography/computed tomography (PET/ CT) has been shown to be a valuable tool in oncology for monitoring response to treatment [1]. Volumes of interest (VOI) can be defined on the pre-treatment PET/CT scan and on consecutive (response) scans during or after treatment to measure changes (response) in metabolically active tumour volume (MATV). tracer uptake or uptake heterogeneity [2]. A 3 dimensional isocontour method at 50% of the maximum pixel value that corrects for local background (A50) is a highly reproducible method to define VOI (semi-)automatically [3][4][5][6]. Ideally baseline VOI should be projected onto the consecutive (response) scans to enable more efficient therapy efficacy assessment [7]. One practical issue with longitudinal PET/CT studies is that patient positioning between consecutive scans may vary, thereby inhibiting the direct reuse of baseline VOI for response scans. Image registration between consecutive scans is required to facilitate reuse of baseline VOI. These image registrations can be performed either rigidly or non-rigidly. Rigid image registration only allows for rotational and translational movements of the entire image, whereas non-rigid image registration allows for any type of local (elastic) deformations. A previous test-retest study showed that reusing baseline VOI by applying non-rigid and to lesser extent (local) rigid image registration has good repeatability, similar to delineating VOI on either scan separately [8]. However, in a test-retest setting, no changes in tumour shape, volume, tracer uptake and/or tracer uptake heterogeneity are expected, because these studies are acquired within a limited time frame and without administration of therapy. In a response monitoring setting, the interval between consecutive scans can be several weeks. For this reason, not only difference in patient positioning between consecutive PET/CT scans may pose a challenge for image registration strategies in longitudinal PET/CT studies, but also changes in tumour shape, volume, tracer uptake and tracer uptake heterogeneity, resulting from either treatment effects or progression of the disease [8,9].
The purpose of the present study was to investigate the effects of reusing baseline VOI by (non-)rigid image registration strategies proposed previously [8] on PET/CT response assessments and response classifications by comparing the results to those obtained using A50 to delineate VOI on baseline and response scans separately.

Patient data
Baseline whole-body [ 18 F]fluoro-2-deoxy-D-glucose ([ 18 F]FDG) PET/CT studies were acquired for 13 patients (9 male, 4 female; age: 60612 y; weight: 84617 kg; height: 17269 cm) with advanced colorectal carcinoma at five different sites [10]. Patients were only included if their double baseline studies demonstrated good repeatability [10]. All patients had received no therapy (chemotherapy, radiotherapy, or surgical treatment) for 2 weeks prior to the baseline scan. The patients were treated by BMS-582664 (brivanib alaninate) in combination with full-dose cetuximab (Erbitux), a monoclonal antibody targeting epidermal growth factor receptor. BMS-582664 is a selective dual inhibitor of fibroblast growth factor and vascular endothelial growth factor signalling, and is taken orally on a daily schedule [11]. Twelve patients underwent an early [ 18 F]FDG PET/CT response scan after 1 cycle (day 15) of treatment, and nine patients a late [ 18 F]FDG PET/CT response scan after 3 cycles (day 56). Patients fasted for at least 4 h prior to scanning and refrained from strenuous physical activity. Blood glucose levels were obtained for each patient prior to scanning and were within the normal range (5.661.0 mmol?l 21 ).

Image registration strategies
All registrations were performed using Elastix (UMC Utrecht, The Netherlands) [14]. Various rigid and non-rigid strategies were evaluated based on various input data [8]: N PET to PET image registration. This registration type takes functional information into account; N CT to CT image registration. This registration takes anatomical information into account. The low dose CT scans were downsampled to the PET resolution prior to image registration to increase computational performance and to avoid issues with computer memory; N CT to CT image registration, after which the transformation was used to initialize PET to PET registration (referred to as CTPET). This registration initially takes the anatomical followed by the functional information. This method was only used for (non-linear) non-rigid transformations, as (linear) rigid CTPET-based image registration would produce identical results to rigid PET-based image registration.
These various types of rigid and non-rigid image registration were applied on whole-body images, referred to as 'global'. In addition, these various types of rigid image registration were also applied on selected whole-body images, cropped in such a way that they included slices with either the abdomen or lung. This method is referred to as 'local'. In total, 7 different image registration strategies were investigated for response monitoring purposes. More details on the applied registration strategies and the corresponding settings for Elastix can be found in the literature [8].

Data analysis
In total, 29 lesions were identified on the baseline scan located in the liver (n = 17), lung (n = 10), bone (n = 1) or pancreas (n = 1). For early response assessments, 27 lesions could be identified located in the liver (n = 15), lung (n = 10), pancreas (n = 1) or bone (n = 1). For late response assessments, 18 lesions could be identified located in the liver (n = 9), lung (n = 8) or pancreas (n = 1). VOI were drawn on baseline and both response scans using A50, resulting in baseline and (early and late) response VOI A50 . In addition, baseline scans were registered onto the (early and late) response scans using the various registration strategies, after which baseline VOI A50 were transformed according to the transformation parameters obtained, resulting in VOI registered . For each VOI, maximum SUV (SUV max ), peak SUV based on 1.2 cm diameter spherical VOI (SUV peak ) [15], average SUV (SUV mean ), MATV, total lesion glycolysis (TLG, calculated as product of SUV mean and MATV) and area under a cumulative SUV-volume histogram (AUC) [2,16] were obtained. AUC is a quantitative index of uptake heterogeneity, with lower AUC corresponding to a higher degree of (global) uptake heterogeneity [2,17,18]. SUV mean , MATV, TLG and AUC were not determined for VOI obtained using rigid image registration, due to its inability to change the shape or volume of a VOI. For all the quantitative measures, (early and late) responses were calculated as the values of the (early or late) response scans (obtained from either VOI registered or response VOI A50 ) divided by the values of the baseline scan (obtained using baseline VOI A50 ) times 100%.
To assess the agreement between VOI registered and response VOI A50 , Dice similarity coefficients (DSC) were calculated between VOI registered and response VOI A50 using DSC(X ,Y )~2 where X denotes the volume of VOI registered , Y the volume of response VOI A50 , and X \Y the overlap between the two volumes. A value of 0 indicates no overlap, whereas a value of 1 indicates perfect agreement. The level of agreement between responses obtained using A50 and each registration strategy was determined for each quantitative measure using intraclass correlation coefficients (ICC) with a two-way random single measures model (SPSS, Chicago, IL, USA). An ICC of 1 indicates a perfect agreement. Statistical significance was determined using a two-tailed paired Student's t-test, where p-values less than 0.05 were considered significant. Correlations between DSC and various values derived from MATV and AUC (absolute values of baseline and consecutive scans, and absolute responses) were assessed using squared Pearson's correlation coefficients (R 2 ).

Response classification
The obtained (early and late) responses for SUV peak were classified using PET Response Criteria in Solid Tumors version 1.0 (PERCIST) [15] as progressive metabolic disease (PMD), stable metabolic disease (SMD), partial metabolic response (PMR) and complete metabolic response (CMR). PERCIST specifies that PMR requires greater than a 30% and a 0.8 g/ml decline in SUV peak between the most intense lesions before as well as after treatment (not necessarily the same lesion); PMD requires . 30% and 0.8 g/ml increase in SUV peak or new lesions; CMR is assigned when all metabolically active tumours have visually disappeared. Unlike PERCIST, classification was not performed per subject for the metabolically most active lesion only, but for each lesion individually. As CMR can be observed visually, CMR lesions were excluded. The response thresholds of SUV peak were also used for Figure 1. Similarity between volumes of interest (VOI) obtained using A50 and various registration strategies. Box plots of Dice similarity coefficients (DSC) for early (A, C, F) and late (B, D, G) response assessments. DSC were obtained using all VOI (A, B), lung VOI (C, D) or liver VOI (E, F). The mean is illustrated by a square, outliers by dots, and minimum and maximum values by crosses. Abbreviations: A50, 3 dimensional (semi-)automatic isocontour method at 50% of the maximum pixel value that corrects for local background. doi:10.1371/journal.pone.0087167.g001 Figure 2. Sagittal images of a patient with a bone metastasis. Top row: baseline (left) and early response (right) PET/CT images. Bottom row: volumes of interest (shown in red) projected onto the baseline (first image) and early response scans (other images) that were obtained using (from left to right): A50 defined on baseline scan, A50 defined on early response scan, and global rigid PET, local rigid CT, non-rigid PET and non-rigid CTPET image registration. All images are shown using the same colour scales. Abbreviations: SUV, standardized uptake values; HU, hounsfield units; A50, 3 dimensional (semi-)automatic isocontour method at 50% of the maximum pixel value that corrects for local background. doi:10.1371/journal.pone.0087167.g002 SUV max . PERCIST does not specify response thresholds for SUV mean , TLG (only for PMD), MATV and AUC. Therefore, these thresholds were derived from retrospective test-retest data obtained using A50 [3]. These thresholds could then be used to classify responses in SUV mean , TLG, MATV as PMD, SMD and PMR, and observed responses in AUC as an increase in tracer uptake heterogeneity (IUH), stable tracer uptake heterogeneity (SUH) or a decrease in tracer uptake heterogeneity (DUH). The percentage response thresholds were obtained by calculating the mean test-retest value plus two times the standard deviation, rounded up to the next multiple of ten. The absolute response thresholds were obtained by calculating the mean absolute difference between test and retest values plus two times the standard deviation, rounded up to the tenth decimal place. More details on the used dataset can be found in [3]. Response thresholds derived from retrospective test-retest data were 30% with a minimum change of 1.6 g/ml, 110% with a minimum change of 11 ml, 10% with a minimum change of 0.06, and 110% with a minimum change of 28 g for SUV mean , MATV, AUC and TLG, respectively (table 1).

Results
Overlap between VOI obtained using A50 and each registration strategy Both non-rigid PET and CTPET registration showed the highest median DSC for early and late response assessments (early response assessments: 0.61 and 0.65, respectively; late response assessments: 0.55 and 0.54, respectively). For early response assessments, local rigid PET registration also showed a high median DSC (0.59). All registration strategies showed a decrease in median DSC from 9% (non-rigid PET registration) up to 38% (global rigid CT registration) between early and late response assessments (figures 1A and 1B, respectively). One VOI, located in bone, did not show overlap between A50 and global rigid PET or local rigid CT registration, and is illustrated in figure 2.
In general, DSC values obtained from lung VOI were significantly higher for non-rigid image registration compared to (local) rigid registration (p,0.04, figures 1C and 1D), except in early response assessments using local PET registration compared to non-rigid PET registration (p = 0.10). For liver VOI (figures 1E and 1F), non-rigid PET registration showed higher DSC values compared to (local) rigid PET registration in late response assessments (p,0.01), whereas non-rigid CT registration showed significantly lower DSC compared to local CT registration in early response assessments (p,0.05). Other results obtained for liver VOI were insignificant (p.0.12).
For early response assessments, there was a weak relationship between DSC and the absolute MATV response values obtained from either A50 or the registration strategy itself (table 2; R 2 : 0.20-0.42), except for non-rigid CT registration that showed no relationship (R 2 :,0.16). In late response assessments, only nonrigid PET registration showed a weak relationship between DSC and absolute MATV response values (R 2 : 0.31), all other methods did not show a relationship (R 2 :,0.19). Only non-rigid PET and CTPET registration showed a moderate relationship between DSC and the absolute AUC response values obtained from the registration strategy itself for both response assessments (R 2 : 0.31-0.37). All other values investigated (table 2) generally showed, no relationship with DSC. Some typical scatter plots for non-rigid PET registration are shown in figure 3.

Effects on response values
Absolute values of various quantitative PET measures obtained using A50 are listed in table 3. Median response values obtained using A50 and the various registration strategies are shown in figure 4. For both response assessments, SUV max and SUV peak response values derived from all registration strategies showed an almost perfect agreement with corresponding SUV max and SUV peak response values derived from A50 (table 4, ICC:.0.921), except for local rigid CT registration in early response assessments (ICC: 0.616). However, only non-rigid PET and CTPET image registration showed no significant differences in SUV max and SUV peak response values compared to those obtained from A50 (p.0.056). In addition, an almost perfect agreement was observed between SUV mean response values derived from A50 and from non-rigid PET or CTPET registration (ICC:.0.923), but the observed differences were significant (p,0.011). Poor to moderate agreement was found between MATV, TLG and AUC response values derived from A50 and from non-rigid PET or CTPET registration (ICC: 0.034-0.763). One lesion (outlier in figure 4G) showed a large increase in MATV (447%) for A50 in the early response assessment and is illustrated in figure 2.

Effects on response classifications
Only non-rigid PET and CTPET registrations showed no differences in response classifications compared to A50 for SUV max and SUV peak (figure 5). However, for MATV, SUV mean and TLG, compared with A50, non-rigid PET and CTPET registration showed in general more PMR and less SMD (up to 17%) or more SMD and less PMD (up to 11%). Moreover, nonrigid PET and CTPET registration showed more IUH and less SUH and/or DUH for AUC compared with A50 (up to 50%). Non-rigid CTPET and PET registration seemed to miscategorise response using SUV mean and AUC for small lung lesions (,2.8 ml, figure 6), whereas A50 seemed to miscategorise response using MATV when another lesion with high uptake was close to the target lesion (figure 7). All methods seem to be affected by lesions with visually (increased) heterogeneous tracer uptake (figure 8). Three lesions showed deviating classifications between A50 and non-rigid CTPET and/or PET registration for two or more quantitative measures (TLG, SUV mean , MATV and/or AUC) in late response assessments that was caused by a slightly larger or smaller volume for the VOI obtained with CTPET and PET compared to the VOI obtained with A50 ( figure 9). Their SUV max changed by 24%, 46% and 223% to 15, 22 and 7 g/ml. For AUC, an additional 13 lesions showed deviating classifications between A50 and non-rigid CTPET and/or PET registration when lesions were very small (,5.0 ml, six lesions) or had a slightly larger or smaller volume for the VOI obtained with CTPET and PET compared to the VOI obtained with A50 (seven lesions).

Discussion
This is the first study to investigate the effects of reusing baseline VOI by (non-)rigid image registration strategies on PET/CT response classifications and to compare these results to those obtained using VOI delineated on baseline and response scans separately. Out of all rigid registration strategies, local rigid PET registration showed the most similar performance to A50 for both response assessments. Nevertheless, local rigid PET registration showed one deviating response classification from A50 for SUV peak in early response assessments (located in liver, figure 7) and two in late response assessments (located in liver and lung). Thus, (local) rigid image registration should not be applied to reuse baseline VOI for response classifications. These results are consistent with published data [9] whereby rigid registration provided imperfect alignment of breast tissue between longitudinal breast cancer PET/CT studies.
Non-rigid CT registration showed a poorer performance compared to non-rigid PET registration in both response assessments and to local CT registration for liver VOI in early response assessments. As discussed in a previous study [8], CT image registration may be improved by using respiratory gating [19] or intermodality image registration to correct for small residual misalignments between CT and PET (figure 6) [20,21], or by using the original CT images that were not downsampled to the PET resolution. Although not shown for non-rigid CT registrations, reducing pixel resolution has little effect on the performance of rigid CT registration and can be used to speed up the algorithm without loss of accuracy [22].
These results indicate that non-rigid PET and CTPET registration may be used to classify response based on SUV max and SUV peak . These results were consistent with results reported by De Moor et al. [7] showing that non-rigid image registration could be used to access therapy using PET more efficiently. However, differences in response classification were observed for MATV, SUV mean , TLG and AUC. For MATV, SUV mean and TLG, differences were noted for small lung lesions (figure 6) or when another high uptake area or lesion was close to the target lesion (figures 2 and 7). In addition, all methods seem to be affected when a lesion showed (increased) heterogeneous tracer uptake ( figure 8). Furthermore, some lesions showed a larger or smaller VOI in A50 compared to non-rigid PET or CTPET registration in late response assessment that had no apparent cause ( figure 9). For AUC, an additional 13 lesions showed deviating classification non-rigid CTPET and/or PET registration and A50 when lesions were very small (,5.0 ml) or had a slightly larger or smaller volume for the VOI obtained with CTPET and PET compared to the VOI obtained with A50.
The registration of small lung lesions may be hampered by the limited registration parameters used in this study. As previously reported [8], registration parameters of the registration software (Elastix) could be adjusted to allow higher DSC for some patients, thereby likely obtaining more accurate SUV mean , MATV and TLG for some lesions. However, the use of these parameters was considered not feasible for reuse of baseline VOI due to image artefacts that were observed for some patients in the registered images. Only those parameters were used that showed a high DSC without any image artefacts, but this limits the flexibility of Elastix that may be required for some types of lesions. Classification of AUC was more affected by small lesions than classifications of other quantitative measures. An explanation for this is that AUC is ultimately dependent upon intensity histograms derived from individual tumours [23]. Therefore, tumour volumes should be sufficiently large to obtain valid results for AUC [24,25].
Another high uptake area or lesion close to the target lesion can cause potential outliers for A50, as illustrated in figure 2. This  bone lesion showed a decrease in SUV max from 8.0 to 3.0 g/ml. The resulting SUV max was close to the [ 18 F]FDG uptake of the surrounding bone tissue, causing A50 to delineate a larger fraction of the bone. Nevertheless, this large increase in MATV (447%) was only 2.8 ml, thereby not classified as PMD. However, for the lesion depicted in figure 7, this did result in the inclusion of a nearby lesion and was therefore erroneously classified as a PMD. Tumours with heterogeneous tracer uptake affect thresholdbased delineation methods such as A50 [26]. For the image registration strategies, all PET-based image registration strategies used in this study measure similarity by maximizing normalized cross correlation [27]. Other similarity measures, such as normalized mutual information, might more appropriate for tumours that show (increased) tracer uptake heterogeneity. However, DSC for the two lesions that showed (increased) tracer uptake heterogeneity were lower for mutual information (0.31 and 0.26) compared to normalized cross correlation (0.37 and 0.35, data not shown), indicating that mutual information might not be more appropriate for tumours that show (increased) tracer uptake heterogeneity than normalized cross correlation.
For SUV mean , MATV, TLG and/or AUC, some lesions (three that affected two or more quantitative measures, and seven that affected AUC alone) showed deviating response classification when obtained with non-rigid PET and/or CTPET registration compared to A50 for late response assessments. These lesions showed a larger or smaller VOI for A50 compared to those obtained using non-rigid PET or CTPET registration. The difference in VOI between those obtained using A50 and nonrigid PET or CTPET registration could not be explained by the presence of high uptake area or lesion close to the target lesion. Possible scenarios include either the VOI obtained using A50 were larger or smaller because of the decrease/increase in SUV max , or the VOI obtained using non-rigid PET or CTPET registration were smaller or larger because of used similarity measure or the limited parameters used for the registration software (Elastix). Which VOI is more predictive can only be determined by future studies that correlate quantitative measures derived from each method to patient survival data. Therefore, for quantitative parameters such as SUV mean , TLG, MATV and AUC, future studies should be performed to further validate the use of non-rigid  PET or CTPET registration for response classifications and correlating these to survival data. The fact that more deviating classifications were observed for AUC than for other quantitative measures may be explained by the higher sensitivity of AUC for differences in VOI placement/delineation compared to other metrics (i.e. SUV max , SUV peak or even SUV mean , TLG and MATV). This indicates that any results on heterogeneity measures should be carefully checked for errors in tumor delineation or VOI placements. Recently, it has been shown that AUC is less sensitive to the type of tumor delineation compared to other (more local or regional) tracer uptake heterogeneity measures [28]. This may suggest that the performance for CTPET or PET registration may not be adequate enough for quantification of changes in global tracer uptake heterogeneity.  Limitations One limitation of this study is that the interval between [ 18 F]FDG administration and the start of acquisition between subjects was 84632 min, i.e. a fairly large inter-subject variability.
The European Association of Nuclear Medicine (EANM) guidelines for quantitative [ 18 F]FDG PET/CT studies [12] emphasize that the recommended scan time should be 60 min post injection and the same interval (tolerance 65 min) should be applied in the Figure 8. Coronal images of a patient with a large liver metastasis showing heterogeneous tracer uptake. Top row: baseline (left) and early response (right) PET/CT images. Bottom row: volumes of interest (shown in red) projected onto the baseline (first image) and early response scans (other images) that were obtained using (from left to right): A50 defined on baseline scan, A50 defined on early response scan, and local rigid PET, non-rigid PET and non-rigid CTPET image registration. All images are shown using the same colour scales. Abbreviations: SUV, standardized uptake values; HU, hounsfield units; A50, 3 dimensional (semi-)automatic isocontour method at 50% of the maximum pixel value that corrects for local background. doi:10.1371/journal.pone.0087167.g008 Figure 9. Coronal images of a patient with a liver metastasis that showed an increased metabolically active tumour volume. Top row: baseline (left) and late response (right) PET/CT images. Bottom row: volumes of interest (shown in red) projected onto the baseline (first image) and late response scans (other images) that were obtained using (from left to right): A50 defined on baseline scan, A50 defined on late response scan, and local rigid PET, non-rigid PET and non-rigid CTPET image registration. All images are shown using the same colour scales. Abbreviations: SUV, standardized uptake values; HU, hounsfield units; A50, 3 dimensional (semi-)automatic isocontour method at 50% of the maximum pixel value that corrects for local background. doi:10.1371/journal.pone.0087167.g009 context of therapy response assessments. Note, however, that this study occurred prior to the EANM guidelines and the sites were asked to scan at 60610 min and then at the same time 615 min for next scan. For most patients, the difference in scan time between baseline and response scan was small (i.e. 866 min). Only two patients showed a large difference in this interval (i.e. 88618 min). As previously shown by Cheng et al. [29] the expected [ 18 F]FDG uptake in the background surrounding a lesion may vary significantly at different imaging time points. Therefore, the variability in scan time between baseline and response scan is expected to have affected the observed absolute SUV and response values based on relative SUV changes, at least for these two patients. This would have been a serious limitation when the results would have been correlated with patient survival data and both patients should then have been excluded from the study. However, in this study, both A50 and the various registration strategies use the same input data and only differences between these methods are investigated. Furthermore, both methods are less sensitive for changes in contrast. All PET-based registration strategies use normalized cross correlation as a similarity metric that compensates for a (global) change in contrast. In addition, A50 is able to adapt its threshold relative to the local average background and is therefore less sensitive for a change in local contrast [3,30]. Out of all five lesions identified within the two patients that showed a large deviation in scan time between baseline and response scan, only one lesion (figure 8) showed differences in response classification between A50 and the registration strategies, but this difference was likely caused by heterogeneous tracer uptake within the lesion. It is therefore expected that the difference in scan time between baseline and response scans had no effect on results presented in this study.
Another limitation of this study is the lack of correlative data, e.g. patient group survival data. As discussed earlier, both A50 as well as the proposed registration strategies have limitations and therefore this comparison can only provide limited conclusions. However, both methods use A50 as a common method to delineate VOI and therefore the comparison lies merely in the effect of reusing the baseline VOI after registration as opposed to independently delineating the VOI in all response scans. In addition, although there is no consensus on which (semi-)automatic delineation method to use in response monitoring studies, A50 has been shown to be an accurate and reproducible method to define VOI [3][4][5][6]30].

Conclusions
Non-rigid PET and CTPET image registration may be used to classify response based on SUV max and SUV peak . For MATV, SUV mean , TLG and AUC future studies should be able to assess which method is valid for response evaluations by correlation with survival data.