Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Reflectance Prediction Modelling for Residual-Based Hyperspectral Image Coding

  • Manoranjan Paul ,

    mpaul@csu.edu.au

    Affiliation CM3 Research Unit, School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW, 2795, Australia

  • Rui Xiao,

    Affiliation CM3 Research Unit, School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW, 2795, Australia

  • Junbin Gao,

    Affiliation Discipline of Business Analytics, the University of Sydney Business School, The University of Sydney, Sydney, NSW, 2006, Australia

  • Terry Bossomaier

    Affiliation CM3 Research Unit, School of Computing and Mathematics, Charles Sturt University, Bathurst, NSW, 2795, Australia

Abstract

A Hyperspectral (HS) image provides observational powers beyond human vision capability but represents more than 100 times the data compared to a traditional image. To transmit and store the huge volume of an HS image, we argue that a fundamental shift is required from the existing “original pixel intensity”-based coding approaches using traditional image coders (e.g., JPEG2000) to the “residual”-based approaches using a video coder for better compression performance. A modified video coder is required to exploit spatial-spectral redundancy using pixel-level reflectance modelling due to the different characteristics of HS images in their spectral and shape domain of panchromatic imagery compared to traditional videos. In this paper a novel coding framework using Reflectance Prediction Modelling (RPM) in the latest video coding standard High Efficiency Video Coding (HEVC) for HS images is proposed. An HS image presents a wealth of data where every pixel is considered a vector for different spectral bands. By quantitative comparison and analysis of pixel vector distribution along spectral bands, we conclude that modelling can predict the distribution and correlation of the pixel vectors for different bands. To exploit distribution of the known pixel vector, we estimate a predicted current spectral band from the previous bands using Gaussian mixture-based modelling. The predicted band is used as the additional reference band together with the immediate previous band when we apply the HEVC. Every spectral band of an HS image is treated like it is an individual frame of a video. In this paper, we compare the proposed method with mainstream encoders. The experimental results are fully justified by three types of HS dataset with different wavelength ranges. The proposed method outperforms the existing mainstream HS encoders in terms of rate-distortion performance of HS image compression.

Introduction

Hyperspectral (HS) images are concerned with the measurement, analysis, and interpretation of spectra acquired from a given scene (or specific object) at a short, medium or long distance. Depending on the different image collecting method, HS imaging systems broadly fall into two categories: airborne and ground-based. [1]. HS images are produced by spectrometers, which divide images into many bands (i.e. wavelength images). This is different from greyscale or colour images (RGB) with only one or a few relatively broad wavelength bands, mostly in three bands (red, green, and blue). HS images can provide a wealth of spectral information, usually beyond visible wavelengths.

A HS image includes hundreds of narrow and contiguous spectral bands. Each band which corresponds to intervals of wavelengths can range from 400nm to 2500nm or more. A HS image with distinct reflectance-distribution in particular wavelengths [1] shows the distribution of reflectance values in different wavelength bands for a number of different types of objects such as soil, vegetation, and water, which provide a unique signature based on the reflectance values. With the recent development in electronics and imaging sensor design, HS cameras are becoming affordable which opens new applications in agriculture, mineralogy, physics and surveillance around the world in laboratory and industry settings that use HS imaging.

A HS image represents 100 times more data compared to a traditional RGB (red, green, blue) image if we consider the same resolution of a HS image with 300 bands and an RGB image with three color channels. Thus, any processing and transmission of a HS image needs a sophisticated compression strategy. The existing lossless (where original data can be perfectly reconstructed from compressed data) HS image compression standard [2] only provides a very small compression ratio that limits HS applications. Moreover, there is no lossy HS compression standard in existence as yet. We argue that a fundamental shift is required from the existing “original pixel intensity”-based coding approaches using traditional image coders (e.g. JPEG2000) to the “residual” based approaches similar to a video coder (e.g. HEVC [3]) to achieve high compression ratios for HS images. The system can be made analogous to video coding by treating each band as a temporal frame. A high level of correlation exists in bands as they correspond to the same location of objects. However, bands corresponding to different wavelengths have different dynamic ranges, and thus do not lend themselves easily to simple residual coding [4].

Applying a video coder to the HS-residual in a straightforward way would not provide the expected compression due to the larger magnitude of the residual compared to traditional video frames where motion prediction modelling is used. Thus, a significant compression gain could be achieved if we can correctly predict a spectral band from previously encoded bands to minimize the residual in closed loop coding.

In this paper, a novel lossy compression framework using Reflectance Prediction Modelling (RPM) for a HS image is proposed. To take advantage of the similarity and variability among contiguous spectral bands, we generate a common informatics wavelength (CIW) band in each band using a Gaussian mixture modelling and a linear spectral predicted band by considering the differences of spectral bands. The first wavelength band is encoded as intra-coded and the remainder of wavelength bands in a HS image are coded as inter-coded (similar to video coding) by using two reference bands: one is the immediately previous encoded wavelength band and the other is an instant CIW band. Obviously the CIW band is updated before encoding the current band using the spectral predicted band through Gaussian modelling. This is a new approach to using HEVC inter-coding for HS data compression, with RPM noticeably improving the HEVC-inter compression performance in our experiments. The results of the experiment confirm that the proposed compression technique based on RPM outperforms the standalone HEVC and other leading-edge compression techniques in terms of rate-distortion performance.

Background

Several types of HS image compression studies are available in literature based on spectral decorrelation and spectral dimensionality reduction. One of the commonly used algorithms is based on Principal Component Analysis (PCA) [57]. PCA is a dimensionality reduction method. The original image is projected onto the subspace based on the variance within the data. It changes the original HS image physical data structure and also costs part of the image data loss. Another approach is to keep HS data in the original spectral dimension. A HS image can be encoded without significant loss of information, for this purpose optimal compression cube (OCC) [8] has been introduced. It calculates the average of cross correlation of the intra-band then determines an OCC for higher data redundancy. OCC has well-preserved transform information and can achieve higher rate-distortion performance than PCA-based compression techniques while applying different existing encoder techniques. Tang, Xing, Li and Wang [9] use an adaptive band selection to reduce dimensionality. In this method, adjacent bands are arranged into one group and compressed using the JPEG-LS standard. Other than the above-mentioned methods, PCA-discrete cosine transform (DCT) and PCA-JPEG2000 have also been widely applied in HS image compression fields [1018].

Numerous 2D compression techniques are extended for specific 3D HS data structures, such as 3D discrete wavelet transform (DWT) [1820] and 3D-DCT. In the 2D DCT of the JPEG standard, a block size of 8 × 8 is used for block-based compression. In 3D DCT 8 × 8 × 8, a sub-cube is applied. A number of improved 3D-DCT based approaches are proposed in [21][22]. Distributed source coding (DSC) [2325] schemes have received more attention due to their low complexity and error resilience which satisfy the requirements of onboard compression systems. The context-based adaptive lossless image coding (CALIC) is another commonly used technique for HS image compression [26]. This article [27] extends the CALIC algorithm from 2D to 3D-CALIC.

Other popular compression algorithms for HS images using wavelets are EZW- Embedded Zero-trees Wavelet [28][29], the SPIHT (Set Partitioning in Hierarchical Trees) [30] and the SPECK (Set Partitioning Embedded Block) [31][32]. Modified SPIHT algorithms are also introduced for compressing multispectral images [30]. The SPIHT possesses a number of desirable properties such as competitively good performance, low complexity, and embedded encoding which make it perfectly suited for the task of compressing multispectral images. Moreover, in the majority of cases SPIHT outperforms SPECK [33]. Hence in our experiments, we have chosen to compare our algorithm with state-of-the-art 3D-SPIHT. Additionally, Wavelet difference reduction (WDR) is another latterly proposed method for efficient embedded image coding [34][35]. The ASWDR algorithm is a simple modification of the WDR algorithm. Raja and Suruliandi [34] compared WDR and ASWDR in terms of parameters such as PSNR (peak signal to noise ratio) and MSE (mean square error). According to their results, the ASWDR technique performs better than the WDR.

Lossless and lossy compression ratios for HS images using the above mentioned original pixel-based encoding techniques are around 3:1 and 80:1, whereas the lossless and lossy compression of a video using the latest video coding standard—i.e. HEVC- are 3:1 and 1000:1 respectively, depending on the application requirements [36][4]. Thus, there is scope to improve the lossy compression ratio with a large margin for a HS image if we fundamentally shift from the existing “original pixel reflectance”-based coding approaches using traditional image coders (e.g. JPEG200) to the “residual”-based approaches using a video coder. Moreover, entropy analysis of HS images, based on the original pixel reflectance and residual between adjacent bands, shows that residual-based entropy is smaller compared to the original pixel reflectance entropy (see Fig 1). Note that smaller entropy provides more compression as it takes fewer bits to encode. The figure reveals that the entropy using residual is only 19% to 33% of the original pixel reflectance for five NASA HS images [10]. The figure indicates that a residual-based encoder—e.g. video coder—should provide better compression compared to the original pixel reflectance-based encoder (i.e. JPEG2000).

thumbnail
Fig 1. Entropy analysis using original pixel reflectance and residual between adjacent bands for HS images from NASA dataset [10].

https://doi.org/10.1371/journal.pone.0161212.g001

The HS image compression technique [37][38] using H.264 (predecessor of HEVC) outperforms a number of conventional image-based HS image compression techniques where both spatial (i.e. intra) and spectral (i.e. inter) correlations are exploited. Although it is a residual-based approach, it could not outperform existing HS compression techniques with large margins due to the relatively larger magnitude of HS-residual between adjacent bands. To achieve better compression we need to develop a predicted band [39] similar to the current band for smaller residual.

Recently dynamic background modelling (DBM)-based video coding [40] has attracted more attention due to its capability of better compression and reduction of computational time compared to its rival algorithms. The DBM technique represents background and foreground pixel intensities over time using Gaussian Mixture distributions based on means and variances [41]. The DBM exploits uncovered dynamic background areas through a Gaussian model to improve the rate-distortion and computational performance for encoding. The basic idea of DBM is to keep the pixel intensity for background frame generation, which does not change too much over time (i.e. for a number of consecutive frames). HS image structure can be assumed as a video if we consider different wavelength bands as different temporal frames of a video. However, the characteristics of spectral band-images are different compared to video frames. In the video, objects in the foreground are usually more prone to be moving and the background remains somewhat static, so it remains almost the same over multiple frames. In a HS image, spectral reflectance changes over the different bands and there is hardly any object for which reflectance does not change in the spectral bands. Thus, we cannot use a DBM to predict the upcoming wavelength band, as spectral reflectance in each wavelength band is different. Straightforward application of background modelling for HS image compression in the HEVC [3] framework did not provide satisfactory results. Note that a HEVC [42] compression and coding technique should provide comparable compression performance for a HS image compared to other techniques such as JPEG, Wavelet-based encoders, and PCA as it can exploit the residual of the adjacent wavelength bands. This is due to the magnitude of the reflectance-residues between adjacent wavelength bands usually being smaller compared to the original reflectance.

The multiple experiments were performed using three different HS datasets (described in Section III) covering both airborne and ground-based types of HS images to understand the reflectance in different wavelength bands and the performance of different methods including the proposed method. We have compared the proposed HS image coding technique with the relevant state-of-the-art scheme JPEG2000, JPEG, PCA-DCT, HEVC and other wavelet-based encoders 3D-SPIHT, ASWDR and EZW. We observed that the spectral reflectance changes gradually with different bands. Obviously the gradient of changes is varied for different objects. Thus, besides applying DBM in the HS image compression under the HEVC coding framework in a straightforward way, we need to model a CIW band using predicted spectral reflectance so that the generated CIW band is closer to the current wavelength band compared to the previous wavelength band. Using the CIW band as an extra reference band in the HEVC video coding framework should provide more compression while encoding the current wavelength band. This is due to higher similarity with the current band compared to the previously encoded wavelength bands. The results of the experiment confirm that the proposed compression technique based on RPM outperforms the standalone HEVC and other mainstream compression techniques.

Dataset

Brimrose HS camera

Our experimental HS images were captured using a Brimrose HS camera at Griffith University, Australia, consisting of an array of spectral reflectance of size 1392×1204×61. The images were captured of both indoor and outdoor scenes under daylight illumination. An optical tunable filter was used, capable of acquiring a HS image by sequentially tuning the filter through a series of 61 narrow wavelength bands, each with approximately 10nm bandwidth and interval steps at 10nm, starting from 400nm to 1000nm. Two examples of the HS images namely Pot and Vase are shown in Fig 2. A particular 500nm wavelength band in the visual range is used to represent the HS image.

thumbnail
Fig 2. Two examples of Hyperspectral images (for a particular wavelength 500nm) used in the experiments, the images are generated from the raw data provided by Griffith University.

https://doi.org/10.1371/journal.pone.0161212.g002

Natural scenes dataset

On top of the dataset collected from the Brimrose HS camera, another publicly available HS dataset popularly known as natural scenes [7] was also used in the test. The natural scenes dataset consists of a mixture of rural scenes from the Minho region of Portugal, containing rocks, trees, leaves and grass, and urban scenes from the cities of Porto and Braga. Images were obtained during the summers of 2002 and 2003. Particular care was taken to avoid scenes containing movement. Scenes were illuminated by direct sunlight in a clear or almost clear sky. Each HS image has wavelength bands from 400 to 720nm with 1018×1339×33 resolutions (however, Scene 5 has 32 bands).

Airborne visible/infrared imaging spectrometer (AVIRIS)

The dataset used in this part of the experiment is from Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) reflectance data [10]. AVIRIS data are mainly collected for identifying, measuring, and monitoring constituents of the earth's surface and atmosphere based on molecular absorption and particle scattering signatures. It delivers calibrated images with 224 contiguous spectral bands and approximately 10nm spectral resolutions covering the 0.4–2.5μm spectral range. Data was collected from mineral mapping at Cuprite, Nevada.

Reflectance Prediction Modelling

An RGB image cube has three (Red Green and Blue) colour components, along with wavelength axis. In this aspect a HS image can be considered as an image cube where the wavelength axis is represented by hundreds of contiguous spectral bands. A pixel in a HS image is actually a vector with a dimension equal to the number of spectral bands. Such inter-band spectral information can be used for spectral prediction. In this paper we propose the RPM technique to improve the compression capability of a HS image and Gaussian mixture-based CIW band modelling into the HEVC video-coding framework. Note that we have converted a 16-bit HS image into an 8-bit HS image before applying different encoders.

Motivation of multispectral data prediction

If we consider spectral information of a HS image, we can easily find the existence of spectral correlation between adjacent bands. The similarity and variability among contiguous bands can be presented preferably by RPM. Fig 3(A) and 3(B) show a graphical representation of the spectral reflectance of randomly selected 20 pixels in 61 wavelength bands ranging from 400nm to 1000nm using two HS images. Fig 3(C) shows similar reflectance tendency over the different bands using the AVIRIS HS image; the dataset comprises 220 bands with a spectral coverage from 0.4 to 2.5 μm. Five pixels representing ‘soybean’, ‘hay’, ‘corn’, ‘wheat’ and ‘wood’ were selected to plot the reflectance. A given HS matrix for a band can be presented as:

thumbnail
Fig 3.

The distribution of spectral reflectance in different bands: (a) and (b) the effective spectral reflectance of 20 pixels randomly selected in the range of 61 bands (400nm to 1000nm) of two HS images namely ‘Soil’ and ‘Strawberry’ captured by the Brimrose HS camera at Griffith University.

https://doi.org/10.1371/journal.pone.0161212.g003

Each pixel x along spectral bands n is a vector. Fig 3A & 3B show different reflectance against different wavelengths for two HS images, namely ‘Soil’ and ‘Strawberry’ captured by Brimrose HS camera at Griffith University. They are mapped into 8-bit reflectance. The reflectance of images with a different range of bands/wavelengths, scene and intensity also demonstrates that probability distribution and correlation of the pixel vector x are measureable and predictable. Thus, prediction modelling can be developed through quantitative comparison via spectral similarity measures, allowing for a HS image to present as a single slice key band with spectral shape information.

The observed spectral reflectance in different HS data shows varying degrees of correlation although the rate of increase or decrease might vary depending on the content of the objects in a HS image. The differences in spectral information between two adjacent wavelength bands is typically very small and therefore includes a large amount of data redundancy. For a particular material and a small range of wavelengths the shape of the reflectance/radiance in different bands follow Gaussian distribution. Thus, it is possible to generate a CIW using the existing Gaussian mixture-based modelling [40][41] by customizing the modelling mechanism.

Customized gaussian mixture modelling for a HS image

A customized DBM based on Gaussian Mixture Modelling (GMM) is introduced in this paper. GMM is a weighted sum of K Gaussian models to characterize pixels’ intensity of the HS image. When each new slice acquires at the current nth band, the value of a pixel reflectance is used to match with the GMM models. If a match is found, the pixel intensity is included in the model and updates its parameters based on the newly satisfied pixel intensity. If a match is not found then a new model is introduced where mean is assigned as the new pixel intensity and weight is assigned an arbitrarily high value. If the maximum number of K Gaussian models is reached, it is replaced by a new model. Normally the replaced model selected is based on the lowest ratio of weight and standard deviation amongst all models. The model, which provides the highest value of weight and lowest value of standard deviation, is normally classified as the background pixel.

If we assume that the k-th Gaussian representing a pixel reflectance with mean μk, variance , and weight wk, then The Gaussians are always ordered based on ω/σ in descending order, with the top Gaussian modelling the most stable reflectance. The system starts with an empty set of models and then for every new observation Xn at the current band n is first matched against the existing models in order to find one (say the kth model) such that If such a model exists, its associated parameters, μk, , and ωk, are updated. Otherwise, a new Gaussian is introduced with μK = Xn, arbitrarily high σ, and arbitrarily low ω by evicting the last model if it exists (based on ω/σ in descending order). A pseudo code for the CIW generation steps is given in Fig 4.

  1. If the current pixel Xn meets the condition we consider Xn satisfies with one of the Gaussians models, i.e. a match is found and then updates all relevant parameters.
  2. If no match is found with any of the K Gaussians; the pixel might be a new background or foreground. Another Gaussian model needs to be introduced with an initial parameter setting [40][41].
  3. If the number of the Gaussian models has already reached the maximum number, we need to evict an existing model based on the ratio of weight and standard deviation.

The parameters are updated as follows where α is the learning parameter: (1) (2) (3)

A mixture of K Gaussian distribution models is used to model each pixel reflectance independently. The pseudo code to generate a CIW band (i.e. ψ) uses pixel reflectance of different bands, which is very similar to the pseudo code described in [39][41]. The main difference is that we use a predicted band pixel reflectance instead of the previously encoded band. The other difference is that we use the mean reflectance value of the maximum weighted model to generate a CIW band rather than the most recent pixel reflectance. A pseudo code of CIW generation is given in Fig 4.

The CIW is an approximation of the current trend of the band based on previously encoded bands. It represents the average trend of the pixel reflectance over different bands. Thus, it is a good approximation of the common informatics of pixel reflectance. The CIW band is very close to the current band, it is used as an extra reference band to encode the current band. Sometimes, the CIW might provide a large residue compared to the current band due to a sudden change of the reflectance of material in a wavelength. To tackle this phenomenon we also use the immediate previous band as another reference band for encoding the current band. The Lagrangian multiplier [4345] is used to select an appropriate reference image between the CIW and the immediate previous band based on the bit requirements and distortion in block level. Instead of providing each encoded wavelength band into the DBM we use a spectral predicted wavelength band. For example when we are encoding nth wavelength band, (n-1)th and other previously encoded wavelength bands are used to generate a spectral predicted band, based on the correlation of previous bands, then sending the predicted band to the CIW modelling. Subsequently we use the generated nth CIW as an additional reference band to encode the nth wavelength band.

Spectral predictor and compression

The above-mentioned Gaussian models determine the CIW band using a number of wavelength bands. Our proposed method predicts the current nth band by using gradients of two immediate previous encoded bands, i.e. (n-1)th and (n-2)th bands and previous trend of gradients for creating a reference band. The current gradient is calculated as follows: (4) where Dn is the weighted gradient for nth band using the previous weighted gradient Dn-1, the weighted gradient between two recently encoded bands Pn-1 and Pn-2. We use w = 0.3 to balance between the previous gradient and the most recent gradient. To generate a CIW we need to provide a spectral predicted band which is generated by Bn = Pn−1 + Dn. The rationality of adding the gradient with the previous encoded band to obtain the predicted band is that we assume the variation of the reflectance in the next wavelength follows the same tendency of gradient that it experienced recently. If we assume that the CIW modelling function is Φ, then we can obtain the CIW, and Pn-1 are used as reference bands to encode the current Pn wavelength band. The whole prediction process is presented in Fig 5.

thumbnail
Fig 5. Block diagram of the spectral prediction and referencing technique for the proposed method.

https://doi.org/10.1371/journal.pone.0161212.g005

Experiment Result

The RPM algorithm is implemented by using GMM under the HEVC video coding framework. In the proposed method we use dual reference frames where the predicted band is used as the second reference frame and the immediately previous band is used as the first reference frame. In HEVC we use two immediately previous bands as the reference frames. We evaluate RPM with other state-of-the-art image encoders JPEG2000, JPG, HEVC-intra and PCA-DCT at the first set of tests, then by a performance review of RPM with wavelet transform-based encoders, conducted in the next section.

As mentioned in Section III, experiments were performed on test images taken from three sets of images: Flower and Vase, Scene5 and Scene6 and AVIRIS data from Cuprite_1 to Cuprite_4 and Indian_Pine. To evaluate the RPM encoder we present compression performance in Fig 6. It is clear that the proposed RPM scheme outperforms other encoders from the results. The observed difference in PSNR is between 0 to 5.0 dB. Furthermore, JPEG2000 and HEVC are two competitive encoders according to the content of HS sample images. PCA-DCT could not obtain high PSNR because it reconstructs the values of physical bands by eigenvectors. PCA selectively discards part of the energy of a HS image signal since only a subset of the eigenvectors has been selected. It is noteworthy that RPM has the most visible superiority for the Nature Scene and AVIRIS datasets. The reason we reach this conclusion is that the Brimrose HS camera collects Flower and Vase in the laboratory environment with a small portion of objects and a large portion of black background, where almost all encoders perform in a similar way. The other two datasets Scene and AVIRIS, were collected from complicated outdoor environments, particularly the Cuprite set. Consequently, RPM can acquire great prediction results for images of a complicated scene. The performance of the RPM is better compared to other encoders except JPEG2000 for the Indian_Pine HS image. Probable reasons for this would be the different entropy and resolution of the Indian_Pine image compared to the Cuprite HS images. The entropy of the Indian_Pine image based on the residual is 40% larger compared to the Cuprite HS images, and the original resolution of the Indian_Pine HS image is smaller compared to the Cuprite HS images.

thumbnail
Fig 6. Rate distortion performance of nine HS images using the proposed RPM method, JPEG2000, JPEG, PCA-DCT, and HEVC encoder techniques.

https://doi.org/10.1371/journal.pone.0161212.g006

As the encoded bands are used in the predicted and CIW generation process in both encoder and decoder, we do not need to encode the CIW band to transmit for the decoder. We use a dual reference band and IPPP format (i.e. the first wavelength band is intra-coded and other bands are predicted as inter-coded) for both standalone HEVC and the proposed RPM algorithm in the HEVC platform. We do not use any motion estimation, as unlike video, each wavelength band has no motion compared to its previous image. However, we use HEVC tree-structured block partitioning to get the best-matched block to minimize residual error.

We know that less prediction error variance always leads to less residual noise, less prediction distortion and higher compression efficiency. Since no scene change happens, CIW-based prediction provides smaller residue compared to the inter-bands’ residue in some cases, and this results in a better compression ratio compared to the HEVC. Moreover, the proposed technique provides better rate-distortion performance compared to some wavelet-based encoders due to its better ability to exploit the correlation between inter-band similarities.

Fig 7 shows computational time comparison and amount of references used by the second reference band between the proposed RPM scheme and the HEVC scheme. Fig 7 shows that the proposed scheme requires 3~4% extra computational time compared to HEVC, as the proposed scheme needs some computational time for generating the predicted band and the CIW band, through DBM modelling. The figure also reveals that the proposed scheme provides almost constant reference areas using the second reference band, i.e. the CIW band, which is around 10% for different bit rates and quality. On the other hand, the HEVC provides very high reference at the low bit rates and very low reference at high bit rates. This indicates the worthiness of the CIW band as the reference band for a wider range of bit rates.

thumbnail
Fig 7. Computational time requirement of the proposed RPM scheme with respect to HEVC (left) and the percentage of reference from the predicted spectral band i.e. CIW band and the 2nd reference band by the proposed scheme and the HEVC respectively.

https://doi.org/10.1371/journal.pone.0161212.g007

For the purpose of comprehensive comparisons to a variety of encoders, in this section we have compared RPM with a few standard wavelet based image coders: 3D-SPIHT, ASWDR and EZW. According to the experimental results published in [46], it was demonstrated that 3D-SPIHT can obtain a better compression result; bit per pixel per band (bpppb) is lower than 3D-EZW at the same level of PSNR. Therefore, we compare 3D-SPIHT with our method. Furthermore, it is worthy of mention that one of the most recent image compression algorithms is the adaptively scanned wavelet difference reduction (ASWDR) algorithm. ASWDR is another benchmark encoder based on wavelets to compare with distortion performance and compression ratio (refer to [34]). The scanning order of ASWDR is different with WDR. It dynamically adapts to the location of edge details in an image and this enhances the resolution of these edges in compressed images. Accordingly, ASWDR exhibits better perceptual qualities, especially at low bit rates.

The rate-distortion results are summarized into two categories, low bit rates (0.5bpppb) and high bit rates (1 bpppb) in Table 1. It can be observed that RPM has presented excellent performance compared to other wavelet-based encoders in most of cases. Particularly outstanding performance is shown in the high bit-rate range, and 3D-SPIHT techniques also provide competitive results.

thumbnail
Table 1. Rate distortion performance of four HS images using the proposed RPM method, 3D_SPIHT, EZW, and ASWDR.

https://doi.org/10.1371/journal.pone.0161212.t001

We have conducted some experiments to validate prediction error between a CIW band and the original reference band. For clear comparison we have calculated average pixel reflectance of a randomly selected 16×16 pixel block from the middle of the HS image for both the original bands and the predicted bands. Fig 8 illustrates the comparison results of predicted reflectance and original reflectance for two images. The figure confirms that the predicted pixel reflectance is very close to the original pixel reflectance over the different bands. This indicates that in a number of cases the CIW band is selected against the immediate previous band as a reference band to encode the current band.

thumbnail
Fig 8. Comparison of predicted reflectance and original reflectance for two sample HS images.

https://doi.org/10.1371/journal.pone.0161212.g008

Conclusion

In this paper, HEVC-inter encoding is used for HS data compression under the Reflectance Prediction Modelling technique to improve compression capability. We extract an instant spectral band of the HS image using Gaussian mixture-based modelling. The generated instant common informatics wavelength is used as the additional reference band for encoding the current band under the latest video coding standard HEVC framework. A HEVC framework is used by considering different bands of a HS image as a different frame or picture of a video.

The proposed HS image coding technique outperforms the relevant state-of-the-art scheme JPEG2000, JPEG, PCA-DCT, HEVC and other wavelet-based encoders 3D-SPIHT, ASWDR and EZW. Overall, the proposed method generates a higher compression rate while maintaining the quality of the image with comparable computational time so that it covers a wide range of applications.

Author Contributions

  1. Conceptualization: MP.
  2. Data curation: RX.
  3. Formal analysis: MP RX.
  4. Funding acquisition: MP.
  5. Investigation: MP RX.
  6. Methodology: MP RX.
  7. Project administration: MP RX JG TB.
  8. Resources: RX.
  9. Software: MP RX.
  10. Supervision: MP.
  11. Validation: MP RX.
  12. Visualization: MP RX.
  13. Writing – original draft: MP RX.
  14. Writing – review & editing: MP RX.

References

  1. 1. Plaza A, Benediktsson JA, Boardman JW, Brazile J, Bruzzone L, Camps-Valls G, et al. Recent advances in techniques for hyperspectral image processing. Remote Sens. Environ. 2011;113: S110–S122.
  2. 2. Lossless Multispectral & Hyperspectral Image Compression standard CCSDS 120.2-G-1, 2015.
  3. 3. Podder P, Paul M, Murshed M. Efficient coding strategy for HEVC performance improvement by exploiting motion features. In: Proceedings of the 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2015.
  4. 4. Miguel AC, Ladner RE, Riskin EA, Hauck S, Barney DK, Askew AR, et al. Predictive Coding of Hyperspectral Images. Hyperspectral Data Compression. ; 2006; 197–231.
  5. 5. Motta B, Rizzo F, Storer J. Hyperspectral data compression. New York: Springer Science+Business Media; 2006.
  6. 6. Huber-Lerner M, Hadar O, Rotman S, Huber-Shalem R. Compression of hyperspectral images containing a subpixel target. IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing. 2014;7(6): 2246–2255.
  7. 7. Foster DH, Amano K, Nascimento S, Foster MJ. Frequency of metamerism in natural scenes. Josa a. 2006;23: 2359–2372. pmid:16985522
  8. 8. Top-Down Urban Forest Assessment Tools—American Forests, 2015. [Online]. Available: http://www.americanforests.org/our-programs/urbanforests/urban-forests-tools-resources/urban-forest-assessments-resource-guide/urban-forest-assessment-tools/top-down-urban-forest-assessment-tools/.
  9. 9. Tang Y, Xing Q, Li G, Wang JW. Lossless compression of hyperspectral images based content. Optics and Precision Engineering. 2012;20(3): 668–673.
  10. 10. AVIRIS—Airborne Visible/Infrared Imaging Spectrometer, 2015. [Online]. Available: http://aviris.jpl.nasa.gov/.
  11. 11. Xiao R, Paul M. Efficient compression of hypersepctral images using optimal compression cube and image plane. Lect. Notes Comput. Sc. 2015;8935: 167–179.
  12. 12. Blanes I, Serra-Sagrista J. Cost and scalability improvements to the Karhunen–Loêve transform for remote-sensing image coding. IEEE Trans. Geosci. Remote Sens. 2010;48(7): 2854–2863.
  13. 13. Lei W, Jiaji W, Licheng J, Guangming S. Lossy-to-lossless hyperspectral image compression based on multiplierless reversible integer TDLT/KLT. IEEE Geosci. Remote Sens. Lett. 2009;6(3): 587–591.
  14. 14. Ly QDN, Fowler JE. An operational approach for hyperspectral image compression. IEEE Intl Geosci. Remote Sens. Symposium (IGARSS). 2012: 1357–1360.
  15. 15. Zhu W, Du Q, Fowler JE. Multitemporal hyperspectral image compression. IEEE Geosci. Remote Sens. Lett. 2011;8(3): 416–420.
  16. 16. Du Q, Fowler JE. Hyperspectral image compression using jpeg2000 and principal component analysis. IEEE Geosci. Remote Sens. Lett. 2007;4(2): 201–205.
  17. 17. Penna B, Tillo T, Magli E, Olmo G. Transform coding techniques for lossy hyperspectral data compression. IEEE Trans. Geosci. Remote Sens. 2007;45(5): 1408–1421.
  18. 18. Rasti B, Sveinsson JR, Ulfarsson MO, Benediktsson JA. Hyperspectral image denoising using 3D wavelets. In: Proceedings of IEEE Intl Geosci. Remote Sens. Symposium (IGARSS); 2012. p1349-1352.
  19. 19. Tang Y, Lu Y, Yuan H. Hyperspectral image classification based on three-dimensional scattering wavelet transform”, IEEE Trans. Geosci. Remote Sens. 2015;53(5): 2467–2480.
  20. 20. Chen L, Bei C, Zhai Y, Wu J. Efficient wavelet-based compression for hyperspectral images using precompression algorithm. In: Proceedings of the International Conference on Multimedia and Signal Processing (CMSP); 2011;1. p193-197.
  21. 21. Qiao T, Ren J, Sun M, Zheng J, Marshall S. Effective compression of hyperspectral imagery using an improved 3D DCT approach for land-cover analysis in remote-sensing applications., Int. J. Remote Sens. 2014;35(20): 7316–7337.
  22. 22. Karami A, Beheshti S, Yazdi M. Hyperspectral image compression using 3D discrete cosine transform and support vector machine learning. In: Proceedings of the 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA); 2012. p809-812.
  23. 23. Nian Y, He M, Wan J. Lossless and near-lossless compression of hyperspectral images based on distributed source coding. J. Vis. Commun. Image R. 2014;28: 113–119.
  24. 24. Pan X, Liu R, Lv X. Low-complexity compression method for hyperspectral images based on distributed source coding. IEEE Geosci. Remote Sens. Lett. 2012;9(2): 224–227.
  25. 25. Abrardo A, Barni M, Magli E, Nencini F. Error-resilient and low-complexity onboard lossless compression of hyperspectral images by means of distributed source coding. IEEE Trans. Geosci. Remote Sens. 2010;48(4): 1892–1904.
  26. 26. Magli E, Olmo G, Quacchio E. Optimized onboard lossless and near-lossless compression of hyperspectral data using CALIC. IEEE Geosci. Remote Sens. Lett. 2004;1(1): 21–25.
  27. 27. Xiaolin Wu W, Memon N. Context-based lossless interband compression-extending CALIC. IEEE Trans. Image Process. 2000;9(6): 994–1001. pmid:18255470
  28. 28. Kong X, Shu N, Huang W, Fu J. The research on effectiveness of spectral similarity measures for hyperspectral image. In: Proceedings of the 3rd International Congress in Image and Signal Processing (CISP) Vol. 5; 2010. p2269-2273.
  29. 29. Cheng KJ, Dill JC. An improved EZW hyperspectral image compression. Journal of Computer and Communications. 2014;2: 31–36.
  30. 30. Rehman M, Muhammad S, Mudassar R. Image compression: a survey. Research Journal of Applied Sciences, Engineering and Technology. 2014;7(4): 656–672.
  31. 31. George R, Manimekalai MAP. A novel approach for image compression using zero tree coding. In: Proceedings of the International Conference on Electronics and Communication Systems (ICECS); 2014. p1-5.
  32. 32. Suruliandi A, Raja SP. Empirical evaluation of EZW and other encoding techniques in the wavelet-based image compression domain. Int. J. Wavelets Multi. 2015;13(2).
  33. 33. Pearlman WA, Islam A, Nagaraj Nithin, Said A. Efficient, Low-Complexity Image Coding with a Set-Partitioning Embedded Block Coder. IEEE T CIRC SYST VID. 2004; 14(11): 1219–1235.
  34. 34. Raja SP, Suruliandi A. Image compression using WDR & ASWDR techniques with different wavelet codecs. ACEEE Int. J. Inform. Technol. 2011;1: 23–26.
  35. 35. Dalal S, Rani P. Memory efficient WDR (wavelet difference reduction) using inverse of echelon form by equation solving. International Journal of Computer Science and Mobile Computing. 2014; 3(7):512–521.
  36. 36. Sullivan GJ, Boyce J, Chen Y, Ohm JR. Standardized extensions of High Efficiency Video Coding (HEVC),” IEEE J. Selected Topics in Signal Processing. 2013; 7: 1001–1016.
  37. 37. Santos L, Lopez S, Callico GM, Lopez LF, Sarmiento R. Performance evaluation of the H.264/AVC video coding standard for lossy hyperspectral image compression. IEEE J. Sel. Topics Appl. Earth Observ. in Remote Sens. 2012;5(2): 451–461.
  38. 38. Gao F, Ji X, Yan C, Dai Q. Compression of multispectral image using HEVC. In: Dai Q, Shimura T, editors. Optoelectronic Imaging and Multimedia Tech: Proceedings of SPIE 2014; vol. 9273. https://doi.org/10.1117/12.2070610
  39. 39. Ibarria L, Lindstrom P, Rossignac J. Spectral predictors. In: Proceedings of IEEE Computer Science 2007 Data Compression Conference (DCC’07). p163-172.
  40. 40. Paul M, Lin W, Lau CT, Lee BS. Explore and model better I-frames for video coding. IEEE Trans. Circuits Syst. Video Technol. 2011;21(9): 1242–1254.
  41. 41. Haque M, Murshed M, Paul M. Improved Gaussian mixtures for robust object detection by adaptive multi-background generation. In: Proceedings of IEEE Conference on Pattern Recognition; 2008. p.1-4.
  42. 42. Hanhart P, Rerabek M, Korshunov P, Ebrahimi T. Subjective evaluation of HEVC intra coding for still image compression. In: Proceedings of the Seventh International Workshop on Video Processing and Quality Metrics for Consumer Electronics-VPQM; 2013.
  43. 43. Paul M, Murshed M. Video coding focusing on block partitioning and occlusions. IEEE Trans. Image Process. 2010;19(3): 691–701. pmid:19789112
  44. 44. Paul M. Efficient multi-view video coding using 3D motion estimation and virtual frame. Neurocomputing. 2016;175: 544–554.
  45. 45. Paul M Lin W, Lau CT, Lee BS. A Long-Term Reference Frame for Hierarchical B-Picture-Based Video Coding. IEEE Trans. Circuits Syst. Video Technol. 2014;24(10): 1729–1742.
  46. 46. Chen Z, Mu C, Xu F. An improvement of embedded zerotree wavelet coding based on compressed sensing. IEEE International Conference on Software Engineering and Service Science. 2014: 1177–1180.