Identification

Title

A Statistical Approach to Obtaining a Data Structural Similarity Index Cutoff Threshold

Abstract

Huge climate simulations such as the Community Earth System Model Large Ensemble (CESM-LE) output massive amounts of data, and switching from lossless to lossy compression of this data is inevitable. Applying lossy compression to climate data requires a guarantee that scientific conclusions will not be affected by the compression process. One way to check if a scientist's conclusions will be altered is by assessing if the data has been visually altered in any way. To that extent, the Data Structural Similarity Index Measure (DSSIM) is designed to test the visual similarity between two images. It is based on the well-known Structural Similarity Index Measure (SSIM), but has been modified to fit our application. The DSSIM is computed on two datasets instead of two images. This makes the measure invariant to the plotting parameters. Additionally, the DSSIM is much more efficient to compute on a floating-point dataset than the SSIM. A user study in a previous work determined an appropriate SSIM threshold, above which images are highly likely to be visually indistinguishable. In this work, we use the results of the previous study and statistical techniques to translate the SSIM threshold to an appropriate cutoff threshold for the DSSIM. We find the appropriate threshold by minimizing the pass/fail classification difference between images classified using the DSSIM and the previously acquired SSIM threshold. This threshold results in agreement between the image classification results using either metric in more than 92% of cases.

Resource type

document

Resource locator

Unique resource identifier

code

http://n2t.net/ark:/85065/d7gh9nd9

codeSpace

Dataset language

eng

Spatial reference system

code identifying the spatial reference system

Classification of spatial data and services

Topic category

geoscientificInformation

Keywords

Keyword set

keyword value

Text

originating controlled vocabulary

title

Resource Type

reference date

date type

publication

effective date

2016-01-01T00:00:00Z

Geographic location

West bounding longitude

East bounding longitude

North bounding latitude

South bounding latitude

Temporal reference

Temporal extent

Begin position

End position

Dataset reference date

date type

publication

effective date

2021-09-28T00:00:00Z

Frequency of update

Quality and validity

Lineage

Conformity

Data format

name of format

version of format

Constraints related to access and use

Constraint set

Use constraints

Copyright Author(s). This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Limitations on public access

None

Responsible organisations

Responsible party

contact position

OpenSky Support

organisation name

UCAR/NCAR - Library

full postal address

PO Box 3000

Boulder

80307-3000

email address

opensky@ucar.edu

web address

http://opensky.ucar.edu/

name: homepage

responsible party role

pointOfContact

Metadata on metadata

Metadata point of contact

contact position

OpenSky Support

organisation name

UCAR/NCAR - Library

full postal address

PO Box 3000

Boulder

80307-3000

email address

opensky@ucar.edu

web address

http://opensky.ucar.edu/

name: homepage

responsible party role

pointOfContact

Metadata date

2023-08-18T18:03:15.971048

Metadata language

eng; USA