Improving Statistical Characterization of Data Tensors with the Generalized Canonical Polyadic Tensor Decomposition

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This work explores the potential of the generalized canonical polyadic (GCP) tensor decomposition to be used as a diagnostic tool to determine the underlying statistical nature of a dataset. The GCP reformulates the standard canonical polyadic (CP) decomposition problem as a maximum likelihood estimate and models the natural parameter of the statistical distribution assumed to be associated with a data tensor as opposed to modeling the data itself through the use of a variety of statistically motivated loss functions. This property is of particular interest when a dataset is strongly non-Gaussian, such as is the case with binary or count data. In the work presented, we compare competing CP models of datasets with differing statistical natures to determine if the GCP can be used as an exploratory tool for the statistical characterization of a data tensor of interest. The quality of competing models is assessed via multiple metrics that include fit score, cosine similarity of the tensors, and the Core Consistency Diagnostic (CORCONDIA) score. Results are presented for a variety of artificially generated data tensors.

Original languageEnglish
Title of host publication2025 IEEE High Performance Extreme Computing Conference, HPEC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331578442
DOIs
StatePublished - 2025
Event2025 IEEE High Performance Extreme Computing Conference, HPEC 2025 - Virtual, Online
Duration: 15 Sep 202519 Sep 2025

Publication series

Name2025 IEEE High Performance Extreme Computing Conference, HPEC 2025

Conference

Conference2025 IEEE High Performance Extreme Computing Conference, HPEC 2025
CityVirtual, Online
Period15/09/2519/09/25

Keywords

  • dataset characterization
  • multi-linear algebra
  • tensor decomposition
  • tensors
  • unsupervised learning

Fingerprint

Dive into the research topics of 'Improving Statistical Characterization of Data Tensors with the Generalized Canonical Polyadic Tensor Decomposition'. Together they form a unique fingerprint.

Cite this