|Publication type:||Article in scientific journal|
|Type of review:||Peer review (publication)|
|Title:||Regression imputation with Q-mode clustering for rounded zero replacement in high-dimensional compositional data|
|Published in:||Journal of Applied Statistics|
|Publisher / Ed. Institution:||Taylor & Francis|
|Subjects:||Compositional data; Centered logratio coordinates; Rounded zeros; Cluster analysis; Partial least squares regression|
|Subject (DDC):||510: Mathematics|
|Abstract:||The logratio methodology is not applicable when rounded zeros occur in compositional data. There are many methods to deal with rounded zeros. However, some methods are not suitable for analyzing data sets with high dimensionality. Recently, related methods have been developed, but they cannot balance the calculation time and accuracy. For further improvement, we propose a method based on regression imputation with Q-mode clustering. This method forms the groups of parts and builds partial least squares regression with these groups using centered logratio coordinates. We also prove that using centered logratio coordinates or isometric logratio coordinates in the response of partial least squares regression have the equivalent results for the replacement of rounded zeros. Simulation study and real example are conducted to analyze the performance of the proposed method. The results show that the proposed method can reduce the calculation time in higher dimensions and improve the quality of results.|
|Fulltext version:||Published version|
|License (according to publishing contract):||Licence according to publishing contract|
|Departement:||School of Engineering|
|Organisational Unit:||Institute of Data Analysis and Process Design (IDP)|
|Appears in collections:||Publikationen School of Engineering|
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.