Multivariate Exploratory Data Analysis Toolbox Template
This workflow integrates the MEDA Toolbox for Matlab and Octave, focusing on data simulation, Principal Component Analysis (PCA), and result visualization. Key steps include simulating multivariate data, applying PCA for data modeling, and creating interactive visualizations. The MEDA Toolbox combines traditional and advanced methods, such as ANOVA Simultaneous Component Analysis (ASCA). The aim is to integrate the MEDA Toolbox into LifeWatch, providing tools for enhanced data analysis and visualization in research.
Background
This workflow is a template for the integration of the Multivariate Exploratory Data Analysis Toolbox (MEDA Toolbox, https://github.com/codaslab/MEDA-Toolbox ) in LifeWatch. The MEDA Toolbox for Matlab and Octave is a set of multivariate analysis tools for the exploration of data sets. There are several alternative tools in the market for that purpose, both commercial and free. The PLS_Toolbox from Eigenvector Inc. is a very nice example. The MEDA Toolbox is not intended to replace or compete with any of these toolkits. Rather, the MEDA Toolbox is a complementary tool that includes several contributions of the Computational Data Science Laboratory (CoDaS Lab) to the field of data analysis. Thus, traditional exploratory plots based on Principal Component Analysis (PCA) or Partial Least Squares (PLS), such as score, loading, and residual plots, are combined with new methods: MEDA, oMEDA, SVI plots, ADICOV, EKF & CKF cross-validation, CSP, GPCA, etc. A main tool in the MEDA Toolbox which has received a lot of attention lately is ANOVA Simultaneous Component Analysis (ASCA). The ASCA code in the MEDA Toolbox is one of the most advanced internationally.
Introduction
The workflow integrates three examples of functionality within the MEDA Toolbox. First, there is a data simulation step, in which a matrix of random data is simulated with a user-defined correlation level. The output is sent to a modeling step, in which Principal Component Analysis (PCA) is computed. The PCA model is then sent to a visualization module.
Aims
The main goal of this template is the integration of the MEDA Toolbox in LifeWatch, including data simulation, data modeling, and data visualization routines.
Scientific Questions
This workflow only exemplifies the integration of the MEDA Toolbox. No specific questions are addressed.
Default
Identification
- Date ( Publication )
- 2023-12-31T00:00:00
- Status
- Under development / Pre operational
- Version
- 1.0
- Keywords
- Multivariate Exploratory Data Analysis
- Keywords
- MEDA Toolbox
- Keywords
- Matlab
- Keywords
- Octave
- Keywords
- Data Simulation
- Keywords
- Principal Component Analysis (PCA)
- Keywords
- Data Visualization
- Keywords
- ANOVA Simultaneous Component Analysis (ASCA)
- Keywords
- Data Science
- Keywords
- Interactive Visualizations
- Keywords
- Computational Data Science Laboratory (CoDaS Lab)
- Access constraints
- Copyright
- Other constraints
- To be confirmed
- Protocol
- DOI
- Service Name
- simuleMV
- Service Description
- Simulation of random multivariate data with correlation.
- Service Name
- Output Model
- Service Description
- A data structure (struct named ‘model’) with fields related to a PCA model: ‘var’ contains the total variance in the input data, ‘lvs’ is a list of principal components considered, ‘loads’ contain the PCA loadings, ‘scores’ contains the PCA scores, ‘type’ contains the string ‘PCA’
- Service Name
- pca_pp
- Service Description
- Principal Component Analysis based on the Singular Value Decomposition.
- Service Name
- Scores
- Service Description
- Scatter plot of scores
- Workflow Helpdesk
- https://helpdesk.lifewatch.eu