Categorical Exploratory Data Analysis: From Multiclass Classification and Response Manifold Analytics perspectives of baseball pitching dynamics

06/25/2020
by Fushing Hsieh, et al.

From two coupled perspectives, Multiclass Classification (MCC) and Response Manifold Analytics (RMA), we develop Categorical Exploratory Data Analysis (CEDA) on the PITCHf/x database to extract the information content of Major League Baseball's (MLB) pitching dynamics. The MCC and RMA information contents are represented, respectively, by a collection of multiscale pattern categories derived from mixing geometries and a collection of global-to-local geometric localities derived from response-covariate manifolds. These collections shed light on the pitching dynamics and map out the uncertainty of popular machine learning approaches. In the MCC setting, a label embedding tree based on an indirect distance measure leads to the discovery of asymmetry in the mixing geometries among the labels' point clouds. A selected chain of complementary covariate feature groups collectively brings out multi-order mixing geometric pattern categories. Such categories then reveal the true nature of MCC predictive inferences. In the RMA setting, multiple response features couple with multiple major covariate features to reveal manifolds that bear physical principles and carry a lattice of natural localities. With the heterogeneous effects of minor features identified locally, these localities jointly weave their focal characteristics into an understanding of the system and provide a platform for RMA predictive inferences. Our CEDA works for universal data types, adopts nonlinear associations, and facilitates efficient feature selection and inference.
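To make the idea of asymmetric mixing among label point clouds more concrete, the following is a minimal Python sketch. It is not the authors' indirect distance measure or their label embedding tree. It swaps in a simple k-nearest-neighbour mixing share and a greedy average-affinity merge, purely for illustration. The function names `mixing_matrix` and `label_tree`, the parameter `k`, the pitch-type labels (FF, SI, CU), and the toy coordinates are all assumptions made up for this example and do not come from PITCHf/x.

```python
import math
from collections import defaultdict


def mixing_matrix(points, labels, k=3):
    """Row a, column b: average share of label-b points among the k nearest
    neighbours of each label-a point. Row a need not mirror column a, so the
    matrix can expose asymmetric mixing between two labels."""
    names = sorted(set(labels))
    counts = {a: defaultdict(float) for a in names}
    sizes = defaultdict(int)
    for i, (p, a) in enumerate(zip(points, labels)):
        # Distances from point i to every other point, nearest first.
        dists = sorted(
            (math.dist(p, q), labels[j])
            for j, q in enumerate(points)
            if j != i
        )
        for _, b in dists[:k]:
            counts[a][b] += 1.0 / k
        sizes[a] += 1
    return {a: {b: counts[a][b] / sizes[a] for b in names} for a in names}


def label_tree(mix):
    """Greedy agglomeration: repeatedly join the two label clusters with the
    highest average symmetrised mixing. Returns the merges in order."""
    clusters = [(name,) for name in mix]
    merges = []

    def affinity(c1, c2):
        vals = [(mix[a][b] + mix[b][a]) / 2 for a in c1 for b in c2]
        return sum(vals) / len(vals)

    while len(clusters) > 1:
        pairs = [
            (affinity(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        ]
        _, i, j = max(pairs)
        merges.append((clusters[i], clusters[j]))
        merged = clusters[i] + clusters[j]
        clusters = [c for n, c in enumerate(clusters) if n not in (i, j)]
        clusters.append(merged)
    return merges


# Toy data (invented): two overlapping clouds and one well-separated cloud.
points = [
    (0, 0), (0, 1), (1, 0), (1, 1),          # hypothetical label "FF"
    (0.5, 0.5), (1.5, 0.5), (1.5, 1.5), (2, 1),  # hypothetical label "SI"
    (10, 10), (10, 11), (11, 10), (11, 11),  # hypothetical label "CU"
]
labels = ["FF"] * 4 + ["SI"] * 4 + ["CU"] * 4

mix = mixing_matrix(points, labels, k=3)
merges = label_tree(mix)
for row_label, row in mix.items():
    print(row_label, {b: round(v, 2) for b, v in row.items()})
print(merges)
```

On this toy input the two overlapping clouds (FF and SI) are merged first and the isolated cloud (CU) joins last. Comparing `mix["FF"]["SI"]` with `mix["SI"]["FF"]` shows how the share of neighbours can differ by direction, which is the kind of asymmetry that a purely symmetric distance would hide.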

research
11/28/2022

Unraveling heterogeneity of ADNI's time-to-event data using conditional entropy Part-I: Cross-sectional study

Through Alzheimer's Disease Neuroimaging Initiative (ADNI), time-to-even...
research
07/29/2020

Extreme-K categorical samples problem

With histograms as its foundation, we develop Categorical Exploratory Da...
research
09/06/2022

Multiscale major factor selections for complex system data with structural dependency and heterogeneity

Based on structured data derived from large complex systems, we computat...
research
12/01/2022

Shining light on data: Geometric data analysis through quantum dynamics

Experimental sciences have come to depend heavily on our ability to orga...
research
06/17/2019

Identifying and characterizing extrapolation in multivariate response data

Extrapolation is defined as making predictions beyond the range of the d...
