Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

A Human-in-the-Loop Segmented Mixed-Effects Modeling Method for Analyzing Wearables Data

A Human-in-the-Loop Segmented Mixed-Effects Modeling Method for Analyzing Wearables Data A Human-in-the-loop Segmented Mixed-effects Modeling Method For Analyzing Wearables Data Segmented Modeling For Wearable Data Analytics KARTHIK SRINIVASAN0F* School of Business, University of Kansas, Lawrence KS, U.S., karthiks@ku.edu FAIZ CURRIM Eller College of Management, University of Arizona, Tucson AZ, U.S., fcurrim@eller.arizona.edu SUDHA RAM Eller College of Management, University of Arizona, Tucson AZ, U.S., ram@eller.arizona.edu Wearables are an important source of big data as they provide real-time high-resolution data logs of health indicators of individuals. Higher-order associations between pairs of variables is common in wearables data. Representing higher-order association curves as piece-wise linear segments in a regression model makes them more interpretable. However, existing methods for identifying the change points for segmented modeling either overfit or have low external validity for wearables data containing repeated measures. Therefore, we propose a human-in-the-loop method for segmented modeling of higher-order pairwise associations between variables in wearables data. Our method uses the smooth function estimated by a generalized additive mixed model to allow the analyst to annotate change point estimates for a segmented mixed-effects model, and thereafter employs the Brent’s constrained optimization procedure to fine-tuning the manually provided estimates. We validate our method using three real-world wearables datasets. Our method not only outperforms state-of-the-art modeling methods in terms of prediction performance but also provides more interpretable results. Our study contributes to health data science in terms of developing a new method for interpretable modeling of wearables data. Our analysis uncovers interesting insights on higher order associations for health researchers. CCS CONCEPTS • Information Systems • Information systems applications • Decision support systems • Data analytics Additional Keywords and Phrases: smart health, wearables, segmented mixed-effects regression, human-in-the-loop method, explainability, interpretable modeling 1 INTRODUCTION Wearables are smart electronic devices that bring the ability to continuously record, monitor, and analyze key health indicators of individuals as they go about their lives. The global market share for wearables is forecasted to be $33 billion by 2025, with an average growth rate of 15% each year [1]. Wearable devices have been instrumental in transforming personalized medicine and individual health monitoring practices. The number of wearables-based research has been steadily increasing each year; a trend that is expected to continue in the future [2]. While wearables are primarily wrist-worn sensors that continuously track elementary wellness indicators such as sleep, heart rate, activity, newer types of wearables including implantables, chest-worn sensors, head-mounted displays, smart jewelry, smart clothing could be capable of capturing a lot more health indicators such as physiological stress, respiration rate, blood alcohol, blood sugar, and other vitals in a non-invasive way. While many studies employ wearables data for predictive modeling such as fall detection or predicting 30-day readmissions [3]–[5], there is growing interest in conducting natural and quasi-natural experiments to study human health and its relationship with externalities [2], [6], [7]. Health indicators Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). © 2022 Copyright held by the owner/author(s). 2158-656X/2022/1-ART1 $15.00 http://dx.doi.org/10.1145/3564276 ACM Trans. Manage. Inf. Syst. from wearables can be combined with other data sources such as electronic health records and social media to better understand dynamics between human health and society [8]–[10]. Wearables are capable of continuously monitoring and recording health indicators at fine granularity for multiple participants simultaneously. However, the resulting repeated measures data poses data quality and analytical challenges such as missing values, high dimensionality, lagged effects, and clustered errors [11], [12]. Existing glass-box models such as mixed-effects regression are often insufficient for accurately representing higher order associations between variables in wearables data [13]. For example, including a second-order term in the mixed-effects regression model can show that a second-order effect of an input is statistically significant, but it is difficult to explain the functional nature of the second-order association from the quadratic expression alone. That is, it would be interesting to learn the extrema (i.e., maxima/minima) of the second-order effect and how the outcome varies for unit change in the input around such a point. Furthermore, establishing such a functional relationship using visualizations alone can be challenging for wearables data with repeated measures per user and heterogeneity across users. Therefore, there is a need for novel interpretable methods for explaining higher-order associative patterns in wearables data. Segmented regression, also known as broken-stick or piece-wise regression, is an explanatory modeling approach where input variable(s) of interest are partitioned or segmented into intervals followed by fitting straight lines for each interval in the regression model. It is commonly used in design science research applications with a hypothesized curvilinear relationship in simple regression models [14], [15]. The most critical challenge in fitting a segmented model is the determination of input value(s) at which segments need to be separated, a problem also known as breakpoint or change point determination. Studies that use segmented regression with mixed effects [16] determine change points using either ad-hoc or black-box procedures [13], [17]. These methods either tend to have low external validity or overfit and are therefore unsuitable for making inferences. To overcome these shortcomings, we propose a new method for determining change points for input segments in a mixed effects regression. Our method is based on the human-in-the-loop (HITL) paradigm as it uses human inputs during the model training process. Our method repurposes the smooth functions generated by a Generalized Additive Mixed Model (GAMM) to allow the analyst to set initial estimates of the change points through visual inspection. Following which, a fast robust root-finding algorithm called Brent’s method is used to precisely locate each change point iteratively by maximizing the mixed-effects model’s the Akaike Information Criteria (AIC) [18]. In this way, our method takes advantage of human inputs to fit segmented models that can accurately represent higher order associations between variables in wearables data. We evaluate our method on three real world datasets. Our method consistently outperforms existing approaches for all the three datasets. The models developed using our method enable clear interpretations of four higher order associations across the datasets. Inferences from our analysis have multiple managerial implications. 2 BACKGROUND AND RELATED WORK Sensor-based content is among the key characteristics of third generation Business Intelligence and Analytics (BI&A) applications [19]. Wearables offer the unique opportunity to observe and physiological changes in individuals through measurement of activity, heart rate, body temperature and health ACM Trans. Manage. Inf. Syst. indicators. Different types of wearables are available for commercial use such as implantables, head- mounted displays, smart jewelry, smartwatches, fitness trackers and smart clothing. Out of these, the chest-worn and wrist-worn fitness trackers are most widely used in research applications [20]–[22]. During the early adoption phase of wearable technology, research was primarily directed towards sensor development and architecture [23], [24]. With more and more commercial products being introduced into the market today, research focus is shifting on wearable data analytics and associated design science research applications [25]–[27]. 2.1 Wearable data analytics Wearable data analytics (WDA) is the discovery, interpretation, and communication of meaningful patterns from large volumes of data generated by wearable devices [12]. WDA applications can be broadly classified into interpretable modeling and predictive modeling applications. Predictive modeling is the process of learning from existing data to effectively predict future unknown outcomes [28]. In the clinical setting, health vital sign parameters such as electrocardiogram (ECG), oxygen saturation (SpO2), heart rate (HR), respiratory rate and blood pressure are used to provide pre-emptive care for patients with cardiovascular diseases, renal diseases, neurological disorders and cerebrovascular disorders [20]. In the non-clinical setting, wearables-enabled predictive modeling has been employed in problem domains including ambient assisted living [5], human activity recognition [29] reality mining [30] and sports medicine [31]. Data mining, machine learning and deep learning are the most common approaches used in predictive modeling applications of WDA [21], [25]. Interpretable or explanatory modeling is the process of developing a mathematical representation of patterns in the data for explaining a hypothesized phenomenon [32]. Typically, controlled experiment- based and observational studies employing wearables for data collection use interpretable models to explain their phenomena of interest. Wearables-enabled interpretable modeling has been employed in problem domains including lifestyle modeling [27], [33], [34], environment-wellbeing modeling [13], [35], [36] and psycho-physiological stress modeling [37], [38]. Mixed effects regression is the most common glass-box model used for interpretable modeling in WDA [27], [35], [36]. In mixed-effects regression, fixed effects or the global coefficients represent the overall effects of inputs on outcomes, and the random effects or varying effects represent how these effects differ across individuals [39]. While few studies employing wearables data report linear pairwise associations [34], [37], many studies observe higher-order associations between variables. For example, [37] identify a linear association between subjective stress and physical stress measured using heart rate monitors among overweight office workers, and [34] find a linear association between number of steps measured using an activity watch and resting heart rate measured using a heart rate monitor for healthy adults. On the other hand, [40] find associations between two different measures of physical stress pNN50 and SDNN to be significant only in a limited range for surgeons in a hospital. [41] address the challenge of modeling the nonlinear association between work-related rumination and heart rate variability by dividing rumination score into low and high categories, while [13] use domain knowledge to set a change-point for segmented modeling. [42] measured heat strain using a heat exposure monitor and reported an inverted U-shaped association curve between heat strain and outdoor temperature for workers. Wearables-based studies such as the ones above ACM Trans. Manage. Inf. Syst. either hypothesize or empirically observe higher order associations. Compared to naïve approaches such as discretization of variables [41] which may lead to loss of information or just reporting first order and second order effects in regression models [42], characterizing the higher-order association using segmented modeling leads to better interpretable results. 2.2 Segmented modeling Higher order relationships between inputs and outcomes is common in information systems (IS) [14], [15]. Polynomial regression models account for higher order relationships, but they are not directly interpretable [43]. That is, first order effects and second order effects cannot be used to quantify the unit change in outcome due to a unit increase in input as in the case of a regression model with only first order effects. Segmented or piece-wise regression is a preferred approach for modeling higher order relationships, as it is easier to interpret. The primary challenge in using a segmented regression approach is the determination of change points linking the input segments. Change point determination has been studied in different problem contexts including detecting structural change in continuous values of parameters [44], interruption of time-series [45], and characterizing higher order functional relationships [46]–[48]. Common procedures to determine change points include visual inspection of pairwise plots [48], incorporating domain inputs, greedy search [46], likelihood-based estimation [47]. Fewer procedures exist for change point determination in mixed-effects models as the likelihood function of multilevel models are not directly differentiable, thus making greedy search and likelihood-based estimation more difficult [17] [46]. A maximum-likelihood estimation of a continuous functional approximation of the piece-wise linear function has been proposed as a more robust alternative to subjective/ad-hoc assignment of change points based on visualization of pairwise association plots [17]. However, this method estimates multiple change points automatically with no scope for user inputs into the estimation process. For example, manual intervention such as dropping change points at the extremities of an input distribution could avoid overfitting as well as improve interpretability. Moreover, if an automated method may fail to execute if there are too many local extrema or due to high sensitivity towards outliers. To summarize, existing procedures for determining change points in studies employing segmented models are either ad-hoc or analytically complex, leading to problems such as low external validity, overfitting, or failure in program execution. There is a need for a segmented mixed-effects modeling method that is robust, efficient, and transparent. Such a method can be helpful to better explain higher-order pairwise associations in wearables data. 2.3 Human-in-the-loop analytics methods Human-in-the-loop (HITL) analytics methods are geared towards enhancing algorithm performance by incorporating human knowledge and inputs into the modeling and program execution process. HITL is an extensive area of research that covers the intersection of computer science, cognitive science, and psychology [49], [50]. HITL can be performed at different stages of an analytics system, from data preprocessing and modeling to system implementation. Human-machine hybrid models have demonstrated superior performance in natural language and computer vision applications [50] and are being actively considered in general analytics applications [49]. For example, studies [51] have shown how initial knowledge inputs from domain experts improve downstream performance of automated machine ACM Trans. Manage. Inf. Syst. learning systems. Another effort [52] presents an iterative experimentation framework in which users repeated make changes to the ML workflow in order to improve performance. Similarly, HITL also finds applications in model design, training, testing, model optimization stages and is applicable to health analytics research involving subjective expertise and higher need for transparency [53], [54]. For instance, one study [55] incorporates physician inputs towards model parameterization for patient-specific IV fluid recommendation in sepsis treatment. A HITL approach is suitable for analyzing wearable data as human expert inputs and observation can spot technical and logical errors in the analysis at an early stage and avoid rework or erroneous conclusions. While our study does not attempt to specifically contribute to HITL methodology literature, our method is one of the first few attempts to use a HITL approach for accurately determining change points in mixed-effects modeling in the context of wearables data analytics. 3 OUR METHOD We propose a new method for capturing higher-order associative patterns in wearables data using segmented mixed-effects modeling. Our method uses a HITL approach to determine the change points in the segmented model by combining algorithmic search process with human inputs for fine-tuning. That is, our method uses the smooth function estimated by a generalized additive mixed model to allow the analyst to annotate the change point estimates followed by fine-tuning of the estimates using a constrained optimization procedure. Our method is novel since few studies have used an HITL approach to tune change point parameters for segmented modeling. It also provides a robust mechanism to capture and explain higher order relationships hypothesized in wearables data. In the rest of this section, we explain our method in detail. Consider a mixed effects model commonly used for explaining repeated measures in data from wearables as follows: ∑ ∑ (1) Equation (1) is a representation of generalized linear mixed models (GLMM) with a linear link function but any link function is applicable to our method. is value of a given health outcome for the { } observation and individual, is the fixed intercept, are coefficients for fixed effects { }, { } are J random intercepts for each individual, { } are coefficients for rando m effects { }, and is the residual error. Suppose there exists an input such that its second (or higher) order effects are significant, then Equation (1) can be represented as follows: ∑ ∑ (2) In Equation (2), denotes the value of input variable for the observation and individual. ACM Trans. Manage. Inf. Syst. As described in earlier section, segmented representations of higher order effects are more interpretable than polynomials. The input variable can be represented as the sum of segments as follows: (3) In Equation (3), { } is a set of change points defined for the input variable which is broken into P segments { }. is an indicator function equal to 1 if condition is true; otherwise it is 0. Therefore, the scalar product has value equal to when is true and is 0 otherwise. The next logical step is to estimate number of change points and their positions. We propose a HITL method to estimate the change points as follows. As the first step, we fit a Generalized Additive Mixed Model (GAMM) [56] with a given input as a non-parametric spline as shown below: ∑ ∑ (4) Though, a non-parametric spline can be included for the corresponding random effect of input , it is computationally more expensive for fitting the corresponding semi-parametric model. We empirically tested on multiple datasets and observed the shape of the component smooth function to not be sensitive to random effects as smooth functions. Therefore, we consider the smooth function only in the fixed effects. In the next step, we visualize the plot of the smooth function in Equation (4) approximated as a B-spline [57]. Here, a human input is required to identify the order of the curve by inspecting the number of extrema (i.e., minima and maxima) to set the value of . The setting of can be based upon visual inspection as well as prior domain knowledge. For example, in Figure 1, the values for the different scenarios (a)-(d) are chosen as , , , and respectively. This step also determines whether to opt for a segmented model over a linear model, by inspecting the curvilinear nature of the component smooth function. For instance, though we set =1 for scenario illustrated in Figure 1(c), an analyst may also approximate the monotonically increasing curve as a linear function in this case; thus favoring simplicity over slightly better model fit. The value of the maxima and minima are used as starting points in a linear search algorithm in the third step. The third step involves iteratively performing search for change points using Brent’s method [58], a linear optimization with box constraints. The Brent’s method is a hybrid root-finding algorithm combining the bisection method, secant method and the inverse quadratic interpolation that make it robust and highly efficient while incorporating box constraints (i.e., range of permitted values) [59]. For each iteration of the optimizer, a mixed model shown with segmented inputs for is fit for a particular change point parameter. The algorithm returns the change point parameter corresponding to the model with minimum Akaike information criteria (AIC). Since the Brent’s method is a single parameter estimator, we identify the change points sequentially by repeating the search algorithm for each subsequent change point after fixing values of previously determined change points. Finally, the segmented mixed effects model is fit as shown below: ACM Trans. Manage. Inf. Syst. ∑ ( ) ∑ ∑ (5) In Equation (5), i s a set of segments constructed using change points identified for input . Significance of the effect of input variable, , at each segment, , can be determined by inspecting the corresponding fixed effects coefficient, , under regular conditions. The algorithm for our method is shown in Table 1. Table 1: Human-in-the-loop method for segmented mixed-effects modeling Input: Mixed effects model with significant higher-order coefficients for input variable, 1: Fit a Generalized Additive Mixed Model (GAMM) with input represented as a non-parametric spline (Equation (1)). 2: Inspect the component smooth function plot to identify number of change points , starting points and box constraints { } for corresponding change points . 3: For , compute as follows: ∑ ∑ For , fix values of { } in model to compute as follows: 4: ( ) ( ) ( ) ∑ ∑ Output: Segmented model with change points as shown in Equation (5). 4 EXPERIMENTAL SETUP To demonstrate the utility and effectiveness of our method, we apply it to model high-order pairwise relationships in four different applications across three real-world wearable datasets – WellbuiltforWellbeing, HospitalMonitoring, and BeerCrawl. We describe each of the datasets in the next sub-section followed by analysis and findings. ACM Trans. Manage. Inf. Syst. 4.1 Data 4.1.1 WellbuiltforWellbeing The Wellbuilt-for-Wellbeing (WB2) project [60] was a sixteen-month multi-phase field study funded by the US General Services Administration to better understand the influence of the office environment on human health, comfort and performance. In the study, self-described healthy adult workers involved in a variety of office-based roles for the U.S. government were recruited across four federal office buildings across the country. Participants wore two sensors for three days while carrying out their day-to-day activities, a heart and physical activity monitor, and a personal environment quality sensor-based device. The study also included experience sampling mobile surveys to collect individuals’ perceived psychological responses at periodic intervals of one to two hours. Post-processing, the dataset contained around 3000 hours of wearables data with wide range of variables from 231 participants. More details about the field study setup and variables can be found in [6]. For our study, we analyze two pairwise associations from this dataset, that of ambient sound level and heart rate variability, and that of instantaneous activity and heart rate variability. Heart rate variability (HRV) is the variability between heart beats and is considered as a proxy measure for the physiological wellbeing of a person, i.e., the higher the variability, the higher the physical wellbeing [61]. Among different HRV measures, the mean of standard deviation for all successive R-R intervals (SDNN) measured in milliseconds reflects the overall activity in the autonomous nervous system and is widely used as an indicator of better health and wellbeing [62]. Physical activity levels were assessed in g (i.e., 1 unit of gravitational force) from a triaxial accelerometer sensor and sound levels were measured in dBA (i.e., decibel weighted according to human ear hearing) using separate neck-worn sensors. The raw data from multiple wearables were aggregated at 5-minute intervals to be integrated with the heart rate monitor sensor used for computing SDNN [63]. We consider appropriate covariates in our models including person-level fixed effects (i.e., Age, Gender, BMI, worktype), time of the day, day of the week after closely examining all the variables collected in the WB2 project. 4.1.2 HospitalMonitoring The HospitalMonitoring dataset contains vital signs data recorded from patients undergoing anesthesia at the Royal Adelaide Hospital [64]. It is publicly accessible from the University of Queensland website . Data was collected for 32 cases using multiple wearables and stationary sensors including the electrocardiograph (ECG), pulse oximeter, capnograph, noninvasive arterial blood pressure monitor, airway flow, pressure monitor, Y-piece spirometer, electroencephalogram (EMG) monitor, and arterial blood pressure monitor [64]. The processed data is aggregated at 1-second intervals and has 51 variables out of which we select a meaningful subset for our analysis. For our study, we analyze the pairwise association between ST-segment index and airway respiratory rate. In electrocardiography, the ST segment connects the QRS complex and the T wave and its depression or elevation is related to acute cardiovascular conditions including myocardial ischemia, infarction, and arrhythmia [65]. The human respiratory rate is measured by counting the number of breaths per minute with typical values ranging from 12 to 16 for a healthy adult. Respiration rate has been related to HospitalMonitoring dataset URL: https://outbox.eait.uq.edu.au/uqdliu3/uqvitalsignsdataset/index.html ACM Trans. Manage. Inf. Syst. abnormalities in oxygen saturation, aging, cardiovascular diseases, and has been widely adopted as part of early warning systems [66]. In our model, we consider heart rate, oxygen saturation (i.e., SPO2), and perfusion index as covariates after examining collinearity and cross-correlations among all features and potential confounding effects. 4.1.3 BeerCrawl The BeerCrawl dataset contains blood alcohol content and movement information recorded in a field study by [4]. It is publicly accessible from the University of California Irvine (UCI) dataset repository . Transdermal alcohol content (TAC) is measured using an ankle bracelet wearable while the movement data was captured using raw accelerometer readings from mobile phones for 20 students participating in an annual college bar crawl event. While the TAC data was sampled every 30 minutes, accelerator readings were available at a more granular level leading to over 30M samples across participants. TAC has been shown to be a more reliable indicator of sustained alcohol use as compared to self-reporting [67]. For our study, we analyze the pairwise association between the raw z-axis readings from the tri-axial accelerometer and TAC values in the dataset. Prior studies have proposed several features using accelerometer readings that are related to a person’s gait, activities, and wellbeing with z-axis values contributing significantly to their variability [4], [68]. The units of TAC and accelerometer are g/dl and m/s respectively. In our model, we consider the x and y axis co-ordinate values as covariates. The accelerometer is mapped to the TAC monitor readings at the minute level. Table 2: Summary of real-world wearables datasets used in our study Size (Row, Input statistics (Mean, Dataset Accessibility Outcome Input Columns) SD, Min, Max) WellbuiltforWellbeing (31557, 7) Not public SDNN Sound (Sound level in (50.24, 8.69, [60] (Heart rate dBA) 0.00, 87.80) variability) Activity (Activity (0.1738, 0.3187, level in g) 0.0000, 3.0000) HospitalMonitoring (83861, 4) Public RR ST2 (ST segment (0.0635, 0.3969, [64] (Respiration index) -1.1000, 1.3000) rate) BeerCrawl [4] (8273, 3) Public TAC (Blood z (z-axis of (0.0503, 0.1840, alcohol accelerometer) -0.6705, 0.6676) level) Table 2 shows a summary of the datasets used in our study. The datasets and respective input-outcome associations were chosen to demonstrate variety of scenarios with higher-order pair-wise associations across different problem domains. 4.2 Data pre-processing and model assumptions Training and test samples were partitioned in a 75:25 ratio for performance evaluation. A variance component structure for the covariance matrix of the random effects coefficients is assumed in the mixed- effects regression models. The input variable of interest was included as a fixed effect as well as random effect in the model. We compared models having uncorrelated residual errors with counterparts with BeerCrawl dataset URL: https://archive.ics.uci.edu/ml/datasets/Bar+Crawl%3A+Detecting+Heavy+Drinking ACM Trans. Manage. Inf. Syst. autoregressive error residuals and we observed that the model fit did not improve significantly after controlling for temporal correlations. We also compared different error distributions and found the normal distribution to be most suitable. Hence our model is represented as Equation (1) without any special link function or autocorrelation terms. 4.3 Results We fit mixed-effects models using the three datasets for explaining the following four input-outcome associations – Sound-SDNN, Activity-SDNN, ST2-RR, and z-TAC. The corresponding component smooth functions for the partial effects from GAMM are shown in Figure 1. While the smooth functions for Sound- SDNN and Activity-SDNN approximate second order and third order polynomial curves, smooth functions of ST2-RR, and z-TAC indicate a higher-order curve. Figure 1: Smooth function plots generated using GAMM for (a) Sound-SDNN; (b) Activity-SDNN; (c) ST2-RR; and (d) z-TAC ACM Trans. Manage. Inf. Syst. As part of the HITL step, we made the following model interventions. Figure 1(a) shows a maxima around 50-60 dBA range, while Figure 1(b) shows maximal range between 0.2 and 1 followed by a decrease in the smooth function until it reaches a minima at around 2.5, after which the function again turns upwards. The extrema in Figure 1(c) are more subtle with confidence interval being most conservative around ST2=0.0. Figure 1(d) clearly depicts three maxima and two minima within the main range while a minima exists around z=0.6 which may be ignored due its extreme right position in the input distribution. Based on these observations of the smooth functions, we chose , , , and as number of change points for associations Sound-SDNN, Activity-SDNN, ST2-RR, and z-TAC respectively. Through visual inspection, we set initial change point estimates as well as box constraints for each of the pairwise association. We compared the prediction performance of mixed-effects models fitted using our method with following benchmarks: (i) model with inputs as first-order effects (i.e., linear), (ii) model with inputs as first-order and second-order effects (i.e., curvilinear), (iii) segmented inputs using change points identified by the maximum likelihood (ML) method [17], and (iv) segmented inputs using visually identified change points (i.e., visual). The fixed effects model was used as a baseline, representing the case when only fixed effects are considered in the mixed-effects regression. The model with linear inputs (i.e., first order effects only) is a benchmark that emphasizes simplicity over better model fit through capture of higher order associations. The curvilinear model is more commonly used in prior literature as it improves model fit when compared to linear effects only model, but at the cost of lower interpretability of its higher order coefficients. The segmented models with change points determined visually or using the fully automated maximum likelihood (ML) approach can be considered as the state-of-the-art approaches for segmented mixed-effects modeling for capturing higher order associations. Other methods in machine learning or statistical modeling such as multivariate adaptive regression splines or non-parametric analysis are not considered as their objective of capturing higher order patterns is not primarily towards explanatory modeling, but towards making predictions. We performed repeated 4-fold cross-validation (CV) with 10 iterations (i.e., 40 runs in total) for evaluating the performance of our method against the benchmarks across the three datasets. Table 3 shows the mean and standard deviation (in parenthesis) of performance metrics - R-squared, Root Mean Squared error (RMSE), and Mean Absolute Prediction Error (MAPE) of our method and benchmarks across the three datasets and four higher-order association scenarios. A higher value of R-squared and lower values of Root Mean Squared error (RMSE) and Mean Absolute Prediction Error (MAPE) are preferred. Statistical significance of better performance of our method over each benchmark was tested using the Tukey- Kramer pairwise comparison test on ANOVA fit over the performance metrics of our method and benchmarks. The R-squared values and error estimates for best performing models are highlighted. Our method has a statistically significant improvement in the prediction performance over existing higher- order mixed-effects modeling benchmarks for WellbuiltforWellbeing and HospitalMonitoring datasets. For the BeerCrawl dataset, our method if better than other benchmarks except the visual method, but the performance difference between the visual method and our method is not statistically significant. While the ML approach is popular for GLM models with the R package segmented [69], it is not very compatible for wearables data in terms of setting number of change points and convergence. Though the visual ACM Trans. Manage. Inf. Syst. method performs well for the BeerCrawl dataset, the complete dependence on human inputs and rationalization renders it less robust as evident from its significant lower performance in other two datasets. Table 3: Model Fit and Predictive Performance Comparison of Segmented Multilevel Models WellbuiltforWellbeing - HRV WellbuiltforWellbeing - HRV HospitalMonitoring - RR BeerCrawl - TAC (Sound) (Activity) MAP Model RMSE RMSE MAPE RMSE RMSE MAPE MAP R-sq. R-sq. R-sq. (br/mi R-sq. (ms) (%) E (%) (ms) (%) n) (g/dl) (%) 0.5168 17.78 25.81 0.5557 17.54 26.12 0.733 2.58 5.47 0.6007 0.04 70.38 Fixed (0.01)** (0.03)** (0.05)** (0.01)*** (0.02)*** (0.07)*** 5 (0.00) (0.00) (0.01) (0.00) (0.72) effects * * * (0.00) *** *** *** *** *** *** 0.5237 17.65 25.47 0.6128 16.65 24.44 0.8781 2.36 4.92 0.600 0.0377 66.54 (0.01)** (0.03)** (0.05)** (0.01)*** (0.08)*** (0.10)*** (0.00) (0.00) (0.00) 3 (0.00) (0.73) Linear * * * *** *** *** (0.01) *** *** *** 0.5876 16.83 23.78 0.6630 15.64 23.12 0.8875 2.36 4.92 0.5995 0.0377 66.56 Curvilin (0.00)** (0.04)** (0.06)** (0.01)*** (0.05)*** (0.06)*** (0.00) (0.00) (0.00) (0.01) (0.00) (0.72) ear * *** *** *** *** *** *** 0.5891 16.81 23.74 0.6719 15.70 23.01 0.8856 2.35 4.85 0.6419 0.0369 63.09 Visual (0.00) (0.04) (0.06) (0.01)*** (0.06)*** (0.07)*** (0.00) (0.00) (0.00) (0.01) (0.00)* (0.72) *** *** *** 0.5860 16.82 23.80 0.6734 15.69 23.01 0.8810 2.35 4.84 0.6011 0.0380 66.57 ML (0.00)** (0.04)** (0.06)** (0.01)*** (0.06)*** (0.07)*** (0.00) (0.00) (0.00) (0.01) (0.00) (0.72) * * *** *** *** *** *** *** 0.5900 16.80 23.72 0.7628 14.93 21.99 0.9201 2.33 4.82 0.641 0.037 63.17 Our (0.00) (0.04) (0.06) (0.01) (0.09) (0.10) (0.00) (0.00) (0.00) 2 3 (0.70) method (0.01) (0.00) Statistical significance of Tukey Kramer test for comparison of performance metrics of benchmark with HIL *** = p < .01, ** = p < 0.05, * = p < 0.1 In addition to better model fit and prediction performance, our method is able to generate better interpretable models. The fixed effects of segmented inputs across all three datasets are shown in Table 4. In the WellbuiltforWellbeing dataset, HRV increases by 0.19 ms/dBA for Sound levels less than 51 dBA indicating that physical wellbeing improves with every unit increase in sound in quiet environments. While higher activity levels such as brisk walking (i.e., Activity >= 0.9) decreases SDNN by 15.21 ms/g, moderate walking speed (0.9 > Activity >= 0.21) is related to steep increase of 90.49 ms/g in SDNN. Assuming brisk walking is confounded by intention to reach meetings in time and therefore related to a higher stress, the steep gradient in the moderate walking range underscores the value of intermittent low- level activity on physiological wellbeing in office spaces. In the HospitalMonitoring dataset, we observe that unit increase in ST-segment index beyond -0.18 is related to increase in respiration rate by 2.28 breath/minute. This finding underscores the importance of looking for elevated values of the ST-segment as reported in medical literature [65]. It also hints at a possible association between acute cardiovascular events and elevation in respiration rate in bed-ridden patients. In the BeerCrawl dataset, the z-axis indicates acceleration perpendicular to the phone screen by participants. While experimental research is required to understand the precise interpretations of the coefficients identified in our study, our study does uncover significant associative patterns between z-axis values and blood alcohol levels across different z- ACM Trans. Manage. Inf. Syst. axis range segments. Higher inebriety is associated with increase in acceleration at upper range of z (i.e., 0.41 > z >= -0.01) and associated with decrease in lower range of z (i.e., -0.01 > z >= -0.15). Table 4 also shows that the above interpretable pairwise associative patterns are not evident from linear and curvilinear inputs in the mixed-effects models. Table 4: Fixed effects of segmented, linear and curvilinear models across all three datasets Coefficient (SE) Dataset / Outcomes Input Segmented Linear Curvilinear Sound 0.07 (0.02) *** 0.09 (0.02) *** Sound -0.01 (0.00) *** *** Sound < 51 0.19 (0.04) Sound >= 51 -0.01 (0.03) WellbuiltforWellbeing / Activity 18.19 (1.27) *** 46.09 (1.68) *** SDNN Activity -43.39 (1.64) *** *** Activity < 0.21 9.00 (1.41) 0.9 > Activity >= 0.21 90.49 (3.87) *** Activity >= 0.9 -15.21 (1.46) *** ** ST2 1.63 (0.85) 1.75 (0.85) *** ST2 -0.44 (-4.09) HospitalMonitoring / RR ST2 < -0.18 1.51 (1.92) ST2 >=-0.18 2.28 (0.95) ** z -0.0073 (0.0104) -0.0061 (0.0098) z -0.026 (0.0102) ** z < -0.44 -0.0246 (0.0205) -0.2 > z >= -0.44 -0.0318 (0.0167) BeerCrawl / TAC -0.15 > z >= -0.2 -0.0340 (0.0390) *** -0.01 > z >= -0.15 -0.2014 (0.0734) *** 0.41 > z >= -0.01 0.0857 (0.0279) z >= 0.41 0.0030 (0.0182) *** = p < .01, ** = p < .05 Figure 2 shows a visual representation of the segmented model coefficients compared to coefficients from linear and curvilinear models. Co-incidentally, the shapes of the piecewise relationships for each of the pairwise association resembles the corresponding smooth functions shown in Figure 1 reinstating the importance of our HITL approach to train robust interpretable segmented inputs in the mixed-effects models. 5 DISCUSSION AND CONCLUSION In this study, we presented the problem of developing an interpretable model that captures piecewise pairwise associations between different modalities captured by wearables. Since existing methods for segmented modeling for mixed-effect regression are insufficient to determine robust and verifiable change point, our method is timely with increasing research applications utilizing wearables in a natural experimental setup. Our method involves the inspection of smooth functions of pairwise associations captured using GAMM, followed by using the Brent’s method to sequentially position change points optimizing model fit. Our method not only uses analytical tools to determine change points, but also utilizes user discretion to control the number of change points and its localization. For example, it is often desirable to avoid change points at extremities as data ACM Trans. Manage. Inf. Syst. corresponding to these segments may be very sparse, rendering inference unreliable. We apply our method to three different wearables datasets and show that not only is it effective in terms of improving model fit and prediction performance, but also significantly enhances model interpretability and ability to derive meaningful inferences. Figure 2: Trajectory of linear, curvilinear and segmented fixed effects coefficients for (a) Sound-SDNN; (b) Activity- SDNN; (c) ST2-RR; and (d) z-TAC 5.1 Managerial implications Our method and analysis have several managerial implications. Our study provides a novel tool to analyze wearables data, thus boosting the value for storing and processing of large amounts of big data generated by wearables. Our HITL-based segmented modeling method can be used in a wide range of wearables applications such as patient monitoring systems, military fitness management programs, smart diet applications, COVID-19 contact tracing, etc. Our analysis over the three wearables datasets present interesting pairwise associations. The positive relationship between sound level and physical wellbeing measure below the range of 51 dBA informs workplace design practices on the need for further examination of sound levels effects on employee health for different sound level ranges. A higher gradient of activity-wellbeing relationship in the lower range of activity provides additional empirical evidence on the value of low-intensity/intermittent activities on elevating instantaneous stress and improving wellbeing. The significant association between cardiovascular wellness measure and respiration rate after a certain threshold of the ST segment index solicits clinical researchers to further examine inter- relationships between pulmonary and cardiovascular wellness indices to improve on existing hospital ACM Trans. Manage. Inf. Syst. monitoring and early warning systems. Finally, the association between a dimension of raw accelerometer data and extrinsic phenomena such as alcohol consumption stresses the value of looking raw data in addition to expert-engineered features such as gait variability and number of continuous steps. 5.2 Contribution to IS research Predictive modeling and statistical modeling in analytics go side-by-side as one predicts the future using existing data, focusing on informing us on the question What will be, while the other explicates hidden patterns and tells us about What is with respect to a phenomenon. Both of them are important and require attention to optimize the utility of the generated data. As the number of wearable technology- based applications increases in future, the quantum of available data to analyze will exponentially increase and warrant for more and more advancements in explainable modeling for meaningful interpretations of patterns. In this study, we introduce a new method to address the design challenge of representing non- linear associative patterns in wearables data. Our contribution is timely in IS research, as the discipline is widening its scope in design science as well as explanatory modeling applications by using novel data sources such as wearables [70]. WDA is a promising area in IS [5], [19], opening a wide range of research applications owing to the following two reasons; the ubiquitous nature of wearables in today’s lifestyle, and the promise of wearables to generate rich, personalized, temporal and highly grained information content. We therefore posit that our contributions through a novel interpretable modeling method for addressing challenges in WDA lays the foundation for promising research in IS using data generated from wearables. 5.3 Limitations There are some caveats and limitations to our study. We have focused on the design science problem of developing an interpretable modeling method, but do not delve into the subject of determining the significance of input variables themselves. Also, our method by itself does not imply causation though it can be applied to any explanatory modeling scenario including causal or quasi-causal experimental settings. If curvilinear effects are absent, the segmented modeling approach should be avoided to prevent over-fitting. The modeling approach described in this study is useful when higher order association is pre-determined between pairs of repeated measures and there is a need to better explain these associations for making inferences. For high-dimensional large datasets, GAMM can take longer time to fit, and the change point optimization can be tedious for the analyst. A few ways to avoid this problem are to apply feature selection, variable transformation, and outlier detection procedures before examining pairwise associations using our method. Next, our HITL approach involves human inputs, and therefore, may still be susceptible to human errors and biases, despite the fine- tuning step using the optimization procedure. One way of reducing such potential errors is to consult domain experts post determination of change point from the optimization procedure. Finally, it is worth noting that our method caters to the problem of improving interpretability of glass-box models, at the cost of increased bias and limited predictive power when compared to black-box data mining models [28]. 5.4 Conclusion With the increasing availability of wearables, we can measure and understand different health phenomena at a highly granular level. We propose a human-in-the-loop method for accurate estimation of change points in ACM Trans. Manage. Inf. Syst. segmented mixed-effects regression facilitating the interpretations of pairwise associations of variables in wearables data. Our method is robust, efficient and the resultant segmented models provide better prediction accuracy than state-of-the-art alternatives for given problem. Our proposed method is empirically validated, more reliable due to human verification, and provides better interpretable results. Our approach can be generalized to other areas of IS where non-linear pairwise associations are anticipated. REFERENCES [1] GlobalNewsWire, Smart Wearables Market and Wearable Apps Mar ket Global, 2021. https://www.globenewswire.com/news - release/2021/12/15/2352316/0/en/Smart-Wearables-Market-and-Wearable-Apps-Market-Global-Share-Insights-2021-Top-Countries-Data-Future- Growth-Developments-Impact-of-Covid-19-on-Industry-Size-Production-Cost-Value-V.html (accessed Jan. 11, 2022). [2] I. Bardhan, H. Chen, and E. Karahanna, Connecti ng systems, data, and people: A multidisciplinary research roadmap for chronic disease management, MIS Quarterly, vol. 44, no. March, pp. 185–200, 2020. [3] HeLuo, LiuHongyan, YangYinghui, and WangBei, A Multi -attention Collaborative Deep Learning Approach for Blood Pressure Prediction, ACM Transactions on Management Information Systems (TMIS), vol. 13, no. 2, pp. 1–20, Oct. 2021, doi: 10.1145/3471571. [4] J. A. Killian, K. M. Passino, A. Nandi, D. R. Madden, and J. Clapp, Learning to detect heavy drinking episodes using smartphone ac celerometer data, in CEUR Workshop Proceedings, 2019, vol. 2429. [5] H. Zhu, S. Samtani, R. A. Brown, and H. Chen, A deep lea rning approach for recognizing activity of daily living (adl) for senior care: Exploiting interaction dependency and temporal patterns, MIS Quarterly, vol. 45, no. 2, 2021, doi: 10.25300/MISQ/2021/15574. [6] C. M. Lindberg et al., Effects of office workstation type on physical activity and stress, Occup Environ Med, vol. 75, no. 10, pp. 689–695, 2018, doi: 10.1136/oemed-2018-105077. [7] E. Smets et al., Large -scale wearable data reveal digital phenotypes for daily-life stress detection, npj Digital Medicine 2018 1:1, vol. 1, no. 1, pp. 1–10, Dec. 2018, doi: 10.1038/s41746-018-0074-9. [8] W. Zhang and S. Ram, A comprehensive analysis of triggers a nd risk factors for asthma based on machine learning and large he terogeneous data sources, MIS Quarterly, vol. 44, no. 1, pp. 305–349, 2020, doi: 10.25300/MISQ/2020/15106. [9] C. C. Yang, G. Leroy, and S. Ananiadou, Smart health and wellbeing, ACM Trans Manag Inf Syst, vol. 4, no. 4, 2013, doi: 10.1145/2555810.2555811. [10] GuKang, VosoughiSoroush, and PrioleauTemiloluwa, Sympto mID: A Framework for Rapid Symptom Identification in Pandemics Using News Reports, ACM Transactions on Management Information Systems (TMIS), vol. 12, no. 4, pp. 1–17, Sep. 2021, doi: 10.1145/3462441. [11] I. Azodo, R. Williams, A. Sheikh, and K. Cresswell, Opport unities and Challenges Surrounding the Use of Data From Wearable S ensor Devices in Health Care: Qualitative Interview Study, J Med Internet Res 2020;22(10):e19542 https://www.jmir.org/2020/10/e19542, vol. 22, no. 10, p. e19542, Oct. 2020, doi: 10.2196/19542. [12] M. Uddin and S. Syed‐abdul, Data Analytics and Appli cations of the Wearable Sensors in Healthcare: An Overview, Sensors (Basel), vol. 20, no. 5, Mar. 2020, doi: 10.3390/S20051379. [13] U. Kraus et al., Individual daytime noise exposure during routine activiti es and heart rate variability in adults: a repeated measures stud y, Environ Health Perspect, vol. 121, no. 5, pp. 607–612, 2013. [14] B. Q. Liu and D. L. Goodhue, Two worlds of trust for pot ential e-commerce users: Humans as cognitive misers, Information Systems Research, vol. 23, no. 4, pp. 1246–1262, 2012. [15] G. Pant and P. Srinivasan, Predicting Web Page Status, Information Systems Research, vol. 21, no. 2, pp. 345–364, Jun. 2010, doi: 10.1287/isre.1080.0231. [16] J. D. Singer and J. B. Willett, Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. New York: Oxford University Press, 2009. doi: 10.1093/acprof:oso/9780195152968.001.0001. [17] V. M. Muggeo, D. C. Atkins, R. J. Gallop, and S. Dimidjian, Segmented mixed models with random ch angepoints: a maximum likelihood approach with application to treatment for depression study, Stat Modelling, vol. 14, pp. 293–313, 2014. [18] H. Bozdogan, Model selection and Akaike’s Information C riterion (AIC): The general theory and its analytical extensions, Psychometrika, vol. 52, no. 3, p. 3450370, 1987. [19] H. Chen, R. H. L. Chiang, and V. C. Storey, Business In telligence and Analytics: From Big Data To Big Impact, MIS Quarterly, vol. 36, no. 4, pp. 1165–1188, 2012. [20] M. Chan, D. Estève, J.-Y. Fourniols, C. Escriba, and E. Campo, Smart wear able systems: Current status and future challenges, Artif Intell Med, vol. 56, no. 3, pp. 137–156, 2012. [21] H. Banaee, M. U. Ahmed, and A. Loutfi, Data mining for wearable sensors in health monitoring systems: A review of recent trends and challenges, Sensors, vol. 13, pp. 17472–17500, 2013. [22] S. Majumder, T. Mondal, and M. Deen, Wearable Sensors for Remote Health Monitoring, Sensors, 2017. [23] I. Yamada and G. Lopez, Wearable sensing systems for heal thcare monitoring, in 2012 Symposium on VLSI Technology (VLSIT), Jun. 2012, pp. 5–10. ACM Trans. Manage. Inf. Syst. [24] K. Malhi, S. C. Mukhopadhyay, J. Schnepper, M. Haefke, and H. Ewald, A zigbee -based wearable physiological parameters monitoring system, IEEE Sens J, 2012. [25] D. Ravi, C. Wong, B. Lo, and G.-Z. Yang, A Deep Learning Approach to on -Node Sensor Data Analytics for Mobile or Wearable Devices, IEEE J Biomed Health Inform, 2017. [26] A. Sano et al., Identifying Objective Ph ysiological Markers and Modifiable Behaviors for Self-Reported Stress and Mental Health Status Using Wearable Sensors and Mobile Phones: Observational Study, J Med Internet Res, vol. 20, no. 6, pp. 210–216, 2018. [27] J. B. Wang et al., Wearable Sensor/Device (Fitbit One) and SMS Text-Messaging Prompts to Increase Physical Activity in Overweight and Obese Adults: A Randomized Controlled Trial, Telemedicine and e-Health, vol. 21, no. 10, pp. 782–792, Oct. 2015. [28] G. Shmueli, To Explain or to Predict?, Statistical science, vol. 25, no. 3, pp. 289–310, 2010. [29] H. Zhu, H. Chen, and R. Brown, A sequence -to-sequence model-based deep learning approach for recognizing activity of daily living for senior care, J Biomed Inform, vol. 84, pp. 148–158, Aug. 2018. [30] A. Pentland, D. Lazer, D. Brewer, and T. Heibeck, Using reality min ing to improve public health and medicine, 2009. [31] S. Guillén, M. T. Arredondo, and E. Castellano, A s urvey of commercial wearable systems for sport application, in Wearable Monitoring Systems, [32] L. Breiman, Statistical Modeling: The Two Cultures, Statistical Science, 2001. [33] M. N. Jarczok et al., Autonomic nervous system activity and workplace stressors -A systematic review, Neuroscience and Biobehavioral Reviews, vol. 37, no. 8. 2013. [34] X. Li et al., Digital Health: Tracking Physiomes and Activity Using Wearable Biosensors Reveals Useful Health-Related Information, PLoS Biol, [35] P. MacNaughton, J. Spengler, J. Vallarino, S. Santanam, U. Satish, and J. Allen, Environmental perceptions an d health before and after relocation to a green building, Build Environ, vol. 104, pp. 138–144, 2016. [36] J. F. Thayer et al., Effects of the physical work environment on physiological measures of stress, European journal of cardiovascular prevention and rehabilitation, vol. 17, no. 4, pp. 431–9, Aug. 2010. [37] T. Föhr et al., Subjective stress, objective heart rate variability -based stress, and recovery on workdays among overweight and psychologically distressed individuals: a cross-sectional study, Journal of Occupational Medicine and Toxicology, 2015. [38] T. Pereira, P. R. Almeida, J. P. S. Cunha, and A. Aguiar, Heart rate variability metrics for fine -grained stress level assessment, Comput Methods Programs Biomed, vol. 148, pp. 71–80, Sep. 2017. [39] A. Gelman and J. Hill, Data Analysis Using Regression and Multilevel/Hierarchical Models, 1st ed. New York, NY: Cambridge University Press, 2006. doi: 10.1017/CBO9780511790942. [40] G. Pimentel, S. Rodrigues, P. A. Silva, A. Vilarinho, R. Vaz, and J. P. Silva Cunha, A wearable approach for i ntraoperative physiological stress monitoring of multiple cooperative surgeons, Int J Med Inform, vol. 129, 2019, doi: 10.1016/j.ijmedinf.2019.05.028. [41] M. Cropley et al., The association between work -related rumination and heart rate variability: A field study, Front Hum Neurosci, vol. 11, Jan. 2017, doi: 10.3389/FNHUM.2017.00027/ABSTRACT. [42] J. D. Runkle, C. Cui, C. Fuhrmann, S. Stevens, J. del Pinal, and M. M. Sugg, Evaluation of wearable sensors for ph ysiologic monitoring of individually experienced temperatures in outdoor workers in southeastern U.S., Environ Int, vol. 129, 2019, doi: 10.1016/j.envint.2019.05.026. [43] M. Durban, J. Harezlak, M. P. Wand, and R. J. Carroll, Simple fitting of subject -specific curves for longitudinal data, Stat Med, vol. 24, no. 8, pp. 1153–1167, 2005. [44] D. W. K. Andrews, Tests for Parameter Instability and Struc tural Change With Unknown Change Point, Econometrica, vol. 61, no. 4, 1993, doi: 10.2307/2951764. [45] A. Linden, Conducting interrupted time -series analysis for single- and multiple-group comparisons, Stata Journal, vol. 15, no. 2, 2015, doi: 10.1177/1536867x1501500208. [46] S. E. Ryan and L. S. Porth, A tutorial on the piecewise r egression approach applied to bedload transport data, USDA Forest Service - General Technical Report RMRS-GTR, no. 189 RMRS-GTR, 2007. [47] V. M. R. Muggeo, Estimating regression models with un known break-points, Stat Med, vol. 22, no. 19, pp. 3055–3071, 2003, doi: 10.1002/sim.1545. [48] X. Shuai, Z. Zhou, and R. S. Yost, Using segmented regress ion models to fit soil nutrient and soybean grain y ield changes due to liming, J Agric Biol Environ Stat, vol. 8, no. 2, pp. 240–252, 2003. [49] A. Doan, Human -in-the-loop data analysis: A personal perspective, Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2018, Jun. 2018, doi: 10.1145/3209900.3209913. [50] X. Wu, L. Xiao, Y. Sun, J. Zhang, T. Ma, and L. He, A survey of human-in-the-loop for machine learning, Future Generation Computer Systems, vol. 135, pp. 364–381, Oct. 2022, doi: 10.1016/j.future.2022.05.014. [51] Y. Gil et al., Towards h uman-guided machine learning, in International Conference on Intelligent User Interfaces, Proceedings IUI, 2019, vol. Part F147615. doi: 10.1145/3301275.3302324. [52] D. D. Xin, L. L. Ma, J. J. Liu, S. S. Macke, S. S. Song, and A. A. Parameswaran, Acceleratin g Human-in-the-loop Machine Learning: Challenges and opportunities, 2018. doi: 10.1145/3209889.3209896. [53] A. Holzinger, Interactive machine learning for health i nformatics: when do we need the human-in-the-loop?, Brain Inform, vol. 3, no. 2, 2016, doi: 10.1007/s40708-016-0042-6. ACM Trans. Manage. Inf. Syst. [54] K. Zheng, G. Chen, M. Herschel, K. Y. Ngiam, B. C. Ooi, and J. Gao, PACE: Learning Effective Task Decomposition for Human -in-the-loop Healthcare Delivery, Proceedings of the ACM SIGMOD International Conference on Management of Data, vol. 13, pp. 2156–2168, 2021, doi: 10.1145/3448016.3457281. [55] A. Gupta, M. T. Lash, and S. K. Nachimuthu, Optimal s epsis patient treatment using human -in-the-loop artificial intelligence, Expert Syst Appl, vol. 169, p. 114476, May 2021, doi: 10.1016/J.ESWA.2020.114476. [56] S. N. Wood, Stable and efficient multiple smoothing parameter estimation for generalized additive models, J Am Stat Assoc, vol. 99, no. 467, 2004, doi: 10.1198/016214504000000980. [57] J. J. Faraway, Extending the linear model with R: generalized linear, mixed effects and nonparametric regression models, 1st ed. Boca Raton, FL: CRC press, 2016. [58] R. P. Brent, Algorithms for Minimization without Derivatives, 2nd ed. Eaglewood Cliffs, N.J.: Dover publications, 2013. [59] R. Fletcher, Practical Methods of Optimization, 4th ed., vol. 53. West Sussex, England: John Wiley & Sons, 2013. [60] GSA, Wellbuilt for Wellbeing, 2021. https://www.gsa.gov/gove rnmentwide-initiatives/federal-highperformance-green-buildings/resource- library/health/wellbuilt-for-wellbeing (accessed Jan. 12, 2022). [61] F. Shaffer and J. P. Ginsberg, An Overview of Heart Rate Var iability Metrics and Norms, Front Public Health, vol. 5, no. 258, pp. 1–17, Sep. 2017, doi: 10.3389/fpubh.2017.00258. [62] L. Soares-Miranda et al., Physical Activity and Heart Rate Variability in Older Adults, Circulation, vol. 129, no. 21, pp. 2100–2110, May 2014. [63] K. Srinivasan et al., A Regularization Approach for Identifying Cumulative La gged Effects in Smart Health Applications, in Proceedings of the 7th International Conference on Digital Health, 2017, pp. 99–103. [64] D. Liu, M. Görges, and S. A. Jenkins, The University o f Queensland Vital Signs Dataset: Development of an accessible reposit ory of anesthesia patient monitoring data for research, Anesth Analg, vol. 114, no. 3, 2012, doi: 10.1213/ANE.0b013e318241f7c0. [65] B. Vogel et al., ST -segment elevation myocardial infarction, Nature Reviews Disease Primers 2019 5:1, vol. 5, no. 1, pp. 1–20, Jun. 2019, doi: 10.1038/s41572-019-0090-3. [66] G. B. Smith, D. R. Prytherch, P. Meredith, P. E. Schmidt, and P. I. Featherstone, The abil ity of the National Early Warning Score (NEWS) to discriminate patients at risk of early cardiac arrest, unanticipated intensive care unit admission, and death, Resuscitation, vol. 84, no. 4, pp. 465– 470, Apr. 2013, doi: 10.1016/J.RESUSCITATION.2012.12.016. [67] S. Kianersi, M. Luetke, J. Agley, R. Gassman, C. Ludema, and M. Rosenberg, Validation of transdermal alcohol conc entration d ata collected using wearable alcohol monitors: A systematic review and meta-analysis, Drug Alcohol Depend, vol. 216, p. 108304, Nov. 2020, doi: 10.1016/J.DRUGALCDEP.2020.108304. [68] D. Pradeep Kumar, N. Toosizadeh, J. Mohler, H. Ehsani, C. Mannier, and K. Laksari, Sensor -based characterization of daily walking: A new paradigm in pre-frailty/frailty assessment, BMC Geriatr, vol. 20, no. 1, pp. 1–11, May 2020, doi: 10.1186/S12877-020-01572-1/FIGURES/2. [69] V. M. R. Muggeo, segmented: An R package to Fit Regression M odels with Broken-Line Relationships, R News, vol. 8, no. May, 2008, doi: 10.1159/000323281. [70] A. Rai et al., Editor’s Comments: Diversity of Design Science Research, MIS Quarterly, vol. 41, pp. iii–xviii, 2017. ACM Trans. Manage. Inf. Syst. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png ACM Transactions on Management Information Systems (TMIS) Association for Computing Machinery

A Human-in-the-Loop Segmented Mixed-Effects Modeling Method for Analyzing Wearables Data

Loading next page...
 
/lp/association-for-computing-machinery/a-human-in-the-loop-segmented-mixed-effects-modeling-method-for-ANK3Fk6iHb

References

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

Publisher
Association for Computing Machinery
Copyright
Copyright © 2023 Copyright held by the owner/author(s).
ISSN
2158-656X
eISSN
2158-6578
DOI
10.1145/3564276
Publisher site
See Article on Publisher Site

Abstract

A Human-in-the-loop Segmented Mixed-effects Modeling Method For Analyzing Wearables Data Segmented Modeling For Wearable Data Analytics KARTHIK SRINIVASAN0F* School of Business, University of Kansas, Lawrence KS, U.S., karthiks@ku.edu FAIZ CURRIM Eller College of Management, University of Arizona, Tucson AZ, U.S., fcurrim@eller.arizona.edu SUDHA RAM Eller College of Management, University of Arizona, Tucson AZ, U.S., ram@eller.arizona.edu Wearables are an important source of big data as they provide real-time high-resolution data logs of health indicators of individuals. Higher-order associations between pairs of variables is common in wearables data. Representing higher-order association curves as piece-wise linear segments in a regression model makes them more interpretable. However, existing methods for identifying the change points for segmented modeling either overfit or have low external validity for wearables data containing repeated measures. Therefore, we propose a human-in-the-loop method for segmented modeling of higher-order pairwise associations between variables in wearables data. Our method uses the smooth function estimated by a generalized additive mixed model to allow the analyst to annotate change point estimates for a segmented mixed-effects model, and thereafter employs the Brent’s constrained optimization procedure to fine-tuning the manually provided estimates. We validate our method using three real-world wearables datasets. Our method not only outperforms state-of-the-art modeling methods in terms of prediction performance but also provides more interpretable results. Our study contributes to health data science in terms of developing a new method for interpretable modeling of wearables data. Our analysis uncovers interesting insights on higher order associations for health researchers. CCS CONCEPTS • Information Systems • Information systems applications • Decision support systems • Data analytics Additional Keywords and Phrases: smart health, wearables, segmented mixed-effects regression, human-in-the-loop method, explainability, interpretable modeling 1 INTRODUCTION Wearables are smart electronic devices that bring the ability to continuously record, monitor, and analyze key health indicators of individuals as they go about their lives. The global market share for wearables is forecasted to be $33 billion by 2025, with an average growth rate of 15% each year [1]. Wearable devices have been instrumental in transforming personalized medicine and individual health monitoring practices. The number of wearables-based research has been steadily increasing each year; a trend that is expected to continue in the future [2]. While wearables are primarily wrist-worn sensors that continuously track elementary wellness indicators such as sleep, heart rate, activity, newer types of wearables including implantables, chest-worn sensors, head-mounted displays, smart jewelry, smart clothing could be capable of capturing a lot more health indicators such as physiological stress, respiration rate, blood alcohol, blood sugar, and other vitals in a non-invasive way. While many studies employ wearables data for predictive modeling such as fall detection or predicting 30-day readmissions [3]–[5], there is growing interest in conducting natural and quasi-natural experiments to study human health and its relationship with externalities [2], [6], [7]. Health indicators Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). © 2022 Copyright held by the owner/author(s). 2158-656X/2022/1-ART1 $15.00 http://dx.doi.org/10.1145/3564276 ACM Trans. Manage. Inf. Syst. from wearables can be combined with other data sources such as electronic health records and social media to better understand dynamics between human health and society [8]–[10]. Wearables are capable of continuously monitoring and recording health indicators at fine granularity for multiple participants simultaneously. However, the resulting repeated measures data poses data quality and analytical challenges such as missing values, high dimensionality, lagged effects, and clustered errors [11], [12]. Existing glass-box models such as mixed-effects regression are often insufficient for accurately representing higher order associations between variables in wearables data [13]. For example, including a second-order term in the mixed-effects regression model can show that a second-order effect of an input is statistically significant, but it is difficult to explain the functional nature of the second-order association from the quadratic expression alone. That is, it would be interesting to learn the extrema (i.e., maxima/minima) of the second-order effect and how the outcome varies for unit change in the input around such a point. Furthermore, establishing such a functional relationship using visualizations alone can be challenging for wearables data with repeated measures per user and heterogeneity across users. Therefore, there is a need for novel interpretable methods for explaining higher-order associative patterns in wearables data. Segmented regression, also known as broken-stick or piece-wise regression, is an explanatory modeling approach where input variable(s) of interest are partitioned or segmented into intervals followed by fitting straight lines for each interval in the regression model. It is commonly used in design science research applications with a hypothesized curvilinear relationship in simple regression models [14], [15]. The most critical challenge in fitting a segmented model is the determination of input value(s) at which segments need to be separated, a problem also known as breakpoint or change point determination. Studies that use segmented regression with mixed effects [16] determine change points using either ad-hoc or black-box procedures [13], [17]. These methods either tend to have low external validity or overfit and are therefore unsuitable for making inferences. To overcome these shortcomings, we propose a new method for determining change points for input segments in a mixed effects regression. Our method is based on the human-in-the-loop (HITL) paradigm as it uses human inputs during the model training process. Our method repurposes the smooth functions generated by a Generalized Additive Mixed Model (GAMM) to allow the analyst to set initial estimates of the change points through visual inspection. Following which, a fast robust root-finding algorithm called Brent’s method is used to precisely locate each change point iteratively by maximizing the mixed-effects model’s the Akaike Information Criteria (AIC) [18]. In this way, our method takes advantage of human inputs to fit segmented models that can accurately represent higher order associations between variables in wearables data. We evaluate our method on three real world datasets. Our method consistently outperforms existing approaches for all the three datasets. The models developed using our method enable clear interpretations of four higher order associations across the datasets. Inferences from our analysis have multiple managerial implications. 2 BACKGROUND AND RELATED WORK Sensor-based content is among the key characteristics of third generation Business Intelligence and Analytics (BI&A) applications [19]. Wearables offer the unique opportunity to observe and physiological changes in individuals through measurement of activity, heart rate, body temperature and health ACM Trans. Manage. Inf. Syst. indicators. Different types of wearables are available for commercial use such as implantables, head- mounted displays, smart jewelry, smartwatches, fitness trackers and smart clothing. Out of these, the chest-worn and wrist-worn fitness trackers are most widely used in research applications [20]–[22]. During the early adoption phase of wearable technology, research was primarily directed towards sensor development and architecture [23], [24]. With more and more commercial products being introduced into the market today, research focus is shifting on wearable data analytics and associated design science research applications [25]–[27]. 2.1 Wearable data analytics Wearable data analytics (WDA) is the discovery, interpretation, and communication of meaningful patterns from large volumes of data generated by wearable devices [12]. WDA applications can be broadly classified into interpretable modeling and predictive modeling applications. Predictive modeling is the process of learning from existing data to effectively predict future unknown outcomes [28]. In the clinical setting, health vital sign parameters such as electrocardiogram (ECG), oxygen saturation (SpO2), heart rate (HR), respiratory rate and blood pressure are used to provide pre-emptive care for patients with cardiovascular diseases, renal diseases, neurological disorders and cerebrovascular disorders [20]. In the non-clinical setting, wearables-enabled predictive modeling has been employed in problem domains including ambient assisted living [5], human activity recognition [29] reality mining [30] and sports medicine [31]. Data mining, machine learning and deep learning are the most common approaches used in predictive modeling applications of WDA [21], [25]. Interpretable or explanatory modeling is the process of developing a mathematical representation of patterns in the data for explaining a hypothesized phenomenon [32]. Typically, controlled experiment- based and observational studies employing wearables for data collection use interpretable models to explain their phenomena of interest. Wearables-enabled interpretable modeling has been employed in problem domains including lifestyle modeling [27], [33], [34], environment-wellbeing modeling [13], [35], [36] and psycho-physiological stress modeling [37], [38]. Mixed effects regression is the most common glass-box model used for interpretable modeling in WDA [27], [35], [36]. In mixed-effects regression, fixed effects or the global coefficients represent the overall effects of inputs on outcomes, and the random effects or varying effects represent how these effects differ across individuals [39]. While few studies employing wearables data report linear pairwise associations [34], [37], many studies observe higher-order associations between variables. For example, [37] identify a linear association between subjective stress and physical stress measured using heart rate monitors among overweight office workers, and [34] find a linear association between number of steps measured using an activity watch and resting heart rate measured using a heart rate monitor for healthy adults. On the other hand, [40] find associations between two different measures of physical stress pNN50 and SDNN to be significant only in a limited range for surgeons in a hospital. [41] address the challenge of modeling the nonlinear association between work-related rumination and heart rate variability by dividing rumination score into low and high categories, while [13] use domain knowledge to set a change-point for segmented modeling. [42] measured heat strain using a heat exposure monitor and reported an inverted U-shaped association curve between heat strain and outdoor temperature for workers. Wearables-based studies such as the ones above ACM Trans. Manage. Inf. Syst. either hypothesize or empirically observe higher order associations. Compared to naïve approaches such as discretization of variables [41] which may lead to loss of information or just reporting first order and second order effects in regression models [42], characterizing the higher-order association using segmented modeling leads to better interpretable results. 2.2 Segmented modeling Higher order relationships between inputs and outcomes is common in information systems (IS) [14], [15]. Polynomial regression models account for higher order relationships, but they are not directly interpretable [43]. That is, first order effects and second order effects cannot be used to quantify the unit change in outcome due to a unit increase in input as in the case of a regression model with only first order effects. Segmented or piece-wise regression is a preferred approach for modeling higher order relationships, as it is easier to interpret. The primary challenge in using a segmented regression approach is the determination of change points linking the input segments. Change point determination has been studied in different problem contexts including detecting structural change in continuous values of parameters [44], interruption of time-series [45], and characterizing higher order functional relationships [46]–[48]. Common procedures to determine change points include visual inspection of pairwise plots [48], incorporating domain inputs, greedy search [46], likelihood-based estimation [47]. Fewer procedures exist for change point determination in mixed-effects models as the likelihood function of multilevel models are not directly differentiable, thus making greedy search and likelihood-based estimation more difficult [17] [46]. A maximum-likelihood estimation of a continuous functional approximation of the piece-wise linear function has been proposed as a more robust alternative to subjective/ad-hoc assignment of change points based on visualization of pairwise association plots [17]. However, this method estimates multiple change points automatically with no scope for user inputs into the estimation process. For example, manual intervention such as dropping change points at the extremities of an input distribution could avoid overfitting as well as improve interpretability. Moreover, if an automated method may fail to execute if there are too many local extrema or due to high sensitivity towards outliers. To summarize, existing procedures for determining change points in studies employing segmented models are either ad-hoc or analytically complex, leading to problems such as low external validity, overfitting, or failure in program execution. There is a need for a segmented mixed-effects modeling method that is robust, efficient, and transparent. Such a method can be helpful to better explain higher-order pairwise associations in wearables data. 2.3 Human-in-the-loop analytics methods Human-in-the-loop (HITL) analytics methods are geared towards enhancing algorithm performance by incorporating human knowledge and inputs into the modeling and program execution process. HITL is an extensive area of research that covers the intersection of computer science, cognitive science, and psychology [49], [50]. HITL can be performed at different stages of an analytics system, from data preprocessing and modeling to system implementation. Human-machine hybrid models have demonstrated superior performance in natural language and computer vision applications [50] and are being actively considered in general analytics applications [49]. For example, studies [51] have shown how initial knowledge inputs from domain experts improve downstream performance of automated machine ACM Trans. Manage. Inf. Syst. learning systems. Another effort [52] presents an iterative experimentation framework in which users repeated make changes to the ML workflow in order to improve performance. Similarly, HITL also finds applications in model design, training, testing, model optimization stages and is applicable to health analytics research involving subjective expertise and higher need for transparency [53], [54]. For instance, one study [55] incorporates physician inputs towards model parameterization for patient-specific IV fluid recommendation in sepsis treatment. A HITL approach is suitable for analyzing wearable data as human expert inputs and observation can spot technical and logical errors in the analysis at an early stage and avoid rework or erroneous conclusions. While our study does not attempt to specifically contribute to HITL methodology literature, our method is one of the first few attempts to use a HITL approach for accurately determining change points in mixed-effects modeling in the context of wearables data analytics. 3 OUR METHOD We propose a new method for capturing higher-order associative patterns in wearables data using segmented mixed-effects modeling. Our method uses a HITL approach to determine the change points in the segmented model by combining algorithmic search process with human inputs for fine-tuning. That is, our method uses the smooth function estimated by a generalized additive mixed model to allow the analyst to annotate the change point estimates followed by fine-tuning of the estimates using a constrained optimization procedure. Our method is novel since few studies have used an HITL approach to tune change point parameters for segmented modeling. It also provides a robust mechanism to capture and explain higher order relationships hypothesized in wearables data. In the rest of this section, we explain our method in detail. Consider a mixed effects model commonly used for explaining repeated measures in data from wearables as follows: ∑ ∑ (1) Equation (1) is a representation of generalized linear mixed models (GLMM) with a linear link function but any link function is applicable to our method. is value of a given health outcome for the { } observation and individual, is the fixed intercept, are coefficients for fixed effects { }, { } are J random intercepts for each individual, { } are coefficients for rando m effects { }, and is the residual error. Suppose there exists an input such that its second (or higher) order effects are significant, then Equation (1) can be represented as follows: ∑ ∑ (2) In Equation (2), denotes the value of input variable for the observation and individual. ACM Trans. Manage. Inf. Syst. As described in earlier section, segmented representations of higher order effects are more interpretable than polynomials. The input variable can be represented as the sum of segments as follows: (3) In Equation (3), { } is a set of change points defined for the input variable which is broken into P segments { }. is an indicator function equal to 1 if condition is true; otherwise it is 0. Therefore, the scalar product has value equal to when is true and is 0 otherwise. The next logical step is to estimate number of change points and their positions. We propose a HITL method to estimate the change points as follows. As the first step, we fit a Generalized Additive Mixed Model (GAMM) [56] with a given input as a non-parametric spline as shown below: ∑ ∑ (4) Though, a non-parametric spline can be included for the corresponding random effect of input , it is computationally more expensive for fitting the corresponding semi-parametric model. We empirically tested on multiple datasets and observed the shape of the component smooth function to not be sensitive to random effects as smooth functions. Therefore, we consider the smooth function only in the fixed effects. In the next step, we visualize the plot of the smooth function in Equation (4) approximated as a B-spline [57]. Here, a human input is required to identify the order of the curve by inspecting the number of extrema (i.e., minima and maxima) to set the value of . The setting of can be based upon visual inspection as well as prior domain knowledge. For example, in Figure 1, the values for the different scenarios (a)-(d) are chosen as , , , and respectively. This step also determines whether to opt for a segmented model over a linear model, by inspecting the curvilinear nature of the component smooth function. For instance, though we set =1 for scenario illustrated in Figure 1(c), an analyst may also approximate the monotonically increasing curve as a linear function in this case; thus favoring simplicity over slightly better model fit. The value of the maxima and minima are used as starting points in a linear search algorithm in the third step. The third step involves iteratively performing search for change points using Brent’s method [58], a linear optimization with box constraints. The Brent’s method is a hybrid root-finding algorithm combining the bisection method, secant method and the inverse quadratic interpolation that make it robust and highly efficient while incorporating box constraints (i.e., range of permitted values) [59]. For each iteration of the optimizer, a mixed model shown with segmented inputs for is fit for a particular change point parameter. The algorithm returns the change point parameter corresponding to the model with minimum Akaike information criteria (AIC). Since the Brent’s method is a single parameter estimator, we identify the change points sequentially by repeating the search algorithm for each subsequent change point after fixing values of previously determined change points. Finally, the segmented mixed effects model is fit as shown below: ACM Trans. Manage. Inf. Syst. ∑ ( ) ∑ ∑ (5) In Equation (5), i s a set of segments constructed using change points identified for input . Significance of the effect of input variable, , at each segment, , can be determined by inspecting the corresponding fixed effects coefficient, , under regular conditions. The algorithm for our method is shown in Table 1. Table 1: Human-in-the-loop method for segmented mixed-effects modeling Input: Mixed effects model with significant higher-order coefficients for input variable, 1: Fit a Generalized Additive Mixed Model (GAMM) with input represented as a non-parametric spline (Equation (1)). 2: Inspect the component smooth function plot to identify number of change points , starting points and box constraints { } for corresponding change points . 3: For , compute as follows: ∑ ∑ For , fix values of { } in model to compute as follows: 4: ( ) ( ) ( ) ∑ ∑ Output: Segmented model with change points as shown in Equation (5). 4 EXPERIMENTAL SETUP To demonstrate the utility and effectiveness of our method, we apply it to model high-order pairwise relationships in four different applications across three real-world wearable datasets – WellbuiltforWellbeing, HospitalMonitoring, and BeerCrawl. We describe each of the datasets in the next sub-section followed by analysis and findings. ACM Trans. Manage. Inf. Syst. 4.1 Data 4.1.1 WellbuiltforWellbeing The Wellbuilt-for-Wellbeing (WB2) project [60] was a sixteen-month multi-phase field study funded by the US General Services Administration to better understand the influence of the office environment on human health, comfort and performance. In the study, self-described healthy adult workers involved in a variety of office-based roles for the U.S. government were recruited across four federal office buildings across the country. Participants wore two sensors for three days while carrying out their day-to-day activities, a heart and physical activity monitor, and a personal environment quality sensor-based device. The study also included experience sampling mobile surveys to collect individuals’ perceived psychological responses at periodic intervals of one to two hours. Post-processing, the dataset contained around 3000 hours of wearables data with wide range of variables from 231 participants. More details about the field study setup and variables can be found in [6]. For our study, we analyze two pairwise associations from this dataset, that of ambient sound level and heart rate variability, and that of instantaneous activity and heart rate variability. Heart rate variability (HRV) is the variability between heart beats and is considered as a proxy measure for the physiological wellbeing of a person, i.e., the higher the variability, the higher the physical wellbeing [61]. Among different HRV measures, the mean of standard deviation for all successive R-R intervals (SDNN) measured in milliseconds reflects the overall activity in the autonomous nervous system and is widely used as an indicator of better health and wellbeing [62]. Physical activity levels were assessed in g (i.e., 1 unit of gravitational force) from a triaxial accelerometer sensor and sound levels were measured in dBA (i.e., decibel weighted according to human ear hearing) using separate neck-worn sensors. The raw data from multiple wearables were aggregated at 5-minute intervals to be integrated with the heart rate monitor sensor used for computing SDNN [63]. We consider appropriate covariates in our models including person-level fixed effects (i.e., Age, Gender, BMI, worktype), time of the day, day of the week after closely examining all the variables collected in the WB2 project. 4.1.2 HospitalMonitoring The HospitalMonitoring dataset contains vital signs data recorded from patients undergoing anesthesia at the Royal Adelaide Hospital [64]. It is publicly accessible from the University of Queensland website . Data was collected for 32 cases using multiple wearables and stationary sensors including the electrocardiograph (ECG), pulse oximeter, capnograph, noninvasive arterial blood pressure monitor, airway flow, pressure monitor, Y-piece spirometer, electroencephalogram (EMG) monitor, and arterial blood pressure monitor [64]. The processed data is aggregated at 1-second intervals and has 51 variables out of which we select a meaningful subset for our analysis. For our study, we analyze the pairwise association between ST-segment index and airway respiratory rate. In electrocardiography, the ST segment connects the QRS complex and the T wave and its depression or elevation is related to acute cardiovascular conditions including myocardial ischemia, infarction, and arrhythmia [65]. The human respiratory rate is measured by counting the number of breaths per minute with typical values ranging from 12 to 16 for a healthy adult. Respiration rate has been related to HospitalMonitoring dataset URL: https://outbox.eait.uq.edu.au/uqdliu3/uqvitalsignsdataset/index.html ACM Trans. Manage. Inf. Syst. abnormalities in oxygen saturation, aging, cardiovascular diseases, and has been widely adopted as part of early warning systems [66]. In our model, we consider heart rate, oxygen saturation (i.e., SPO2), and perfusion index as covariates after examining collinearity and cross-correlations among all features and potential confounding effects. 4.1.3 BeerCrawl The BeerCrawl dataset contains blood alcohol content and movement information recorded in a field study by [4]. It is publicly accessible from the University of California Irvine (UCI) dataset repository . Transdermal alcohol content (TAC) is measured using an ankle bracelet wearable while the movement data was captured using raw accelerometer readings from mobile phones for 20 students participating in an annual college bar crawl event. While the TAC data was sampled every 30 minutes, accelerator readings were available at a more granular level leading to over 30M samples across participants. TAC has been shown to be a more reliable indicator of sustained alcohol use as compared to self-reporting [67]. For our study, we analyze the pairwise association between the raw z-axis readings from the tri-axial accelerometer and TAC values in the dataset. Prior studies have proposed several features using accelerometer readings that are related to a person’s gait, activities, and wellbeing with z-axis values contributing significantly to their variability [4], [68]. The units of TAC and accelerometer are g/dl and m/s respectively. In our model, we consider the x and y axis co-ordinate values as covariates. The accelerometer is mapped to the TAC monitor readings at the minute level. Table 2: Summary of real-world wearables datasets used in our study Size (Row, Input statistics (Mean, Dataset Accessibility Outcome Input Columns) SD, Min, Max) WellbuiltforWellbeing (31557, 7) Not public SDNN Sound (Sound level in (50.24, 8.69, [60] (Heart rate dBA) 0.00, 87.80) variability) Activity (Activity (0.1738, 0.3187, level in g) 0.0000, 3.0000) HospitalMonitoring (83861, 4) Public RR ST2 (ST segment (0.0635, 0.3969, [64] (Respiration index) -1.1000, 1.3000) rate) BeerCrawl [4] (8273, 3) Public TAC (Blood z (z-axis of (0.0503, 0.1840, alcohol accelerometer) -0.6705, 0.6676) level) Table 2 shows a summary of the datasets used in our study. The datasets and respective input-outcome associations were chosen to demonstrate variety of scenarios with higher-order pair-wise associations across different problem domains. 4.2 Data pre-processing and model assumptions Training and test samples were partitioned in a 75:25 ratio for performance evaluation. A variance component structure for the covariance matrix of the random effects coefficients is assumed in the mixed- effects regression models. The input variable of interest was included as a fixed effect as well as random effect in the model. We compared models having uncorrelated residual errors with counterparts with BeerCrawl dataset URL: https://archive.ics.uci.edu/ml/datasets/Bar+Crawl%3A+Detecting+Heavy+Drinking ACM Trans. Manage. Inf. Syst. autoregressive error residuals and we observed that the model fit did not improve significantly after controlling for temporal correlations. We also compared different error distributions and found the normal distribution to be most suitable. Hence our model is represented as Equation (1) without any special link function or autocorrelation terms. 4.3 Results We fit mixed-effects models using the three datasets for explaining the following four input-outcome associations – Sound-SDNN, Activity-SDNN, ST2-RR, and z-TAC. The corresponding component smooth functions for the partial effects from GAMM are shown in Figure 1. While the smooth functions for Sound- SDNN and Activity-SDNN approximate second order and third order polynomial curves, smooth functions of ST2-RR, and z-TAC indicate a higher-order curve. Figure 1: Smooth function plots generated using GAMM for (a) Sound-SDNN; (b) Activity-SDNN; (c) ST2-RR; and (d) z-TAC ACM Trans. Manage. Inf. Syst. As part of the HITL step, we made the following model interventions. Figure 1(a) shows a maxima around 50-60 dBA range, while Figure 1(b) shows maximal range between 0.2 and 1 followed by a decrease in the smooth function until it reaches a minima at around 2.5, after which the function again turns upwards. The extrema in Figure 1(c) are more subtle with confidence interval being most conservative around ST2=0.0. Figure 1(d) clearly depicts three maxima and two minima within the main range while a minima exists around z=0.6 which may be ignored due its extreme right position in the input distribution. Based on these observations of the smooth functions, we chose , , , and as number of change points for associations Sound-SDNN, Activity-SDNN, ST2-RR, and z-TAC respectively. Through visual inspection, we set initial change point estimates as well as box constraints for each of the pairwise association. We compared the prediction performance of mixed-effects models fitted using our method with following benchmarks: (i) model with inputs as first-order effects (i.e., linear), (ii) model with inputs as first-order and second-order effects (i.e., curvilinear), (iii) segmented inputs using change points identified by the maximum likelihood (ML) method [17], and (iv) segmented inputs using visually identified change points (i.e., visual). The fixed effects model was used as a baseline, representing the case when only fixed effects are considered in the mixed-effects regression. The model with linear inputs (i.e., first order effects only) is a benchmark that emphasizes simplicity over better model fit through capture of higher order associations. The curvilinear model is more commonly used in prior literature as it improves model fit when compared to linear effects only model, but at the cost of lower interpretability of its higher order coefficients. The segmented models with change points determined visually or using the fully automated maximum likelihood (ML) approach can be considered as the state-of-the-art approaches for segmented mixed-effects modeling for capturing higher order associations. Other methods in machine learning or statistical modeling such as multivariate adaptive regression splines or non-parametric analysis are not considered as their objective of capturing higher order patterns is not primarily towards explanatory modeling, but towards making predictions. We performed repeated 4-fold cross-validation (CV) with 10 iterations (i.e., 40 runs in total) for evaluating the performance of our method against the benchmarks across the three datasets. Table 3 shows the mean and standard deviation (in parenthesis) of performance metrics - R-squared, Root Mean Squared error (RMSE), and Mean Absolute Prediction Error (MAPE) of our method and benchmarks across the three datasets and four higher-order association scenarios. A higher value of R-squared and lower values of Root Mean Squared error (RMSE) and Mean Absolute Prediction Error (MAPE) are preferred. Statistical significance of better performance of our method over each benchmark was tested using the Tukey- Kramer pairwise comparison test on ANOVA fit over the performance metrics of our method and benchmarks. The R-squared values and error estimates for best performing models are highlighted. Our method has a statistically significant improvement in the prediction performance over existing higher- order mixed-effects modeling benchmarks for WellbuiltforWellbeing and HospitalMonitoring datasets. For the BeerCrawl dataset, our method if better than other benchmarks except the visual method, but the performance difference between the visual method and our method is not statistically significant. While the ML approach is popular for GLM models with the R package segmented [69], it is not very compatible for wearables data in terms of setting number of change points and convergence. Though the visual ACM Trans. Manage. Inf. Syst. method performs well for the BeerCrawl dataset, the complete dependence on human inputs and rationalization renders it less robust as evident from its significant lower performance in other two datasets. Table 3: Model Fit and Predictive Performance Comparison of Segmented Multilevel Models WellbuiltforWellbeing - HRV WellbuiltforWellbeing - HRV HospitalMonitoring - RR BeerCrawl - TAC (Sound) (Activity) MAP Model RMSE RMSE MAPE RMSE RMSE MAPE MAP R-sq. R-sq. R-sq. (br/mi R-sq. (ms) (%) E (%) (ms) (%) n) (g/dl) (%) 0.5168 17.78 25.81 0.5557 17.54 26.12 0.733 2.58 5.47 0.6007 0.04 70.38 Fixed (0.01)** (0.03)** (0.05)** (0.01)*** (0.02)*** (0.07)*** 5 (0.00) (0.00) (0.01) (0.00) (0.72) effects * * * (0.00) *** *** *** *** *** *** 0.5237 17.65 25.47 0.6128 16.65 24.44 0.8781 2.36 4.92 0.600 0.0377 66.54 (0.01)** (0.03)** (0.05)** (0.01)*** (0.08)*** (0.10)*** (0.00) (0.00) (0.00) 3 (0.00) (0.73) Linear * * * *** *** *** (0.01) *** *** *** 0.5876 16.83 23.78 0.6630 15.64 23.12 0.8875 2.36 4.92 0.5995 0.0377 66.56 Curvilin (0.00)** (0.04)** (0.06)** (0.01)*** (0.05)*** (0.06)*** (0.00) (0.00) (0.00) (0.01) (0.00) (0.72) ear * *** *** *** *** *** *** 0.5891 16.81 23.74 0.6719 15.70 23.01 0.8856 2.35 4.85 0.6419 0.0369 63.09 Visual (0.00) (0.04) (0.06) (0.01)*** (0.06)*** (0.07)*** (0.00) (0.00) (0.00) (0.01) (0.00)* (0.72) *** *** *** 0.5860 16.82 23.80 0.6734 15.69 23.01 0.8810 2.35 4.84 0.6011 0.0380 66.57 ML (0.00)** (0.04)** (0.06)** (0.01)*** (0.06)*** (0.07)*** (0.00) (0.00) (0.00) (0.01) (0.00) (0.72) * * *** *** *** *** *** *** 0.5900 16.80 23.72 0.7628 14.93 21.99 0.9201 2.33 4.82 0.641 0.037 63.17 Our (0.00) (0.04) (0.06) (0.01) (0.09) (0.10) (0.00) (0.00) (0.00) 2 3 (0.70) method (0.01) (0.00) Statistical significance of Tukey Kramer test for comparison of performance metrics of benchmark with HIL *** = p < .01, ** = p < 0.05, * = p < 0.1 In addition to better model fit and prediction performance, our method is able to generate better interpretable models. The fixed effects of segmented inputs across all three datasets are shown in Table 4. In the WellbuiltforWellbeing dataset, HRV increases by 0.19 ms/dBA for Sound levels less than 51 dBA indicating that physical wellbeing improves with every unit increase in sound in quiet environments. While higher activity levels such as brisk walking (i.e., Activity >= 0.9) decreases SDNN by 15.21 ms/g, moderate walking speed (0.9 > Activity >= 0.21) is related to steep increase of 90.49 ms/g in SDNN. Assuming brisk walking is confounded by intention to reach meetings in time and therefore related to a higher stress, the steep gradient in the moderate walking range underscores the value of intermittent low- level activity on physiological wellbeing in office spaces. In the HospitalMonitoring dataset, we observe that unit increase in ST-segment index beyond -0.18 is related to increase in respiration rate by 2.28 breath/minute. This finding underscores the importance of looking for elevated values of the ST-segment as reported in medical literature [65]. It also hints at a possible association between acute cardiovascular events and elevation in respiration rate in bed-ridden patients. In the BeerCrawl dataset, the z-axis indicates acceleration perpendicular to the phone screen by participants. While experimental research is required to understand the precise interpretations of the coefficients identified in our study, our study does uncover significant associative patterns between z-axis values and blood alcohol levels across different z- ACM Trans. Manage. Inf. Syst. axis range segments. Higher inebriety is associated with increase in acceleration at upper range of z (i.e., 0.41 > z >= -0.01) and associated with decrease in lower range of z (i.e., -0.01 > z >= -0.15). Table 4 also shows that the above interpretable pairwise associative patterns are not evident from linear and curvilinear inputs in the mixed-effects models. Table 4: Fixed effects of segmented, linear and curvilinear models across all three datasets Coefficient (SE) Dataset / Outcomes Input Segmented Linear Curvilinear Sound 0.07 (0.02) *** 0.09 (0.02) *** Sound -0.01 (0.00) *** *** Sound < 51 0.19 (0.04) Sound >= 51 -0.01 (0.03) WellbuiltforWellbeing / Activity 18.19 (1.27) *** 46.09 (1.68) *** SDNN Activity -43.39 (1.64) *** *** Activity < 0.21 9.00 (1.41) 0.9 > Activity >= 0.21 90.49 (3.87) *** Activity >= 0.9 -15.21 (1.46) *** ** ST2 1.63 (0.85) 1.75 (0.85) *** ST2 -0.44 (-4.09) HospitalMonitoring / RR ST2 < -0.18 1.51 (1.92) ST2 >=-0.18 2.28 (0.95) ** z -0.0073 (0.0104) -0.0061 (0.0098) z -0.026 (0.0102) ** z < -0.44 -0.0246 (0.0205) -0.2 > z >= -0.44 -0.0318 (0.0167) BeerCrawl / TAC -0.15 > z >= -0.2 -0.0340 (0.0390) *** -0.01 > z >= -0.15 -0.2014 (0.0734) *** 0.41 > z >= -0.01 0.0857 (0.0279) z >= 0.41 0.0030 (0.0182) *** = p < .01, ** = p < .05 Figure 2 shows a visual representation of the segmented model coefficients compared to coefficients from linear and curvilinear models. Co-incidentally, the shapes of the piecewise relationships for each of the pairwise association resembles the corresponding smooth functions shown in Figure 1 reinstating the importance of our HITL approach to train robust interpretable segmented inputs in the mixed-effects models. 5 DISCUSSION AND CONCLUSION In this study, we presented the problem of developing an interpretable model that captures piecewise pairwise associations between different modalities captured by wearables. Since existing methods for segmented modeling for mixed-effect regression are insufficient to determine robust and verifiable change point, our method is timely with increasing research applications utilizing wearables in a natural experimental setup. Our method involves the inspection of smooth functions of pairwise associations captured using GAMM, followed by using the Brent’s method to sequentially position change points optimizing model fit. Our method not only uses analytical tools to determine change points, but also utilizes user discretion to control the number of change points and its localization. For example, it is often desirable to avoid change points at extremities as data ACM Trans. Manage. Inf. Syst. corresponding to these segments may be very sparse, rendering inference unreliable. We apply our method to three different wearables datasets and show that not only is it effective in terms of improving model fit and prediction performance, but also significantly enhances model interpretability and ability to derive meaningful inferences. Figure 2: Trajectory of linear, curvilinear and segmented fixed effects coefficients for (a) Sound-SDNN; (b) Activity- SDNN; (c) ST2-RR; and (d) z-TAC 5.1 Managerial implications Our method and analysis have several managerial implications. Our study provides a novel tool to analyze wearables data, thus boosting the value for storing and processing of large amounts of big data generated by wearables. Our HITL-based segmented modeling method can be used in a wide range of wearables applications such as patient monitoring systems, military fitness management programs, smart diet applications, COVID-19 contact tracing, etc. Our analysis over the three wearables datasets present interesting pairwise associations. The positive relationship between sound level and physical wellbeing measure below the range of 51 dBA informs workplace design practices on the need for further examination of sound levels effects on employee health for different sound level ranges. A higher gradient of activity-wellbeing relationship in the lower range of activity provides additional empirical evidence on the value of low-intensity/intermittent activities on elevating instantaneous stress and improving wellbeing. The significant association between cardiovascular wellness measure and respiration rate after a certain threshold of the ST segment index solicits clinical researchers to further examine inter- relationships between pulmonary and cardiovascular wellness indices to improve on existing hospital ACM Trans. Manage. Inf. Syst. monitoring and early warning systems. Finally, the association between a dimension of raw accelerometer data and extrinsic phenomena such as alcohol consumption stresses the value of looking raw data in addition to expert-engineered features such as gait variability and number of continuous steps. 5.2 Contribution to IS research Predictive modeling and statistical modeling in analytics go side-by-side as one predicts the future using existing data, focusing on informing us on the question What will be, while the other explicates hidden patterns and tells us about What is with respect to a phenomenon. Both of them are important and require attention to optimize the utility of the generated data. As the number of wearable technology- based applications increases in future, the quantum of available data to analyze will exponentially increase and warrant for more and more advancements in explainable modeling for meaningful interpretations of patterns. In this study, we introduce a new method to address the design challenge of representing non- linear associative patterns in wearables data. Our contribution is timely in IS research, as the discipline is widening its scope in design science as well as explanatory modeling applications by using novel data sources such as wearables [70]. WDA is a promising area in IS [5], [19], opening a wide range of research applications owing to the following two reasons; the ubiquitous nature of wearables in today’s lifestyle, and the promise of wearables to generate rich, personalized, temporal and highly grained information content. We therefore posit that our contributions through a novel interpretable modeling method for addressing challenges in WDA lays the foundation for promising research in IS using data generated from wearables. 5.3 Limitations There are some caveats and limitations to our study. We have focused on the design science problem of developing an interpretable modeling method, but do not delve into the subject of determining the significance of input variables themselves. Also, our method by itself does not imply causation though it can be applied to any explanatory modeling scenario including causal or quasi-causal experimental settings. If curvilinear effects are absent, the segmented modeling approach should be avoided to prevent over-fitting. The modeling approach described in this study is useful when higher order association is pre-determined between pairs of repeated measures and there is a need to better explain these associations for making inferences. For high-dimensional large datasets, GAMM can take longer time to fit, and the change point optimization can be tedious for the analyst. A few ways to avoid this problem are to apply feature selection, variable transformation, and outlier detection procedures before examining pairwise associations using our method. Next, our HITL approach involves human inputs, and therefore, may still be susceptible to human errors and biases, despite the fine- tuning step using the optimization procedure. One way of reducing such potential errors is to consult domain experts post determination of change point from the optimization procedure. Finally, it is worth noting that our method caters to the problem of improving interpretability of glass-box models, at the cost of increased bias and limited predictive power when compared to black-box data mining models [28]. 5.4 Conclusion With the increasing availability of wearables, we can measure and understand different health phenomena at a highly granular level. We propose a human-in-the-loop method for accurate estimation of change points in ACM Trans. Manage. Inf. Syst. segmented mixed-effects regression facilitating the interpretations of pairwise associations of variables in wearables data. Our method is robust, efficient and the resultant segmented models provide better prediction accuracy than state-of-the-art alternatives for given problem. Our proposed method is empirically validated, more reliable due to human verification, and provides better interpretable results. Our approach can be generalized to other areas of IS where non-linear pairwise associations are anticipated. REFERENCES [1] GlobalNewsWire, Smart Wearables Market and Wearable Apps Mar ket Global, 2021. https://www.globenewswire.com/news - release/2021/12/15/2352316/0/en/Smart-Wearables-Market-and-Wearable-Apps-Market-Global-Share-Insights-2021-Top-Countries-Data-Future- Growth-Developments-Impact-of-Covid-19-on-Industry-Size-Production-Cost-Value-V.html (accessed Jan. 11, 2022). [2] I. Bardhan, H. Chen, and E. Karahanna, Connecti ng systems, data, and people: A multidisciplinary research roadmap for chronic disease management, MIS Quarterly, vol. 44, no. March, pp. 185–200, 2020. [3] HeLuo, LiuHongyan, YangYinghui, and WangBei, A Multi -attention Collaborative Deep Learning Approach for Blood Pressure Prediction, ACM Transactions on Management Information Systems (TMIS), vol. 13, no. 2, pp. 1–20, Oct. 2021, doi: 10.1145/3471571. [4] J. A. Killian, K. M. Passino, A. Nandi, D. R. Madden, and J. Clapp, Learning to detect heavy drinking episodes using smartphone ac celerometer data, in CEUR Workshop Proceedings, 2019, vol. 2429. [5] H. Zhu, S. Samtani, R. A. Brown, and H. Chen, A deep lea rning approach for recognizing activity of daily living (adl) for senior care: Exploiting interaction dependency and temporal patterns, MIS Quarterly, vol. 45, no. 2, 2021, doi: 10.25300/MISQ/2021/15574. [6] C. M. Lindberg et al., Effects of office workstation type on physical activity and stress, Occup Environ Med, vol. 75, no. 10, pp. 689–695, 2018, doi: 10.1136/oemed-2018-105077. [7] E. Smets et al., Large -scale wearable data reveal digital phenotypes for daily-life stress detection, npj Digital Medicine 2018 1:1, vol. 1, no. 1, pp. 1–10, Dec. 2018, doi: 10.1038/s41746-018-0074-9. [8] W. Zhang and S. Ram, A comprehensive analysis of triggers a nd risk factors for asthma based on machine learning and large he terogeneous data sources, MIS Quarterly, vol. 44, no. 1, pp. 305–349, 2020, doi: 10.25300/MISQ/2020/15106. [9] C. C. Yang, G. Leroy, and S. Ananiadou, Smart health and wellbeing, ACM Trans Manag Inf Syst, vol. 4, no. 4, 2013, doi: 10.1145/2555810.2555811. [10] GuKang, VosoughiSoroush, and PrioleauTemiloluwa, Sympto mID: A Framework for Rapid Symptom Identification in Pandemics Using News Reports, ACM Transactions on Management Information Systems (TMIS), vol. 12, no. 4, pp. 1–17, Sep. 2021, doi: 10.1145/3462441. [11] I. Azodo, R. Williams, A. Sheikh, and K. Cresswell, Opport unities and Challenges Surrounding the Use of Data From Wearable S ensor Devices in Health Care: Qualitative Interview Study, J Med Internet Res 2020;22(10):e19542 https://www.jmir.org/2020/10/e19542, vol. 22, no. 10, p. e19542, Oct. 2020, doi: 10.2196/19542. [12] M. Uddin and S. Syed‐abdul, Data Analytics and Appli cations of the Wearable Sensors in Healthcare: An Overview, Sensors (Basel), vol. 20, no. 5, Mar. 2020, doi: 10.3390/S20051379. [13] U. Kraus et al., Individual daytime noise exposure during routine activiti es and heart rate variability in adults: a repeated measures stud y, Environ Health Perspect, vol. 121, no. 5, pp. 607–612, 2013. [14] B. Q. Liu and D. L. Goodhue, Two worlds of trust for pot ential e-commerce users: Humans as cognitive misers, Information Systems Research, vol. 23, no. 4, pp. 1246–1262, 2012. [15] G. Pant and P. Srinivasan, Predicting Web Page Status, Information Systems Research, vol. 21, no. 2, pp. 345–364, Jun. 2010, doi: 10.1287/isre.1080.0231. [16] J. D. Singer and J. B. Willett, Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. New York: Oxford University Press, 2009. doi: 10.1093/acprof:oso/9780195152968.001.0001. [17] V. M. Muggeo, D. C. Atkins, R. J. Gallop, and S. Dimidjian, Segmented mixed models with random ch angepoints: a maximum likelihood approach with application to treatment for depression study, Stat Modelling, vol. 14, pp. 293–313, 2014. [18] H. Bozdogan, Model selection and Akaike’s Information C riterion (AIC): The general theory and its analytical extensions, Psychometrika, vol. 52, no. 3, p. 3450370, 1987. [19] H. Chen, R. H. L. Chiang, and V. C. Storey, Business In telligence and Analytics: From Big Data To Big Impact, MIS Quarterly, vol. 36, no. 4, pp. 1165–1188, 2012. [20] M. Chan, D. Estève, J.-Y. Fourniols, C. Escriba, and E. Campo, Smart wear able systems: Current status and future challenges, Artif Intell Med, vol. 56, no. 3, pp. 137–156, 2012. [21] H. Banaee, M. U. Ahmed, and A. Loutfi, Data mining for wearable sensors in health monitoring systems: A review of recent trends and challenges, Sensors, vol. 13, pp. 17472–17500, 2013. [22] S. Majumder, T. Mondal, and M. Deen, Wearable Sensors for Remote Health Monitoring, Sensors, 2017. [23] I. Yamada and G. Lopez, Wearable sensing systems for heal thcare monitoring, in 2012 Symposium on VLSI Technology (VLSIT), Jun. 2012, pp. 5–10. ACM Trans. Manage. Inf. Syst. [24] K. Malhi, S. C. Mukhopadhyay, J. Schnepper, M. Haefke, and H. Ewald, A zigbee -based wearable physiological parameters monitoring system, IEEE Sens J, 2012. [25] D. Ravi, C. Wong, B. Lo, and G.-Z. Yang, A Deep Learning Approach to on -Node Sensor Data Analytics for Mobile or Wearable Devices, IEEE J Biomed Health Inform, 2017. [26] A. Sano et al., Identifying Objective Ph ysiological Markers and Modifiable Behaviors for Self-Reported Stress and Mental Health Status Using Wearable Sensors and Mobile Phones: Observational Study, J Med Internet Res, vol. 20, no. 6, pp. 210–216, 2018. [27] J. B. Wang et al., Wearable Sensor/Device (Fitbit One) and SMS Text-Messaging Prompts to Increase Physical Activity in Overweight and Obese Adults: A Randomized Controlled Trial, Telemedicine and e-Health, vol. 21, no. 10, pp. 782–792, Oct. 2015. [28] G. Shmueli, To Explain or to Predict?, Statistical science, vol. 25, no. 3, pp. 289–310, 2010. [29] H. Zhu, H. Chen, and R. Brown, A sequence -to-sequence model-based deep learning approach for recognizing activity of daily living for senior care, J Biomed Inform, vol. 84, pp. 148–158, Aug. 2018. [30] A. Pentland, D. Lazer, D. Brewer, and T. Heibeck, Using reality min ing to improve public health and medicine, 2009. [31] S. Guillén, M. T. Arredondo, and E. Castellano, A s urvey of commercial wearable systems for sport application, in Wearable Monitoring Systems, [32] L. Breiman, Statistical Modeling: The Two Cultures, Statistical Science, 2001. [33] M. N. Jarczok et al., Autonomic nervous system activity and workplace stressors -A systematic review, Neuroscience and Biobehavioral Reviews, vol. 37, no. 8. 2013. [34] X. Li et al., Digital Health: Tracking Physiomes and Activity Using Wearable Biosensors Reveals Useful Health-Related Information, PLoS Biol, [35] P. MacNaughton, J. Spengler, J. Vallarino, S. Santanam, U. Satish, and J. Allen, Environmental perceptions an d health before and after relocation to a green building, Build Environ, vol. 104, pp. 138–144, 2016. [36] J. F. Thayer et al., Effects of the physical work environment on physiological measures of stress, European journal of cardiovascular prevention and rehabilitation, vol. 17, no. 4, pp. 431–9, Aug. 2010. [37] T. Föhr et al., Subjective stress, objective heart rate variability -based stress, and recovery on workdays among overweight and psychologically distressed individuals: a cross-sectional study, Journal of Occupational Medicine and Toxicology, 2015. [38] T. Pereira, P. R. Almeida, J. P. S. Cunha, and A. Aguiar, Heart rate variability metrics for fine -grained stress level assessment, Comput Methods Programs Biomed, vol. 148, pp. 71–80, Sep. 2017. [39] A. Gelman and J. Hill, Data Analysis Using Regression and Multilevel/Hierarchical Models, 1st ed. New York, NY: Cambridge University Press, 2006. doi: 10.1017/CBO9780511790942. [40] G. Pimentel, S. Rodrigues, P. A. Silva, A. Vilarinho, R. Vaz, and J. P. Silva Cunha, A wearable approach for i ntraoperative physiological stress monitoring of multiple cooperative surgeons, Int J Med Inform, vol. 129, 2019, doi: 10.1016/j.ijmedinf.2019.05.028. [41] M. Cropley et al., The association between work -related rumination and heart rate variability: A field study, Front Hum Neurosci, vol. 11, Jan. 2017, doi: 10.3389/FNHUM.2017.00027/ABSTRACT. [42] J. D. Runkle, C. Cui, C. Fuhrmann, S. Stevens, J. del Pinal, and M. M. Sugg, Evaluation of wearable sensors for ph ysiologic monitoring of individually experienced temperatures in outdoor workers in southeastern U.S., Environ Int, vol. 129, 2019, doi: 10.1016/j.envint.2019.05.026. [43] M. Durban, J. Harezlak, M. P. Wand, and R. J. Carroll, Simple fitting of subject -specific curves for longitudinal data, Stat Med, vol. 24, no. 8, pp. 1153–1167, 2005. [44] D. W. K. Andrews, Tests for Parameter Instability and Struc tural Change With Unknown Change Point, Econometrica, vol. 61, no. 4, 1993, doi: 10.2307/2951764. [45] A. Linden, Conducting interrupted time -series analysis for single- and multiple-group comparisons, Stata Journal, vol. 15, no. 2, 2015, doi: 10.1177/1536867x1501500208. [46] S. E. Ryan and L. S. Porth, A tutorial on the piecewise r egression approach applied to bedload transport data, USDA Forest Service - General Technical Report RMRS-GTR, no. 189 RMRS-GTR, 2007. [47] V. M. R. Muggeo, Estimating regression models with un known break-points, Stat Med, vol. 22, no. 19, pp. 3055–3071, 2003, doi: 10.1002/sim.1545. [48] X. Shuai, Z. Zhou, and R. S. Yost, Using segmented regress ion models to fit soil nutrient and soybean grain y ield changes due to liming, J Agric Biol Environ Stat, vol. 8, no. 2, pp. 240–252, 2003. [49] A. Doan, Human -in-the-loop data analysis: A personal perspective, Proceedings of the Workshop on Human-In-the-Loop Data Analytics, HILDA 2018, Jun. 2018, doi: 10.1145/3209900.3209913. [50] X. Wu, L. Xiao, Y. Sun, J. Zhang, T. Ma, and L. He, A survey of human-in-the-loop for machine learning, Future Generation Computer Systems, vol. 135, pp. 364–381, Oct. 2022, doi: 10.1016/j.future.2022.05.014. [51] Y. Gil et al., Towards h uman-guided machine learning, in International Conference on Intelligent User Interfaces, Proceedings IUI, 2019, vol. Part F147615. doi: 10.1145/3301275.3302324. [52] D. D. Xin, L. L. Ma, J. J. Liu, S. S. Macke, S. S. Song, and A. A. Parameswaran, Acceleratin g Human-in-the-loop Machine Learning: Challenges and opportunities, 2018. doi: 10.1145/3209889.3209896. [53] A. Holzinger, Interactive machine learning for health i nformatics: when do we need the human-in-the-loop?, Brain Inform, vol. 3, no. 2, 2016, doi: 10.1007/s40708-016-0042-6. ACM Trans. Manage. Inf. Syst. [54] K. Zheng, G. Chen, M. Herschel, K. Y. Ngiam, B. C. Ooi, and J. Gao, PACE: Learning Effective Task Decomposition for Human -in-the-loop Healthcare Delivery, Proceedings of the ACM SIGMOD International Conference on Management of Data, vol. 13, pp. 2156–2168, 2021, doi: 10.1145/3448016.3457281. [55] A. Gupta, M. T. Lash, and S. K. Nachimuthu, Optimal s epsis patient treatment using human -in-the-loop artificial intelligence, Expert Syst Appl, vol. 169, p. 114476, May 2021, doi: 10.1016/J.ESWA.2020.114476. [56] S. N. Wood, Stable and efficient multiple smoothing parameter estimation for generalized additive models, J Am Stat Assoc, vol. 99, no. 467, 2004, doi: 10.1198/016214504000000980. [57] J. J. Faraway, Extending the linear model with R: generalized linear, mixed effects and nonparametric regression models, 1st ed. Boca Raton, FL: CRC press, 2016. [58] R. P. Brent, Algorithms for Minimization without Derivatives, 2nd ed. Eaglewood Cliffs, N.J.: Dover publications, 2013. [59] R. Fletcher, Practical Methods of Optimization, 4th ed., vol. 53. West Sussex, England: John Wiley & Sons, 2013. [60] GSA, Wellbuilt for Wellbeing, 2021. https://www.gsa.gov/gove rnmentwide-initiatives/federal-highperformance-green-buildings/resource- library/health/wellbuilt-for-wellbeing (accessed Jan. 12, 2022). [61] F. Shaffer and J. P. Ginsberg, An Overview of Heart Rate Var iability Metrics and Norms, Front Public Health, vol. 5, no. 258, pp. 1–17, Sep. 2017, doi: 10.3389/fpubh.2017.00258. [62] L. Soares-Miranda et al., Physical Activity and Heart Rate Variability in Older Adults, Circulation, vol. 129, no. 21, pp. 2100–2110, May 2014. [63] K. Srinivasan et al., A Regularization Approach for Identifying Cumulative La gged Effects in Smart Health Applications, in Proceedings of the 7th International Conference on Digital Health, 2017, pp. 99–103. [64] D. Liu, M. Görges, and S. A. Jenkins, The University o f Queensland Vital Signs Dataset: Development of an accessible reposit ory of anesthesia patient monitoring data for research, Anesth Analg, vol. 114, no. 3, 2012, doi: 10.1213/ANE.0b013e318241f7c0. [65] B. Vogel et al., ST -segment elevation myocardial infarction, Nature Reviews Disease Primers 2019 5:1, vol. 5, no. 1, pp. 1–20, Jun. 2019, doi: 10.1038/s41572-019-0090-3. [66] G. B. Smith, D. R. Prytherch, P. Meredith, P. E. Schmidt, and P. I. Featherstone, The abil ity of the National Early Warning Score (NEWS) to discriminate patients at risk of early cardiac arrest, unanticipated intensive care unit admission, and death, Resuscitation, vol. 84, no. 4, pp. 465– 470, Apr. 2013, doi: 10.1016/J.RESUSCITATION.2012.12.016. [67] S. Kianersi, M. Luetke, J. Agley, R. Gassman, C. Ludema, and M. Rosenberg, Validation of transdermal alcohol conc entration d ata collected using wearable alcohol monitors: A systematic review and meta-analysis, Drug Alcohol Depend, vol. 216, p. 108304, Nov. 2020, doi: 10.1016/J.DRUGALCDEP.2020.108304. [68] D. Pradeep Kumar, N. Toosizadeh, J. Mohler, H. Ehsani, C. Mannier, and K. Laksari, Sensor -based characterization of daily walking: A new paradigm in pre-frailty/frailty assessment, BMC Geriatr, vol. 20, no. 1, pp. 1–11, May 2020, doi: 10.1186/S12877-020-01572-1/FIGURES/2. [69] V. M. R. Muggeo, segmented: An R package to Fit Regression M odels with Broken-Line Relationships, R News, vol. 8, no. May, 2008, doi: 10.1159/000323281. [70] A. Rai et al., Editor’s Comments: Diversity of Design Science Research, MIS Quarterly, vol. 41, pp. iii–xviii, 2017. ACM Trans. Manage. Inf. Syst.

Journal

ACM Transactions on Management Information Systems (TMIS)Association for Computing Machinery

Published: Jan 25, 2023

Keywords: Smart health

References