1 Introduction

Dance and music may be employed as a kind of exercise that lasts a lifetime. In addition to enhancing emotional and physical health, music and dance also have the power to regulate daily life, foster better interpersonal relationships, and fill individuals with vigor and vitality. According to psychologists, the emotional responses of the elderly can be broadly categorized as melancholy, sadness, anxiety, loneliness, guilt, helplessness, and anger, and the cognitive changes can be broadly categorized as pessimism, short attention span, hopelessness, helplessness, uselessness, low value, and, in some cases, delusions, fantasies, or suicidal thoughts, which are more common in the elderly than in other age groups [1]. Compared to other age groups, the elderly are more prone to have these symptoms [2]. According to the survey, it is well known that senior people frequently experience temporary anxiety symptoms. If psychological issues are effectively handled, elderly people’s bad behaviors and social issues can be alleviated [3]. Female seniors who actively engage in social activities have been shown to live longer, and group music and dance activities give seniors a new platform to express their social roles while also giving them a place to confide in and communicate with one another, effectively reducing loneliness and raising happiness levels. However, it is very important to accurately capture the real physiological excitement of people to music and dance, as well as the pleasurable biological stimulation.

Physiological signal analysis and processing algorithms for wearable technology have advanced quickly in recent years, making significant contributions to domains of human health monitoring such heart rate estimates of human activity and emotion identification [4, 5]. In the human health monitoring system, as seen in Fig. 1, wearable devices like smart watches and bracelets are used to gather physiological signals from the body, such as photoplethysmography (PPG) and electrocardiogram (ECG), before transmitting the information to a distant server. Finally, the analysis and processing of physiological information allows for the perception of the human body’s state of health [6]. The human health monitoring system can help reduce the scarcity of medical resources brought on by China’s massive population base and growing aging rate in addition to offering low-load and long-term continuous daily health monitoring for the monitored [7]. Consequently, developing algorithms for physiological data analysis and processing for wearable technology has significant scientific and societal implications.

Fig. 1
figure 1

Human health monitoring system based on physiological signal analysis and processing

A human health monitoring system can not only provide low-load, long-term continuous daily health monitoring for the monitored individuals, but also alleviate the shortage of medical resources due to the huge population base and the deepening aging in China. This paper proposes a wearable ECG individual signal discrepancy elimination algorithm based on BDA. The effectiveness of this algorithm in eliminating individual signal differences in ECG is verified in an emotion recognition experiment based on an open dataset. Our model can provide effective help for accurate estimation of emotions and group activities, not only limited to music and dance. Therefore, it is of great scientific and social importance to conduct research on physiological signal analysis and processing algorithms for wearable devices.

2 Related Work

One of the sports that older people engage in is aerobic exercise of low-to-moderate intensity, and one of them is music and dance, which forms a regular acoustic vibration through changes in tone and melody, which is a benign stimulus for the human body and has a direct effect on the brain, and this physical energy can cause the tissue cells to resonate harmoniously, resulting in physiological and psychological responses [8, 9]. Al-Turjman et al. [10] indicate that older adults who engage in regular exercise have significantly higher subjective health status (self-perceived health, recuperation, mental health, and stress management) than those who do not exercise. He et al. [11] interviewed older adults who participated in square dancing and concluded that they had higher happiness scores and all had good interpersonal relationships and fulfilling social lives. Kaur et al. [12] noted that female older adults who exercised consistently had fewer functional limitations in health, better daily functioning, better exercise related self-efficacy, and fewer falls. Shen et al. [13] indicated that as older adults participated in group leisure activities more frequently, their physical and mental health and vitality tended to increase. By analyzing the current status of domestic and international research, this section attempts to show the trend of related research areas.

2.1 Product Design of Physiological Data

The word “emotion” is derived from the Greek word “pathos” and was first used to express people’s feelings about tragedy [14]. Emotion plays a central role in a range of intelligent activities, such as perception, decision-making, logical reasoning, and social interaction, and some studies even show that “80%” of human communication is emotional information [15]. Because of the significance of emotion in human communication, affective computing is an essential part of the human–computer interaction process and an important breakthrough in making machines intelligent [16, 17]. Emotional computing and understanding aims to give computer systems the ability to recognize, understand, express, and adapt to human emotions to build a harmonious human–computer environment, and to make computers more intelligent and comprehensive [18, 19]. Depending on the input signal, affective computing and understanding encompasses different research directions. In addition, unlike the short-term and sparse characteristics of visual or speech emotion signals, physiological signals are long-time continuous signals that can work continuously in the absence or failure of traditional visual and speech signals, and thus can effectively enhance human–computer interaction or interpersonal interaction.

2.2 Methods Related to Emotion Measurement

According to the current mainstream research, models for measuring emotions are mainly divided into basic emotion theories (categorical models) and dimensional emotion theories. The basic emotion theory is a collection of basic and universal fundamental emotions [20, 21]. In recent years, physiological data mainly include peripheral physiological measurements (e.g., heart rate, respiration, electromyography, dermatography, skin temperature, etc.) and central nervous system measurements (e.g., EEG and functional magnetic resonance imaging) [22, 23]. Southwestern University in China has also done a lot of work on emotion recognition based on human physiological data, and considers physiological data as a very useful feature for identifying human emotions [24]. Measuring physiological signals is the first step in building a system that can automatically identify physiological patterns related to emotions. In a broad sense, all bodily changes can be considered as changes in physiological signals, including brain activity, facial expressions, voice patterns, and body chemistry. In this paper, we focus on continuous physiological signals measured from the skin surface, which can reflect the activity of the autonomic nervous system. Next, we will discuss the characteristics of cardiac activity (heart rate and heart rate variability), blood pressure, skin conductivity, surface electromyography, respiration, pulse, and other physiological signals.

With the rapid development of network technology and intelligent hardware, wearable devices have become a hot spot for current research. The application of wearable devices in the field of human health monitoring provides a new path for disease monitoring and diagnosis, and has become a hot spot in the current medical device innovation. Based on this, this paper briefly introduces the current application status and development trend of wearable devices in the field of music and dance well-being of elderly people.

3 Methodology

3.1 PPG Motion Artifact Removal Based on Sparse Representation

The purpose of this section is to remove motion artifacts (MA) of PPG using sparse representation (SR) based on the selected dictionary. It should be noted that this paper sets the dictionary form according to the strength of MA in the original signal to effectively express the original signal. In this part, we first introduce the binary decision based on correlation (used to determine whether the original PPG signal contains strong MA), and then describe the PPG MA removal algorithm based on sparse representation. The purpose of correlation-based decision-making is to determine whether the original PPG signal contains strong MA by calculating the correlation coefficient between the STFT amplitude of the original PPG signal and the STFT amplitude of the acceleration signal. If the calculated Pearson correlation coefficient is not high, then the original PPG signal does not contain strong MA (that is, the PPG signal is considered clean). In this case, to avoid interference of MA dictionary atoms (from acceleration signal), MA dictionary R is not used. First, the STFT amplitude of PPG signal and acceleration signal is calculated. Specifically, this paper uses STFT to convert the original PPG signal sraw and acceleration signal araw into STFT amplitude domain, as shown in Eqs. (1) and (2)

$${\mathbf{X}} = \left| {{\text{STFT}} \left( {{\mathbf{s}}_{{\text{raw }}} } \right)} \right|$$
(1)
$${\mathbf{A}} = \left| {{\text{STFT}} \left( {{\mathbf{a}}_{{\text{raw }}} } \right)} \right|$$
(2)

where X ∈ R m × N is the STFT amplitude of sraw, and A ∈ R m × N is the STFT amplitude of araw. M represents the component in the frequency range of 0–5 Hz (the frequency range of heart rate) in the STFT domain. N represents the number of time windows. The acceleration signal araw is the addition of three-axis acceleration signals after band-pass filtering. Second, the matrix X is converted into a column vector \(x \in R\left( {m * n} \right) \times 1\), that is, x is composed of column vectors in matrix X. Similarly, matrix A is transformed into a column vector \(a \in R\left( {m * n} \right) \times 1\), that is, a is composed of column vectors in matrix A. M · n represents the multiplied value of m and n. Finally, measure the Pearson correlation coefficient between x and a whether the pcorr meets the following conditions, as shown in Eq. (3):

$$\left| {\rho_{{\text{corr }}} } \right| \le \Delta_{{\text{corr }}} .$$
(3)

Including Δ Corr is a preset threshold. If ρcorr meets Eq. (3), the original PPG signal is determined to be clean (or contains weak MA). At this time, only the PPG dictionary is used, and the MA dictionary is not used, which is expressed as Eq. (4)

$$D = \tilde{R}$$
(4)

If Pcorr cannot meet the Eq. (3), and the original PPG signal is determined to be unclean (or contains strong MA). At this time, both PPG dictionary and MA dictionary will be used, expressed as Eq. (5):

$$D = [\tilde{R}R].$$
(5)

where D represents the dictionary used in the MA removal algorithm of sparse representation. (2) The sparse representation PPG motion artifact removal sparse representation PPG motion artifact (MA) removal algorithm aims to remove MA from the original PPG signal using the sparse representation algorithm. Sparse representation is a new signal processing technology, which has shown great potential in many applications. The sparse representation algorithm can be used for PPG and acceleration signals, because PPG signal and acceleration signal (MA reference signal) are sparse in a specific domain; the regularity of PPG and acceleration signals indicates that these two signals can be well represented. If an original PPG signal contains strong MA, this paper assumes that the STFT amplitude of the original PPG signal \(x \in R\left( {m * n} \right) \times 1\), STFT amplitude s ∈ R (m · n) of clean PPG signal × 1, and STFT amplitude of MA \(m \in R\left( {m * n} \right) \times 1\) and 1. This assumption can be expressed as Eq. (6)

$$X = s + m.$$
(6)

This assumption is like the assumption of speech enhancement. It is the sum of the spectrum amplitude of the original noisy speech signal that can be approximately regarded as speech signal and noise signal. Based on this assumption, if the original PPG signal contains strong MA, then D = [R \(\tilde{R}\)], That is, the dictionary D used in sparse representation is determined by the PPG dictionary \(\tilde{R}\) and MA dictionary R. At this time, the optimization problem in Eq. (7) can be used to remove MA. The purpose of sparse representation is to use a linear combination of a small number of dictionary atoms (atoms selected from dictionary D that are related to the original PPG signal x), the sparse coefficient is obtained by approximating the original PPG signal x with a lower error. The amplitude of clean PPG is estimated by the sparse coefficient obtained, and the contribution of MA is ignored, so MA is removed, and the denoised PPG signal is obtained. According to the form of D, the sparse representation algorithm includes two cases. On the one hand, if \(D = \tilde{R}\), the purpose of sparse representation algorithm is to use the dictionary \(\tilde{R}\) from the clean PPG linear combination of dictionary atoms. In this case, to remove MA, the obtained MA dictionary atom is ignored. Only the sparse coefficient corresponding to the clean PPG signal is used to estimate the clean PPG amplitude, and the clean PPG signal is reconstructed combining the phase of the original PPG signal to obtain the denoised PPG signal. The optimization problem of sparse representation algorithm is expressed as

$$\begin{gathered} \min_{{\mathbf{c}}} \left\| {{\mathbf{x}} - {\mathbf{D}}} \right\|_{2} \, \hfill \\ {\text{s}}{\text{.t}}{. }\left\| {\mathbf{c}} \right\|_{0} \le K, \hfill \\ \end{gathered}$$
(7)

where x represents the STFT amplitude of the original PPG signal after band-pass filtering. D is the dictionary after the decision using correlation (D = \(\tilde{R}\) or D = [R \(\tilde{R}\)]) ° K is a small positive integer. C is a sparse coefficient vector. c = c \(\tilde{R}\) or c = [\(\tilde{C}\) r cr] T, where \(\tilde{C}\) R and cr correspond to PPG dictionary \(\tilde{R}\) And MA dictionary R. This optimization problem aims to obtain a sparse coefficient vector c, so that when the coefficient vector is sparse, the linear combination of a few dictionary atoms (the product of dictionary atoms and sparse coefficients) in dictionary D can approximately represent the original PPG signal x. The above optimization problem, Eq. (7), can be solved by using many algorithms, such as orthogonal matching pursuit OMP and base pursuit noise reduction. These algorithms use different criteria and steps to solve problems. Because OMP algorithm has low computational complexity and is simple to implement, this paper chooses OMP algorithm in the experiment. OMP is an iterative algorithm for selecting atoms from dictionaries. After the OMP algorithm is used to solve the optimization problem represented by Eq. (7), a sparse coefficient vector c is obtained in this paper. Then, the estimated STFT amplitude is corresponding to the denoised (clean) PPG signal \(\hat{S}\). It can be obtained in the following way, as shown in Eq. (8):

$$\hat{S} = \hat{R}c_{{\tilde{r}}}^{T} .$$
(8)

Among them, \(\hat{S}\) S can be converted into a denoised PPG signal in the time domain. Vector \(\hat{S}\) ∈ R (m·n) × 1 is converted to matrix \(\hat{S}\) ∈ R m × n, where the matrix under the STFT field \(\hat{S}\) It can be converted into a denoised PPG signal in the time domain [by the following Eq. (9)]. Finally, based on the PPG signal \(\hat{S}\) in the STFT amplitude domain and the phase of the original PPG signal, \(\hat{S}\) is converted into the denoised PPG signal (signal in the time domain) srecon through the inverse STFT transform. The denoised PPG signal srecon will be used in the following heart rate peak tracking algorithm. The process of inverse STFT transformation can be expressed, as shown in Eq. (9). Where ISTFT represents the inverse STFT transform

$$S_{RECON} = ISTFT(\hat{S}).$$
(9)

3.2 Wearable ECG Difference Elimination Algorithm

In this section, a balanced BDA-based wearable ECG individual signal disparity elimination algorithm is proposed for sentiment recognition scenarios, which improves the performance of online sentiment recognition by overcoming inter-individual signal disparity as well as intra-individual signal disparity simultaneously. The algorithm uses the BDA algorithm [25] to reduce inter-individual signal differences, obtains a shared subspace of source and target user data, and trains an emotion classifier (based on source user data) for the target user in the shared subspace. At the same time, the Online Data Adaptation (ODA) algorithm is used to reduce the intra-individual signal differences, so that the time-varying ECG signals arriving online can be adapted to improve the online recognition performance. Figure 2 shows the framework of a wearable ECG individual signal difference elimination algorithm for emotion recognition. It mainly includes four parts: signal difference elimination between individuals based on balanced BDA, emotion classifier training, intra-individual signal difference elimination based on ODA, and online emotion recognition.

Fig. 2
figure 2

Wearable ECG individual signal difference elimination algorithm framework for emotion recognition

The BDA algorithm is used to learn a shared subspace and prepare for the training part of the emotion classifier. Figure 3 shows the principle of BDA-based inter-individual signal difference elimination algorithm. It can be seen from the figure that BDA makes the data of the two fields distributed and aligned in the shared subspace. BDA is a UDA algorithm, which can not only minimize the difference between edge distribution and conditional distribution, but also adaptively adjust the importance of each distribution. The BDA algorithm aims to minimize the difference between the source domain ECG data and the target domain ECG data, as shown in Eq. (10)

$$\begin{gathered} \frac{{\min_{{}} }}{{P_{r} }}{\text{tr}} \left( {{\mathbf{P}}_{{\text{r}}} {\mathbf{X}}\left( {(1 - \theta ){\mathbf{M}}_{0} + \theta \sum\limits_{c = 1}^{C} {{\mathbf{M}}_{c} } } \right){\mathbf{X}}^{{\text{T}}} {\mathbf{P}}_{{\text{r}}} } \right) + \lambda \left\| {{\mathbf{P}}_{{\text{r}}} } \right\|_{F}^{2} \hfill \\ s.t.{\mathbf{P}}_{{\text{r}}}^{{\text{T}}} {\mathbf{XHX}}^{{\text{T}}} {\mathbf{P}}_{{\text{r}}} = {\mathbf{I}},0 \le \theta \le 1. \hfill \\ \end{gathered}$$
(10)
Fig. 3
figure 3

Schematic diagram of signal difference elimination algorithm between individuals based on BDA and intra-individual signal difference elimination algorithm based on ODA

Because of the nonstationary of ECG, the ECG changes with time, resulting in differences between the online ECG data and the initial data of the same target user (intra-individual signal differences), resulting in the classifier f obtained in the previous part may not be applicable to the newly arrived online data. As shown in Fig. 3, although Pr obtained by BDA can map online data to a shared subspace (i.e., Fig. 3c). The purpose of ODA algorithm is to adaptively adapt time-varying ECG signals by reducing the signal difference within individuals (the difference between the online data of the same target user and the initial target data), to improve the performance of online emotion recognition. Specifically, the algorithm reduces the difference of second-order statistical characteristics between online data and initial target data, the time-varying ECG signals arriving online are adaptive to improve the online learning ability of the recognition model.

4 Experiments

4.1 Datasets

In the experiment, a variety of typical pseudo-periodic motion data sets are used. The six typical pseudo-periodic movements are walking, fast running, waving, arm swinging, elliptical machine, and squatting. These movements are related to hand movements, which are the main source of MA. Figure 4 shows the schematic diagram of six common fitness exercises: walking, fast running, waving, arm swinging, elliptical machine, and squatting. Figure 5 shows the hardware settings for collecting data. Figure 5a shows the hardware frame diagram during data acquisition, and Fig. 5b shows a subject with hardware. Among them, ECG sensor is used to obtain three lead ECG: LA, RA, and LL (LA and RA are placed in the left and right chest, and LL lead is placed in the left lower abdomen); the reflective pulse oximeter sensor with green LED is used to collect PPG signals; the ADXL345 sensor is used to collect the three-axis acceleration signal. Finally, all data of the above sensors are processed through STM32 and transmitted to the host for data analysis and processing.

Fig. 4
figure 4

Schematic diagram of six typical pseudo-periodic motions. a Walking; b run fast; c wave; d swing arm; e elliptical training; f squat

Fig. 5
figure 5

Hardware settings for data acquisition. a Block diagram of hardware settings for data acquisition. b A subject wearing hardware

The six datasets used in this chapter were collected, while subjects performed six typical pseudo-periodic exercises. Specifically, the Running dataset contains Walking and Running Fast; Beckoning contains Reckoning; Swing Arm contains Swing Arm; Elliptical Trainer contains Elliptical Machine; Deep Keen Bend contains Squat; and Mixture contains Reckoner, Swing Arm, and Squat.

The Running dataset was first presented in the literature [26]. It was collected from 12 subjects (aged 18–35 years) and 12 records were obtained. For each individual, three signals were acquired simultaneously, namely, a single-pass photoelectric volumetric pulse-wave signal (PPG), a triaxial acceleration signal, and an ECG. These signals were sampled at a frequency of 125 Hz and were acquired for approximately five minutes for each subject. Specifically, subjects first walked at 12 km/h for 0.5 min, then ran at 68 km/h for 1 min, then ran at 1215 km/h for 1 min, and finally walked at 12 km/h for 0.5 min.

The other five datasets (Beckoning, Swing Arm, Elliptical Trainer, Deep Keen Bend, and Mixture) involved exercises, such as handstands, arm swings, elliptical training, and squats. The first four datasets were collected from 9 subjects (aged 18–35 years), resulting in 9 records. The Mixture dataset was collected from 4 subjects, resulting in four records. Each record lasted approximately 4 min. For each subject, the signals collected simultaneously included a single-channel PPG signal, a triaxial acceleration signal, and an ECG signal.

The other five datasets are described in detail below.

The Beckoning dataset contains the beckoning action. During the acquisition, subjects rested for 0.5 min at the beginning and end of the acquisition. At other times, subjects performed the hand wave for 3 min. The hand-waving motion was very similar to the common mascot, the manekineko (depicting a cat waving with an upright paw).

The Swing Arm dataset contains arm-swinging movements. During the acquisition, subjects rested for 0.5 min at the beginning and end of the acquisition. At other times, the subject swings the arm with the sensor back and forth for 3 min at an angle of approximately 60°.

The Elliptical Trainer data set was collected from subjects during elliptical training. At the time of acquisition, subjects first rested for 0.5 min, then exercised on the elliptical machine for 1 min at 46 km/h, followed by 1 min at 78 km/h, then 1 min at 46 km/h, and finally rested for 0.5 min.

The Deep Keen Bend data set was collected while the subjects were performing the deep squat exercise. At the time of acquisition, subjects rested for 0.5 min at the beginning and end of the acquisition. At other times, subjects performed the squat exercise for 3 min. Before starting the squat, the subject straightens the back and lower back with the knees aligned with the tips of the toes. The squat was performed in a natural and fluid manner, with hands on the knees when the lowest point of the squat was reached, so that the hands could provide proper support when rising.

The Mixture dataset was collected from subjects performing a mixture of three movements, namely, handstands, arm swings, and squats. This data set contains four records. In each record, subjects rested for 0.5 min at the beginning and end of the acquisition. At other times, subjects performed two types of exercises for a total of 3 min. For the first type, subjects beckoned for 1.5 min and then swung their arms for 1.5 min. For the second type, the subject beckoned for 1.5 min and then squatted for 1.5 min. Of the four records, the first two were of the first type and the last two were of the second type.

4.2 Experimental Results and Analysis

In this paper, the performance of the algorithm was evaluated in five aspects: error, Bland Altman plot, Pearson coefficient scatter plot, heart rate estimation curve plot, and MA removal effect plot. Among the six methods compared in Table 1 (RandF, temko, TROIKA, JOSS, EEMD, and CorNet), there were some wild points, such as 70.18 for TROIKA. after removing outliers greater than 30 BPM, the mean errors of the five methods for all records were 3.25 BPM, 3.38 BPM, 3.52 BPM 7.66 BPM, 6.88 BPM, and 7.31 BPM. These results indicate that the performance of our method (DL + HMR + SR-based heart rate estimation) is still better (2.40 BPM) than these five methods even after removing the wild points in the six methods, which indicates that our method (DL + HMR + SR-based heart rate estimation) performs well and suggests that our algorithm (DL + HMR + SR) is effective in removing pseudo-periodic motion artifacts from the PPG signal.

Table 1 Error comparison between the method in this chapter and other heart rate estimation methods

To better compare different methods, Fig. 6 compares the F1 scores of different methods, that is, the average of the two types of F1 scores. For this method (emotion recognition based on BDA + ODA), the average F1 scores of all subjects are Dreamer’s valence recognition is 0.69%, dreamer’s arousal recognition is 0.65%, Amigos’ valence recognition is 0.66%, and Amigos’ arousal recognition is 0.63%. It can be seen from Fig. 6 that the F1 score of the method in this paper is higher than that of other methods (UDA is used to reduce the individual signal difference), which shows the advantages of the method in emotion recognition and shows that the algorithm in this paper (BDA + ODA) is better than other UDA algorithms in overcoming the individual signal difference of ECG [27]. Further, to better compare different methods, Figs. 7 and 8 show the confusion matrices for different methods to classify on Dreamer Valence and Dreamer Ambient, respectively. In the confusion matrix, each row represents the real class, and each column represents the prediction class. The elements (i, j) represent the percentage obtained when the samples originally belonging to class i are divided into class j. On the Dreamer dataset, the method in this paper (emotion recognition based on BDA + ODA) is 78.17% in positive emotion, 60.97% in negative emotion, and 57.12% in high arousal emotion, 79.32% in low arousal. Compared with other methods, this method performs better in valence classification and arousal classification, It shows that this method is superior to other methods (using UDA to reduce the individual signal difference) when it is used for emotion recognition, and it also shows that this algorithm (BDA + ODA) is superior to other UDA-based algorithms when it is used to eliminate the individual signal difference of ECG.

Fig. 6
figure 6

F1 score of different emotion recognition methods

Fig. 7
figure 7

Confusion matrix of different emotion recognition methods in Dreamer valence classification

Fig. 8
figure 8

Confusion matrix of different emotion recognition methods in Dreamer arousal classification

4.3 Heart Rate Estimation

On the data set Elliptic Trainer (Fig. 9), the estimated value deviates from the true value in 70–100 s. The reason may be that the trend of heart rate changes frequently, resulting in a large error in the heart rate peak tracking phase (when the heart rate value in the current time window is corrected based on the previous heart rate change trend). However, when the change trend of the rate of care is relatively stable, this method (heart rate estimation based on DL + HMR + SR) can accurately reestimate the heart rate. Moreover, in Fig. 9, the trend of heart rate estimation in the overall time is generally consistent with the trend of the real value, which indicates that the method in this paper performs a good heart rate estimation, thus demonstrating the effectiveness of the algorithm in this paper (DL + HMR + SR) in removing the pseudo-periodic motion artifacts in PPG.

Fig. 9
figure 9

The estimated heart rate curve of the method in this section (heart rate estimation based on DL + HMR + SR) on a subject record in six data sets

5 Conclusion

Wearable devices monitor human health and emotions by collecting human physiological signals and based on physiological signal analysis and processing algorithms to effectively detect group music and dance well-being. First, we propose an algorithm based on sparse representation to eliminate motion artifacts of wearable PPGs. In addition, to address the problem of poor model generalization due to individual signal differences (between and within individuals) in human ECG signals, a wearable ECG individual signal difference elimination algorithm based on BDA is proposed. The effectiveness of the algorithm in eliminating ECG individual signal differences is verified in an emotion recognition experiment based on an open dataset. Our model can provide effective help for accurate estimation of emotions and group activities, not only limited to music and dance.