Logo of halHAL - Archives Ouvertes - Home page.
IEEE Trans Biomed Eng. 2013 January; 60(1): 106–14.
Published online 2012 October 18. doi: 10.1109/TBME.2012.2225428.

The number of patients treated with Implantable Cardiac Devices (ICD) has strongly increased over the past 10 years. According to the American Heart Association [1], an estimated 111 000 defibrillator and 358 000 pacemaker implant procedures were performed for patients in the United States in 2007. These patients require regular in-hospital visits to follow-up the patient’s response to the therapy, to monitor whether the ICD is working optimally and, eventually, to modify the pacing parameters. More recently, wireless remote monitoring of ICD has become a priority for all major ICD constructors, in order to perform this follow-up more frequently, avoiding hospital visits and reducing costs. In both cases, the surface electrocardiogram (ECG) is necessary, since it is the main signal used by the cardiologist for the analysis of the cardiac electrical activity. However, the cardiac electrical activity acquired from the ICD, called electrograms (EGM), are collected by electrodes placed on the endocardium and/or the epicardium and show different morphologies than those of the surface ECG. The synthesis, or reconstruction, of the surface ECG from a set of EGM is thus of first importance in this context.

This challenging problem has been dealt by a number of studies [29]. From a methodological point of view, they can be categorized according to the method used to estimate the Transfer Function (TF) between EGM and ECG: i) linear filtering methods based upon Recursive Least Squares (RLS) estimation or by processing the data in blocks [24,9], ii) single, fixed-dipole modeling algorithms [8,9], and iii) non linear filtering methods [2,5,6].

Concerning the linear approach, several works have been proposed in the literature. In [2], each surface QRS complex was synthesized using a direct single-input single-output scheme. We showed in this work that the direct ECG synthesis depends strongly on the chosen EGM and that a multivariate approach would be of benefit. In this sense, we proposed in [3,4] an indirect method to estimate the linear TF between three-dimensional (3D) representations of cardiac activity [10], namely the signal EGM3 D , obtained from the orthogonalization of EGM signals and the signal ECG3 D derived through the orthogonalization of ECG signals.

In the same line of the above-mentioned methods, Mendenhall et al presented recently the use of a multivariate linear TF [8,9]. Although the performance of these linear methods is satisfactory, especially for patients with surface ECG containing only one beat morphology, some improvements are still needed. In fact, in a real application, noise, artifacts and the natural evolution of the pathology may influence the relationship, over time, between the EGM and the ECG. Thus stochastic and non linear phenomena crop up, and time series dynamics cannot be robustly described using classical linear filtering.

Two fixed-dipole modeling algorithms were proposed in [8,9]. They require both a QRS detection stage (as in [2]), and the simultaneous measurement of the surface cardiac electrical activity, using the standard 12-lead ECG and the modified-Frank VCG systems. In addition, it is shown in [9] that the synthesis of the surface ECG from EGM by using these two algorithms gave poor results.

Different multivariate non linear approaches have been proposed by our group [5,6]. These methods require simultaneously recorded intracardiac and surface ECG signals of each patient to train a Time Delay artificial Neural Network (TDNN) [11] and are patient specific. This strategy has shown, during our preliminary evaluations, to provide an improved performance compared to RLS, particularly when the patient presents multiple QRS morphologies [5].

This paper can be viewed as a natural extension of our previous works in this field, with significant improvements concerning: i) the optimization of the TDNN structure and parameters, ii) the analysis of the sensitivity of the reconstruction performance to the chosen EGM configuration, and iii) the quantitative and qualitative performance evaluation of the methods through a more comprehensive methodology.

Section II presents in details the two proposed non-linear reconstruction schemes (direct and indirect methods). The database used for the experimentation is described in section III. Section IV is devoted to the results, in terms of optimization of the parameters, selection of the EGM leads and comparison of the synthesis methods. We also discuss the case of real industrial implementation. Finally, section V summarizes the main concerns and conclusions of this study.

As described in the introduction, the ECG synthesis may be performed by a direct or an indirect strategy described in this section. This paragraph also presents the proposed non linear estimation method of the transfer function and the linear approach that will be used for performance comparison.

Let s(k) = [s(1, k), …, s(M, k)]T and x(k) = [x(1, k), …, x(N, k)]T, for k = 1, …, L, denote, respectively, an EGM and an ECG dataset, where M is the number of EGM leads available from the implant, N corresponds to the number of ECG leads and L is the size of the observation vectors. Surface ECG signal synthesis can thus be modeled as follows:
(1)

In other words, the ECG is supposed to be the output of an unspecified non linear function inline-graphic halms747058ig1 driven by the EGM, corrupted by an additive noise b(k)=[b(1, k), …, b(N, k)]T.

The estimation of inline-graphic halms747058ig1 can be performed by a classical two-step procedure, including a training step and a synthesis step as depicted in Figure 1.

1. Training step The objective of this step is to identify the transfer function inline-graphic halms747058ig1, specific to each patient, by using a couple of learning datasets s1(k) (M EGM leads) and x1(k) (N ECG leads), of length L 1. N different Multi-In Single-Out (MISO) systems (or transfer functions), inline-graphic halms747058ig2, inline-graphic halms747058ig3, …, inline-graphic halms747058ig4, between the M-rows input vector s 1(k) and each row of the output vector x 1(k), namely x 1(i, k), are identified. Typically, the training step can be performed during the implantation, or whenever s 1(k) and x 1(k) can be simultaneously acquired.
2. Synthesis step It is devoted to the follow-up of the patient during a remote monitoring session, or during the regular in-hospital visits. In the latter case, a new EGM dataset s2(k), of length L 2, is continuously measured and acquired by the device and used to synthesize a surface ECG 2(i, k), by using the estimates inline-graphic halms747058ig5, for i = 1, …, N, such that:
(2)
The principle of the indirect method is depicted in Figure 2. The main difference relies on the computation of 3D representations of the surface (ECG3 D ) and intracardiac (EGM3 D ) electrical activities of the heart, from the available ECG and EGM signals, through linear transformations W ECG and W EGM , respectively. Transfer function inline-graphic halms747058ig6 will thus be estimated between EGM3 D and ECG3 D , which will be respectively denoted v s 1 (k) and v x 1 (k). This indirect method implies the application of a specific pre-processing approach for the training and synthesis steps.
1. Three-Dimensional representation of the cardiac electrical activity Contrary to the standard 12-lead ECG, the analysis based on beat loops has been found to i) better compensate the changes in the electrical axis caused by various extracardiac factors [10], such as respiration, body position, electrode positioning, and so forth, ii) give a compact representation of the cardiac electrical activity, minimizing storage needs, and iii) provide a solution to the time synchronization problem which arises in cardiac data.

In [5], we evaluated four different approaches to perform the calculation of the EGM3 D and the ECG3 D : Principal Component Analysis (PCA) [12], Robust Principal Component Analysis (RobPCA) [13], Independent Component Analysis based on Second Order statistics (ICASO) [14] and Independent Component Analysis based on Fourth Order statistics (ICAFO) [15]. We showed that the results can be considered equivalent and that a classical PCA is a satisfactory solution. PCA has thus been retained in this work.

In both cases (direct and indirect methods), the transfer function is modeled as a nonlinear function, based on a Time Delay artificial Neural Network (TDNN). It is well-known that feed-forward Artificial Neural Networks (ANN) with an input layer, a single hidden layer, and an output layer may be used as universal function approximators, under very general conditions for the activation functions [11,16]. TDNN are a particular implementation of feed-forward ANN, in which delayed versions of the input signals are presented at the input layer of the network. TDNN have thus an extended capability for time series processing, with respect to feed-forward ANNs, since they include a representation of the d past samples of each input signal. In this work, each TDNN is defined with an input layer of N I = N × d samples, a hidden layer of N H neurons with a sigmoid activation function and one linear output neuron. The implementation is based on the approach proposed by D. MacKay [17], to improve the generalization of the procedure and to avoid overfitting.
D. Linear approach
The performance provided by the two TDNN-based approaches will be compared with a classical linear approach. Indeed, the transfer function between an input y(k) and an output z(k) is commonly supposed to be a linear Wiener filter h, such that:
(6)
where * is the convolution operation and h(k) = [h(0), …, h(Lh ))] is the impulse response of a linear time invariant filter of length Lh . Least square estimation of h leads to the classical relation:
(7)
where Rzz is the autocorrelation matrix of the output, Rzy is the intercorrelation matrix between the output and the input and is the estimate of h. Several implementations, block or recursive, can be applied to find the optimal estimator of h. In the recursive way, Least Mean Square (LMS), Normalized Least Mean Square (NLMS) or Recursive Least Square (RLS) algorithms have been proposed. Best results were obtained with the RLS approach, moreover known to exhibit fast convergence and the exact implementation of the block form.

In the direct case, the filter represents the transfer function between EGM and ECG signals (as in [9]). During the training step, N MISO filters have to be computed, between the input EGM vector s 1(k) (y(k) = s 1(k)) and each row of the output ECG vector x 1(k) (z(k) = x 1(i, k) for i = 1, …, N). In the indirect case, the filter represents the transfer function between EGM3 D and ECG3 D signals [4]. During the training step, three MISO filters have to be computed, between the input EGM3 D vector v s 1 (k) (y(k) = v s 1 (k)) and each row of the output ECG3 D vector v x 1 (k) (z(k) = v x 1 (i, k) for i = 1, …, 3).

A dataset issued from 15 patients (P1 to P15) is used to evaluate the performance of the above-mentioned ECG reconstruction methods. The ECG and EGM were simultaneously recorded with a GE Cardiolab station during the implant of an ICD with an initial sampling rate equal to 1000 Hz and then subsampled at 128 Hz and low-pass filtered at 45 Hz. Each record of the database is composed of:

Three EGM configurations will be considered:

Each patient file has been segmented into two blocks: the first one, of length L 1, containing nt heartbeats of concurrent ECG and EGM signals, is used during the training step, and a different second block, of length L 2, with ns beats, is devoted to the synthesis step and performance evaluation. The 15 patient records have been classified into three categories, according to their beat morphologies (Fig. 3):

  • Type I: the surface ECG contains only one beat morphology (P1 to P10);
  • Type II: the surface ECG contains one prevailing beat morphology, with presence of ventricular ectopic beats (P11 to P13). Ectopic beats are only included in the testing set;
  • Type III: at least three different beat morphologies compose the ECG (P14 and P15). Each of them are present both in the training and testing sets.

The behavior of the non linear estimation methods depends on the values of the TDNN parameters, that are optimized on the training dataset in the first part of this section. In a second part, we present the results obtained on the synthesis dataset as a function of the EGM leads and the type of patients. We evaluate the two TDNN-based methods (direct and indirect) and compare the results with an RLS filter (direct and indirect). This leads to four different methods, denoted by:

The last part of this section presents the practical importance of the proposed methodology, in the real configuration of an implantable device. We study the effect of the reconstruction process on parameters extracted from the ECG and in a diagnosis purpose, since the ECG is also used in practice for the control of the device and for arrhythmia detection.

Performance of the two TDNN-based methods (direct and indirect) are highly dependent on the adjustment of the NN parameters: the delay d, the number of neurons NH and the length L 1 of the training set. The delay d has been optimized in a previous work to 50 ms (d = 4 samples) [18]. For the two others parameters, a bootstrap analysis is performed on the database. Mean Square Errors (MSE) between the 12 leads of the real ECG and the synthesized ECG have been computed as the function of the number of neurons NH ∈ {4, 6, …, 50} and the number of beats in the training set nt ∈ {2, 3, …, 10}. Due to length duration of the training data set, ten beats were the maximal possible value.

Figure 4 shows that the MSE decreases when the number of neurons NH increases, with a ’plateau’ phase above 20 neurons. Regarding the influence of the number of beats nt , Figure 5 shows that MSE decreases when this number is increasing. From these previous experiments and for the following, the number of neurons NH and the number nt of beats in the training set are tuned to 20 and 10 respectively.

B. Performance analysis of the synthesized ECG
The quality of the synthesized ECG obtained from the four methods has been evaluated by calculating the correlation coefficient between the synthesized ECG and the real ECG, for all the available patients, and for all EGM configurations. Results have been grouped by the type of recording, as depicted in Fig. 3 and are presented on Fig. 6.
1. Analysis of the results as a function of the recording type The highest performance is obtained for Type I (sinus rhythm), whatever the reconstruction method and the EGM configuration. Then still high correlation coefficients are obtained with patients of Type III, containing polymorphic beat sequences, which means that all the heartbeats are well estimated.

Regarding recordings of Type II, correlation coefficient values are lower. This is mainly due to the ectopic beat, not included in the training set, which is not correctly reproduced. It is important to notice that this is a particularly difficult case. This result shows that unfortunately none of the evaluated methods is able to estimate the true transfer function when using a limited number of beat morphologies in the training set. However, the TDNN approaches provide a slightly higher performance (see section IV.B.3).

2. Selection of the EGM configuration The best performances are obtained with the configuration ’Bip+Prox’ whatever the method. We can also observe that configuration ’Bip’ provides the lowest performances. Finally, results obtained with ’Prox’ are very interesting since they show that this configuration, using only two electrodes, allows to obtain almost the same results that the configuration ’Bip+Prox’, using four electrodes.
3. Comparison of the synthesis methods Considering the whole database, the best correlation coefficients are obtained with the D_TDNN method, in the configuration ’Bip+Prox’, leading to 0.99 for Type I, 0.84 for Type II and 0.95 for Type III. Furthermore, TDNN-based approaches provide globally the best correlation coefficients compared to the RLS-based approaches, but the gain in performance is limited for configurations ’Prox’ and ’Bip+Prox’. However, in the configuration ’Bip’, the difference between TDNN and RLS is more important (this case will be detailed in the next section).

In addition, D_TDNN provides, in most of the cases, slightly higher performances than I_TDNN. However, I_TDNN can be considered as a good compromise since its complexity is lower (identification of 3 transfer functions for I_TDNN and 12 transfer functions for D_TDNN) and its performances are equivalent.

C. Quantitative and qualitative performances in the real configuration of an implantable device
Up to now, we have considered the reconstruction for a research protocol. This section is now devoted to the real industrial case, where only 3 ECG leads are generally observed (and assumed sufficient during the follow-up process). Concerning the EGM, only 2 bipolar leads are often used in practice, which corresponds to a difficult configuration, as demonstrated in section IV-B2. By comparing three of the four methods (D_TDNN, I_TDNN and D_RLS) and by considering all the patients, a quantitative and a qualitative analysis have been performed on three ECG leads (I, II and V1), containing 10 heartbeats. Let us mention that in a previous work, we have shown that performances do not depend on the ECG lead which is synthesized [4].
1. A typical reconstruction example As an illustration, Figure 7 shows an example of lead I, synthesized with the three different methods. Results show the superiority of the TDNN-based approaches. RLS synthesizes a small high frequency reconstruction noise around each QRS of the patient of Type I and fails in reconstructing the three different morphologies of the patient of Type III. With the patient of Type II, sinus rhythm is still better estimated with the two TDNN approaches. However, results also show that the three methods fail to reconstruct the premature ventricular complexes (PVC), not included into the training set. Nevertheless, the synthesized beat is clearly different than the sinus beats and could be interpreted, by an automatic system, as a PVB.
2. Quantitative evaluation We propose now to evaluate the quality of the synthesized ECG by comparing feature values extracted from the real ECG and from the synthesized ECG. The objective is to appreciate if the reconstruction process modifies some of the main parameters generally measured on the ECG signal for diagnosis purposes. It is well-known that the QRS duration and amplitude of the R wave are part of such important features and moreover can be used for the adjustment of the implantable device. In the same manner, heart rate variability and measurement of the ST segment are important for patient follow-up, the ST segment being a crucial parameter for ischemia monitoring.

The RR interval, QRS durations and R wave and ST segment (measured 60 ms after J point and using as a reference the mean value of the PR interval) magnitudes have been extracted from real and synthesized ECGs using a software developed in our laboratory [19]. Figure 8 shows boxplots of the absolute error (median and interquartile ranges) measured for each beat between parameters obtained from the real ECG and the different synthesized ECG.

Results are globally in accordance with those reported in the previous section after computation of the correlation coefficient. As already stressed in Figure 7, performances obtained with D_TDNN and I_TDNN are generally close and superior to performances of D_RLS, for which the measured dispersion is always higher. Likewise, best results are obtained for Type I. However, differences of performances that were observed between Types II and III are less important. This might be explained by the reduced training set in Type III, which makes the generalization of the transfer function suboptimal. This point has a direct influence on the quality of the features extracted from the synthesized ECG. For instance, we observe that the RR interval is well reproduced, with median errors always lower than 5 ms (except for the RLS method on Type III), the D_TDNN method providing always the lowest dispersion. Regarding the errors in reproducing amplitude parameters (R wave and ST segment), median errors are still acceptable, with values around 50 μV. Finally, the estimation of the QRS duration produced median errors around 15 ms.

3. Qualitative evaluation In order to obtain a clinical appreciation of the results, beyond numerical results, we requested a cardiologist of the Rennes Hospital to perform a blinded qualitative evaluation of the synthesized ECG.

For each of the patients, 4 different ECG (the real ECG and three synthesized ECG using D_TDNN, I_TDNN and D_RLS), were shown to the cardiologist. He has been asked to mark, out of 10, each of the synthetic ECG, globally for the three leads, related to the real ECG, from two points of view:

  • The first should reflect the visual quality of the synthesized ECG. The whole signal has been analyzed: QRS morphology, P wave and also baseline.
  • The second should evaluate the capability to make the right diagnosis, the database containing several arrhythmias or conduction defects (PVC, bundle-branch blocks, atrial fibrillation, …).

Results are presented in Figure 9. Firstly, we observe a good agreement between scores obtained by the visual quality analysis and by the capability to give the right diagnosis (same classification of the methods). In addition, results are in accordance with conclusions of the quantitative analysis since no significant differences between the direct method (D_TDNN) and the indirect method (I_TDNN) were observed for the proposed TDNN approach. Again, performances of the two TDNN-based methods are judged superior to the D_RLS, for the three types of patients. As already reported in this communication, worse results are obtained with patients of Type II. Finally, it is worth to mention that scores of diagnosis are always higher than those of visual quality. In other words, even if the signal is not always correctly reconstructed, the diagnosis may be preserved.

This paper proposes a methodology to synthesize a standard 12-lead ECG from a set of EGM leads, for implanted patients, which is of main practical importance in cardiology. For this purpose, two methods, a direct and an indirect, based on a dynamic TDNN, are proposed and evaluated in this study. Experiments show that four main issues are of concern when performing ECG reconstruction. Firstly, a quantitative comparison, conducted on a database issued from 15 patients, shows that performances of both TDNN-based methods are equivalent and outperform the RLS approach. However, the direct method requires the learning of 12 transfer functions, whereas the indirect method only requires three transfer functions. It is worth to notice that, in practice, it is also possible to compute only eight transfer functions and derive the other four by calculations, since only two of the six extremity leads are independent. Whatever the approach direct or indirect, the transfer functions are patient specific and must be trained either during the implantation or the day just after by collecting as set of synchronous ECG and EGM, eventually by using the telemetric system of the device.

Secondly, interesting results were observed for patients with sinus rhythm or with polymorphic beats if all the beat morphologies are present in the training set (Types I and III). When ectopic complexes are not included in the training set (Type II), both methods show limitations on the reproduction of PVC. However, the synthesized morphologies still remain very different from sinus beats, preserving at least the possibility of identifying a pathological beat.

Thirdly, results have also been presented as a function of the number and the type of EGM leads. As expected, the best reconstructions are obtained using two bipolar and two proximal leads, but we also demonstrated that the two proximal electrodes provide almost the same results.

Finally, we have shown that the industrial configuration, using only two bipolar EGM, leads to quantitative and qualitative efficient performances. These interesting results suggest that the proposed TDNN-based reconstruction module could be embedded into an ICD.

It should be noted that the long term validity of this reconstruction relies on the hypothesis that the ECG morphology is stable over time. This hypothesis can be assumed to hold in healthy subjects and we recently verified this stability in different recording conditions [20,21]. However, this may not be the case in implanted patients, who may present ECG modifications as a consequence of cardiac remodeling, clinical evolution or transient phenomena (new medication, …). These factors represent the main limitations of any ECG reconstruction method from EGM data. One way to minimize the effects of these factors would be to re-estimate the transfer function during the required follow-up visits at the hospital.

This work also opens the way to new telecardiology challenges, such as the optimal management of cardiac rhythm pathologies, the follow-up of implanted patients [22,23], and the continuous control of implantable devices [24]. Indeed, recent studies have demonstrated the interest of a daily data transmission from the implantable device, through a wireless remote monitoring system, to improve the care of cardiac device recipients. It was also claimed in [25] that ambulatory monitoring of the electrocardiogram is an important complement to pacemaker follow-up, particularly about changes in the ST segment of the ECG.