Biomedical
Multi-Expert Indicators for Monitoring and Decision Support: Biomedical, Personalized Medicine, and Personal Wellness Applications
2.1 Generalized Heart Rate Variability (HRV) Analysis
2.3 Universal Indicators for Robust Detection of Multiple Abnormalities
2.4 Robust Detection of Emerging and Transient Abnormalities
2.5 Real-Time Prediction of Acute and Critical Events
2.6 Rare Psycho-Physiological States and Pathologies: Robust Detection and Quantitative Description
2.8 Robust Quantitative Monitoring in Personal Wellness and Personalized Medicine Applications
3.1 Diagnostics and Monitoring of Neuro-Degenerative Diseases from Gait Dynamics
3.2 Remote Biometrics Based on Gait Dynamics
4.1 Multi-Type Complexity Indicators for Diagnostics and Real-Time Prediction
5.1 Multi-Type Complexity Indicators for Diagnostics
6. BIOINFORMATICS APPLICATIONS
6.1 Multi-Type Complexity Analysis of Coding and Noncoding DNA Sequencies
7.1 Multi-Type Spatial Complexity Indicators for Diagnostics
Introduction
EKG Time Series Analysis

The majority of diagnostic features used by cardiologists are based on local patterns extracted from electrocardiogram (ECG) waveforms. However, these features may not be effective in the early stage of developing cardiac abnormalities as well as for clinically significant pathologies lacking specific ECG signatures.
Human organism is a complex adaptive system. Signal variability analysis provides a generic non-invasive technology for evaluation of the overall properties of a complex system. The association between altered variability and illness is ubiquitous. One of the most common applications of this principle is heart rate variability (HRV) analysis which is known to play an important role in cardiac diagnostics. HRV indicators calculated from beat-to-beat (RR) time series offer not only a complementary diagnostic modality but also potential detection of subtle dynamical changes in the early stages of emerging pathologies. Analysis of RR data is also significantly more noise-tolerant compared to waveform analysis. This is especially important for data collected by portable and wearable systems.
Traditional diagnostics could typically reveal only the known localized patterns while information from long-range multi-scale correlations in the dynamics of ECG or other physiological time series is ignored. However, measures based on such subtle changes in dynamics may be sensitive indicators of an emerging abnormality or hard-to-detect pathology. HRV analysis offers a set of measures that are sensitive to such changes in heart rate dynamics and can provide complementary and alternative insight in cardiac diagnostics
Although variability analysis combines many desirable characteristics, sensitivity of linear HRV indicators to data artifacts and non-stationarity, as well as long-period requirements for more accurate and stable nonlinear dynamics (NLD) measures lead to many challenges in practice. Nevertheless, in our recent works we have demonstrated that NLD indicators could preserve significant part of their discriminative abilities even for short periods (down to several minutes). More importantly, we have shown that accuracy of normal-abnormal classifiers based on different NLD and advanced linear measures could be significantly increased by boosting-like ensemble learning methods.
We found that it is feasible to discriminate between normal condition and multiple abnormalities using two-class classification framework. Such an approach is tolerant to training data with vaguely specified or non-specific diagnoses, data incompleteness for certain well-known and novel abnormalities, and to complex cases of co-existing pathologies. Our approach allows discovering generic meta-indicators capable of detecting both single abnormalities irrespective of their origin, and conditions specified by complex combination of different pathologies using short RR time series.
Such multi-expert meta-indicators are also effective for detailed and robust quantification of various psycho-physiological states. We found that these state representations could be used as early signals of emerging abnormalities and other negative physiological changes (e.g., overtraining in professional sports) as well as for real-time prediction of acute and critical events.
Discriminative abilities of the normal-abnormal classifiers can be represented by plotting detection rates vs false alarm rates to obtain ROC (Receiver Operating Characteristic) curve: the larger is ROC-curve�s shift towards the upper left corner from the diagonal, the better is classifier. In all subsequent illustrations ROC curves are used to compare different HRV indicators.

Arrhythmia is very serious cardiac abnormality that could also accompany other cardiac pathologies. As shown in the figure on the left (click to enlarge), for low false alarm rates (<10-15%), arrhythmia detection rate of the AQSCS multi-expert indicator could be up to 40-50% higher than that of the typical single NLD indicator. Presented indicators are calculated on short, 256-beat, RR time series.
Indicators capable of detecting developing CHF and arrhythmia signatures during regular express diagnostic procedures could be of significant practical importance. For example, hypertrophic cardiomyopathy (HCM) is hard-to-detect cardiac disease that is a leading cause of sudden death in young athletes. Although different in nature, HCM shares common diagnostics features with CHF and is often accompanied by arrhythmias. Therefore, indicators capable of robust detection of CHF and arrhythmias may provide early warning signs of developing HCM and could be very useful for express diagnostics of athletes from short segments of RR data.
UNIVERSAL INDICATORS FOR ROBUST DETECTION OF MULTIPLE ABNORMALITIES
AQSCS multi-expert meta-indicators are universal in nature. Such multi-expert indicators are capable of detecting a wide range of different cardiac abnormalities as well as conditions with complex combination of different pathologies. ROC curves for one of such meta-indicators (in the figure on the left) demonstrate such universal capabilities. It is clear that for the acceptable false alarm rates (<20%), detection rates for a number of cardiac pathologies and complex abnormal conditions could be in the range of 80-100%. Presented AQSCS meta-indicator is calculated on short, 256-beat, RR time series.
ROBUST DETECTION OF EMERGING AND TRANSIENT ABNORMALITIES
ROC curves presented above demostrate advantages of AQSCS multi-expert measures over indicators based on single NLD measures. However, ROC curves cannot explicitely show how such advantages would be manifested in the real applications dealing with analysis of EKG time series. These applications include critical real-time analysis of the data collected by intensive care unit (ICU) equipment or portable wearable monitors as well as automated analysis of long time series collected by Holter monitors. Indicators for such critical analysis should demonstrate high detection rate of even short-lived or emerging abnormalities. To provide any practical utility, these indicators should also have low false alarm rate.
In the adjacent figure (click to enlarge), we plot indicator values computed from short consequitive RR segments of a single healthy person. Medians of indicator value distribution for population of normal (healthy) subjects as well as patients with CHF and arrhythmia are plotted as horizontal lines. It is clear that single NLD indicator values (two lower panels) are quite noisy and jumping from normal median to CHF/arrhythmia medians and back. This will result in high false alarm rates which critically diminishes practical utility of such indicators. In contrast, AQSCS multi-expert indicator (upper panel) is much less noisy and demonstrates significant stability by staying in the vicinity of normal median where it should be.
As discussed and illustrated in the above figure, AQSCS multi-expert approach can drastically reduce noise of individual HRV indicators computed from short RR segments. This makes possible detection of even short-lived and/or emerging abnormalities. Real-data illustration is provided in the figure on the left (click to enlarge), where 4 short 512-beat arrhythmia RR segments (labeled by red “A” letter) are inserted in the RR time series of the normal (healthy) subject. We see that both single HRV indicators (two lower panels) generate several false-alarm signals, i.e. distincly low indicator values where no arrhythmia is present. However, AQSCS multi-expert indicator (upper panel) clearly detects all arrhythmia events without any false alarm generation.
Robustness of the AQSCS multi-expert indicator in detecting short lived and intermittent abnormalities is further ilustrated in the figure on the left (click to enlarge). Here 4 short arrhythmia RR segments (labeled by red “A” letter) with decreasing length (from 512-beat down to 128-beat) are inserted in the RR time series of the normal (healthy) subject. AQSCS multi-expert indicator clearly detects even the shortest arrhythmia events without any false alarm generation.
REAL-TIME PREDICTION OF ACUTE AND CRITICAL EVENTS

Reliable prediction of AF, spontaneous ventricular fibrillation (VF) and tachycardia (VT), and other acute cardiac events is very important for applications in smart pacers and defibrillators. This set of problems is more challenging than diagnostics or detection of slowly emerging abnormalities. Prediction of an acute event involves detection of subtle and transient precursors before the actual state change where effective data analysis could often be limited to short segments before the event. In addition, the amount of reliable data for real life-threatening events is often more limited compared to data for different pathologies or other persistent physiological conditions. However, our analysis indicates that AQSCS multi-expert classifier trained on multi-abnormality data could be an effective predictor of acute cardiac events. It can be used without any additional re-training on the limited data containing examples of such events.
As an example we consider application of AQSCS multi-expert indicator to data from patients with paroxysmal atrial fibrillation (PAF). Half of PAF records immediately precedes atrial fibrillation (AF) event (PAF_A) and the other half is taken far from any such event (PAF_N). In the figure above (click to enlarge), AQSCS meta-indicator outputs on all PAF_A and PAF_N 512-beat overlapping RR segments with different distances to AF event start are computed and averaged. It is clear that separation between these classes increases closer to AF event. Similar results are obtained for VT and VF event prediction using spontaneous ventricular tachyarrhythmia data RARE PSYCHO-PHYSIOLOGICAL STATES AND PATHOLOGIES:
ROBUST DETECTION AND QUANTITATIVE DESCRIPTION

Diagnostics of complex and rare medical cases lacking clear symptoms of particular abnormalities is a very challenging problem. Case-based reasoning (CBR) is known to provide helpful and generic guidelines for the design of medical expert systems dealing with such non-standard diagnostic problems. Similarly, most practitioners implicitly follow case-based reasoning framework when they try to apply previously encountered cases to similar new situations.
Single-example learning (SEL) algorithms offer more formal machine learning framework for classification of rare and novel classes. SEL is a group of machine learning methods aimed at learning classifiers for novel classes by generalization from just one or a few training examples. Usually they use prior knowledge obtained previously while learning other classes from large databases.
However, both CBR and SEL approaches have serious limitations when applied to biomedical problems, the most important of which is requirement of a large number of examples of well-studied cases. In practice, data sets for well-defined abnormalities suited for quantification with existing indicators are often limited. However, significant amount of valuable clinical information from cases labeled only as normal or abnormal without particular diagnosis remains underutilized.
These coarsely classified data can be effectively employed to produce powerful normal-abnormal multi-expert classifiers. This is achieved by an optimal weighted combination of complementary indicators which are experts in different regimes of the considered biological complex system. Therefore, partial information of wide variety of dynamical regimes becomes implicitly encoded in the obtained ensemble of classifiers. However, only aggregated output is used for normal-abnormal classification, while the rich internal structure of the ensemble is completely ignored.
Extraction of this underutilized knowledge could be formalized in terms of ensemble decomposition learning (EDL)which was recently proposed by AQSCS team. Representation of complex and rare cases by the vector output of the ensemble of classifiers each element of which is an output of an individual classifier multiplied by its weight could prove to be more accurate and robust compared to significantly more coarse-grained representation typical for CBR and SEL approaches.
In the figure above, EDL application to real data is demonstrated. We show arrhythmia-CHF classification ROC curves for classifiers based on EDL, aggregated ensemble (classical ensemble approach), and single indicators. We see that, SEL-based approach in the EDL context is capable to construct quite accurate classifier for the two classes that are almost indistinguishable when standard aggregated ensemble output or single HRV indicators are used. It should be noted that presented SEL classifiers are based on just one reference example which suggests suitability of this approach for diagnostic of complex and rare cases characterized by extreme limitation of the available data.
PROFESSIONAL SPORT AND ACTIVE LIFESTYLE APPLICATIONS:
EXPRESS DIAGNOSTICS, OVERTRAINING DETECTION, AND QUANTITATIVE DESCRIPTION OF INDIVIDUAL ZONES OF OPTIMAL FUNCTIONING (IZOF)
Importance of systematic and objective psycho-physiological monitoring of athletes in professional sport constantly increases. Express medical diagnostics of emerging cardiac abnormalities, early detection of overtraining and similar negative changes in psycho-physiological state of athletes critically depend on the quality of such monitoring. Related challenging problem without existing universal solution is robust identification of the individual zones of optimal functioning (IZOF). Tuning an athlete into his/her own IZOF could dramatically improve stability of his/her performance and increase probability of the highest achievements. However, it is difficult to quantify such optimal zones.
Modern sport watches with ECG-type sensors and similar sensors working with smart phones can be used not only for programming personalized training sessions but also for simultaneous collection of beat-to-beat (RR) time series with accuracy comparable to clinical ECG equipment. Such personal RR data can be used for systematic heart rate variability (HRV) analysis to provide early indication of developing cardiac abnormalities, overtraining detection and for other purposes.
HRV-based indicators may offer fast and convenient detection of overtraining. This approach could provide an alternative for much less convenient and time-consuming psycho-physiological evaluations of overtraining currently adopted. However, individual HRV measures could often produce ambiguous results in practice. In contrast, quantification based on AQSCS multi-expert indicators could offer much more stable and practical solution. Moreover, AQSCS multi-expert framework can be also used for detection and quantification of optimal psycho-physiological states. The concept of IZOF introduced in sport psychology revealed multi-featured (multi-dimensional) nature of personal psycho-physiological states associated with the best performance.Therefore, while it is difficult to quantify such optimal zones using existing approaches, multi-component representation offered by AQSCS framework could be well-suited for this task.
Ability of AQSCS multi-expert measures to quantify psycho-physiological state changes are illustrated in the figure above. Here we analyzed RR data collected by Polar RS800 sport watch from one of the members of the “Rus’-Sport” rally team (Russian Federation). AQSCS multi-expert indicators (upper panel) and single NLD indicator (lower panel) are computed on short 256-beat RR segments. The analyzed states include: (1) base activity (sitting, standing, slow walk), (2) driving a car (regular home-office driving), (3) real rally training as co-pilot, and (4) running excersise. Distributions of a single indicator values are significantly overlapped which drastically reduces capabilities of discrimination among presented psycho-physiological regimes. On the other hand, AQSCS multi-expert measure provides clear and expected differentiation among these different states. Indeed, the number of stressful external events and associated emotional reactions is expected to be more during driving than in the base state. Even more such events and significant emotional reactions can occur during rally training. In accordance with this intuition, distribution of AQSCS multi-expert measure include more and more samples with lower indicator values and average indicator value shifts down. It is also known, that during physical excercise (e.g., running) mechanisms of HRV regulations are known to be quite different which causes HRV indicator shift towards “abnormal” range even for healthy subjects.
ROBUST QUANTITATIVE MONITORING IN PERSONAL WELLNESS AND PERSONALIZED MEDICINE APPLICATIONS
Unique capabilities of AQSCS multi-expert indicators could be effectively utilized in the emerging field of personalized medicine. Direct optimization of personal treatment based on genotype is not effective for abnormalities related simultaneously to multiple genes and influenced by history of non-genomic factors. Even more challenging is to personalize and objectively evaluate alternative therapies and relaxation techniques. For example, biofeedback have been shown to be effective not only in clinical settings but also in achieving optimal psycho-physiological states in professional sport and other areas. It is difficult to choose an optimal protocol since noticeable effects are hard to detect in early stages without robust quantitative indicators that are sensitive to small changes of psycho-physiological states. In general, early detection of subtle positive or negative changes caused by a particular treatment is a very important and challenging task in personalization and optimization of any medical treatment strategy.
AQSCS multi-expert measures could effectively detect emerging as well as short-lived and intermittent changes of the physiological state. This capability is illustrated in the above figure, where physilogical states before and during Chi meditation are quantified by AQSCS multi-expert indicator (upper panel) and by single NLD measure (lower panel). For each subject, indicators are computed on several 256-beat RR segments. AQSCS meta-measure clearly shows the difference between “before” and “during” meditation states. On the other hand, single indicator values are significantly overlaping, making such distinction between two states much less obvious.
Gait Time Series Analysis
DIAGNOSTICS AND MONITORING OF NEURO-DEGENERATIVE DISEASES FROM GAIT DYNAMICS




On the other hand, as evident from figure on the left (click to enlarge), aggregated output of the AQSCS multi-complexity ensemble indicator shows clear trend towards gait dynamics of healthy adults as age increases. Remember that the range of probability-like output is between -1 and +1, with healthy state corresponding to positive numbers. Therefore, aggregated output of multicomplexity ensemble measures could be a promising metrics for early detection of subtle and/or slow changes in gait dynamics.
REMOTE BIOMETRICS BASED ON GAIT DYNAMICS
Recently, gait recognition has become a topic of interest within the computer vision applications, due to its growing importance as a biometric modality. Considerable research efforts are being devoted to characterize and extract gait dynamics automatically from video. Biometric systems for human identification at a distance are in increasing demand in various real-life applications. Many biometric modalities such as face recognition, iris, fingerprints, palm prints, and hand geometry suffer from two main disadvantages: (1) high failure rate when only low resolution images and pictures taken at a distance are available, and (2) necessity of subject cooperation for accurate results. For these reasons, innovative biometric recognition methods for human identification at a distance have significant potential and appeal for surveillance and forensic applications.
Most of existing approaches require significant amount of details to be extracted from sequential video images. Unfortunately, in many practical cases the quality of available video may be poor due to insufficient lighting, covering of motion details by cloth, and other reasons. In such circumstances, one of the gait characteristics that could still be extracted is time period between consecutive steps.
Therefore, AQSCS multi-complexity ensemble measures for gait time series analysis could be potentially employed as complementary approach in biometric applications, especially in cases where quality of the video prohibits usage of more traditional techniques. Besides remote gait-based biometrics from video sequences, gait recognition from accelerometer data, that are currently available in all standard smartphones and other wearable devices, becomes comparably important. Indeed, remote acquisition of accelerometer time series could be used to identify the person currently carrying the phone and in related applications. Again, AQSCS multi-complexity measures could be applicable to this set of gait-based biometric problems.
Our initial results indicate that AQSCS multi-complexity ensemble measures could differentiate between different individuals using their stride interval time series. This suggests applicability of our approach to gait-based biometrics problems discussed above. Our conclusions are based on the analysis of gait time series from 10 healthy subjects available at http://www.physionet.org and from our proprietary data sets.
EEG TIME SERIES ANALYSIS
MULTI-TYPE COMPLEXITY INDICATORS FOR DIAGNOSTICS AND REAL-TIME PREDICTION
To be done
To be done