Abstract
Human Activity Recognition (HAR) leveraging wearable sensors has emerged as a critical research area, with broad applications spanning healthcare, elderly assistance, sports analytics, and human-computer interaction. While traditional approaches based on Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks effectively extract local spatial and sequential temporal features from multi-channel sensor data, recent advances incorporate Transformer-based architectures whose attention mechanisms capture long-range temporal dependencies without recurrence. This paper introduces a novel multivariate Transformer model designed to integrate multiple physiological and kinematic data streams, including electrocardiogram (ECG), photoplethysmogram (PPG; wrist and finger, infrared/red), galvanic skin response (GSR), respiration, body temperature, three-axis acceleration, and gyroscope signals. Distinctively, the proposed architecture assigns a dedicated encoder to each stream, using multi-head attention and learnable positional encodings to handle signal diversity, differences in sampling frequency, and latency discrepancies. Evaluated on five experimental scenarios (rest, standing, sitting, running, and walking) segmented into uniform 30-second windows, the Transformer-based model achieved approximately 99% accuracy along with near-perfect sensitivity and F1-scores, demonstrating robustness and strong generalization capability.
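To make the segmentation step concrete, the sketch below illustrates how streams sampled at different rates can be cut into uniform 30-second windows of equal duration (but differing sample counts), so that each per-stream encoder consumes its own windows. Function and parameter names, and the example sampling rates, are illustrative assumptions, not the paper's code.

```python
# Hypothetical illustration: segment a 1-D sensor stream into uniform
# 30-second windows; names and rates are assumptions, not the paper's code.

def segment_windows(signal, sampling_rate_hz, window_seconds=30):
    """Split `signal` into non-overlapping windows of `window_seconds`.

    Streams sampled at different rates (e.g. ECG at 250 Hz, accelerometer
    at 50 Hz) yield windows with different sample counts but identical
    duration, so dedicated per-stream encoders can process them in parallel.
    """
    window_len = int(sampling_rate_hz * window_seconds)
    n_windows = len(signal) // window_len  # drop the trailing partial window
    return [signal[i * window_len:(i + 1) * window_len]
            for i in range(n_windows)]

# Example: 95 seconds of a 50 Hz accelerometer axis -> three 30 s windows
# of 1500 samples each (the trailing 5 s remainder is discarded).
stream = list(range(95 * 50))
windows = segment_windows(stream, sampling_rate_hz=50)
print(len(windows), len(windows[0]))  # 3 1500
```

In practice, each window from each stream would then be embedded and passed through its dedicated encoder, with learnable positional encodings supplying temporal order within the window.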