IAS-LAB PUBLICATIONS

Clear

Deep Spiking Neural Network for Video-Based Disguise Face Recognition Based on Dynamic Facial Movements

Authors: Liu Daqi; Bellotto Nicola; Yue Shigang

Journal: IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Published: 2020

DOI: 10.1109/TNNLS.2019.2927274

With the increasing popularity of social media and smart devices, the face as one of the key biometrics becomes vital for person identification. Among those face recognition algorithms, video-based face recognition methods could make use of both temporal and spatial information just as humans do to achieve better classification performance. However, they cannot identify individuals when certain key facial areas, such as eyes or nose, are disguised by heavy makeup or rubber/digital masks. To this end, we propose a novel deep spiking neural network architecture in this paper. It takes dynamic facial movements, the facial muscle changes induced by speaking or other activities, as the sole input. An event-driven continuous spike-timing-dependent plasticity learning rule with adaptive thresholding is applied to train the synaptic weights. The experiments on our proposed video-based disguise face database (MakeFace DB) demonstrate that the proposed learning method performs very well, i.e., it achieves from 95% to 100% correct classification rates under various realistic experimental scenarios.

Volume: 31 Pages: 1843-1855

Keywords: Continuous learning; deep learning; event-driven spike-timing-dependent plasticity (STDP); spiking neural network (SNN); video-based disguise face recognition (VDFR);

ENRICHME: Perception and Interaction of an Assistive Robot for the Elderly at Home

Authors: Cosar Serhan; Fernandez-Carmona Manuel; Agrigoroaie Roxana; Pages Jordi; Ferland Francois; Zhao Feng; Yue Shigang; Bellotto Nicola; Tapus Adriana; Coşar Serhan; Ferland François

Journal: INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS

Published: 2020

DOI: 10.1007/s12369-019-00614-y

Recent technological advances enabled modern robots to become part of our daily life. In particular, assistive robotics emerged as an exciting research topic that can provide solutions to improve the quality of life of elderly and vulnerable people. This paper introduces the robotic platform developed in the ENRICHME project, with particular focus on its innovative perception and interaction capabilities. The project’s main goal is to enrich the day-to-day experience of elderly people at home with technologies that enable health monitoring, complementary care, and social support. The paper presents several modules created to provide cognitive stimulation services for elderly users with mild cognitive impairments. The ENRICHME robot was tested in three pilot sites around Europe (Poland, Greece, and UK) and proven to be an effective assistant for the elderly at home.

Volume: 12 Pages: 779-805

Keywords: Assistive robotics; Human–robot interaction; Robot perception;

Online learning for 3D LiDAR-based human detection: experimental analysis of point cloud clustering and classification methods

Authors: Yan Zhi; Duckett Tom; Bellotto Nicola

Journal: AUTONOMOUS ROBOTS

Published: 2020

DOI: 10.1007/s10514-019-09883-y

This paper presents a system for online learning of human classifiers by mobile service robots using 3D LiDAR sensors, and its experimental evaluation in a large indoor public space. The learning framework requires a minimal set of labelled samples (e.g. one or several samples) to initialise a classifier. The classifier is then retrained iteratively during operation of the robot. New training samples are generated automatically using multi-target tracking and a pair of “experts” to estimate false negatives and false positives. Both classification and tracking utilise an efficient real-time clustering algorithm for segmentation of 3D point cloud data. We also introduce a new feature to improve human classification in sparse, long-range point clouds. We provide an extensive evaluation of our the framework using a 3D LiDAR dataset of people moving in a large indoor public space, which is made available to the research community. The experiments demonstrate the influence of the system components and improved classification of humans compared to the state-of-the-art.

Volume: 44 Pages: 147-164

Keywords: 3D LiDAR-based tracking; Dataset; Human detection; Online learning; Point cloud segmentation;

ENRICHME: Perception and Interaction of an Assistive Robot for the Elderly at Home

Authors: Cosar Serhan; Fernandez-Carmona Manuel; Agrigoroaie Roxana; Pages Jordi; Ferland Francois; Zhao Feng; Yue Shigang; Bellotto Nicola; Tapus Adriana; Coşar Serhan; Ferland François

Journal: INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS

Published: 2020

DOI: 10.1007/s12369-019-00614-y

Recent technological advances enabled modern robots to become part of our daily life. In particular, assistive robotics emerged as an exciting research topic that can provide solutions to improve the quality of life of elderly and vulnerable people. This paper introduces the robotic platform developed in the ENRICHME project, with particular focus on its innovative perception and interaction capabilities. The project’s main goal is to enrich the day-to-day experience of elderly people at home with technologies that enable health monitoring, complementary care, and social support. The paper presents several modules created to provide cognitive stimulation services for elderly users with mild cognitive impairments. The ENRICHME robot was tested in three pilot sites around Europe (Poland, Greece, and UK) and proven to be an effective assistant for the elderly at home.

Volume: 12 Pages: 779-805

Keywords: Assistive robotics; Human–robot interaction; Robot perception;

Experimental Analysis of a Spatialised Audio Interface for People with Visual Impairments

Authors: Lock Jacobus C.; Gilchrist Iain D.; Cielniak Grzegorz; Bellotto Nicola

Journal: ACM TRANSACTIONS ON ACCESSIBLE COMPUTING

Published: 2020

DOI: 10.1145/3412325

Sound perception is a fundamental skill for many people with severe sight impairments. The research presented in this article is part of an ongoing project with the aim to create a mobile guidance aid to help people with vision impairments find objects within an unknown indoor environment. This system requires an effective non-visual interface and uses bone-conduction headphones to transmit audio instructions to the user. It has been implemented and tested with spatialised audio cues, which convey the direction of a predefined target in 3D space. We present an in-depth evaluation of the audio interface with several experiments that involve a large number of participants, both blindfolded and with actual visual impairments, and analyse the pros and cons of our design choices. In addition to producing results comparable to the state-of-The-Art, we found that Fitts’s Law (a predictive model for human movement) provides a suitable metric that can be used to improve and refine the quality of the audio interface in future mobile navigation aids.

Volume: 13

Keywords: active vision; audio interface; Fitts Law; guidance system; Visual impairment;

Robot perception of static and dynamic objects with an autonomous floor scrubber

Authors: Yan Zhi; Schreiberhuber Simon; Halmetschlager Georg; Duckett Tom; Vincze Markus; Bellotto Nicola

Journal: INTELLIGENT SERVICE ROBOTICS

Published: 2020

DOI: 10.1007/s11370-020-00324-9

This paper presents the perception system of a new professional cleaning robot for large public places. The proposed system is based on multiple sensors including 3D and 2D lidars, two RGB-D cameras and a stereo camera. The two lidars together with an RGB-D camera are used for dynamic object (human) detection and tracking, while the second RGB-D and stereo camera are used for detection of static objects (dirt and ground objects). A learning and reasoning module for spatial–temporal representation of the environment based on the perception pipeline is also introduced. Furthermore, a new dataset collected with the robot in several public places, including a supermarket, a warehouse and an airport, is released. Baseline results on this dataset for further research and comparison are provided. The proposed system has been fully implemented into the Robot Operating System (ROS) with high modularity, also publicly available to the community.

Volume: 13 Pages: 403-417

Keywords: Dataset; Human detection and tracking; Object and dirt detection; Robot perception; ROS; Spatial–temporal representation;

Continuous game theory pedestrian modelling method for autonomous vehicles

Authors: Camara Fanta; Cosar Serhan; Bellotto Nicola; Merat Natasha; Fox Charles W.

Journal: 21101046686

Published: 2020

Autonomous Vehicles (AVs) must interact with other road users. They must understand and adapt to complex pedestrian behaviour, especially during crossings where priority is not clearly defined. This includes feedback effects such as modelling a pedestrian’s likely behaviours resulting from changes in the AVs behaviour. For example, whether a pedestrian will yield if the AV accelerates, and vice versa. To enable such automated interactions, it is necessary for the AV to possess a statistical model of the pedestrian’s responses to its own actions. A previous work demonstrated a proof-ofconcept method to fit parameters to a simplified model based on data from a highly artificial discrete laboratory task with human subjects. The method was based on LIDAR-based person tracking, game theory, and Gaussian process analysis. The present study extends this method to enable analysis of more realistic continuous human experimental data. It shows for the first time how game-theoretic predictive parameters can be fit into pedestrians natural and continuous motion during road-crossings, and how predictions can be made about their interactions with AV controllers in similar real-world settings.

Pages: 1-20

Active preservation of analogue audio documents: A summary of the last seven years of digitization at CSC

Authors: Pretto Niccolò; Russo Alessandro; Bressan Federica; Burini Valentina; Rodà Antonio; Canazza Sergio

Journal: 21100985330

Published: 2020

During the last 40 years, a large number of musical works was composed and recorded at the Centro di Sonologia Computazionale (CSC) of the University of Padova. The problem of how to preserve this increasing amount of audio documents arose and, in order to meet this need, the CSC started to carry out a research to develop a scientific methodology for preservation. In the last 7 years, this methodology was refined and applied to the digitization of more than 2,500 audio documents both from Italian and international audio archives, mainly stored on analogue magnetic carriers such as open-reel tapes and cassettes. Their content consists of electronic, folk and other kinds of music, as well as speech recordings. The methodology provides for collecting numerous metadata about the original carrier on which the audio information was stored, i.e. flange diameter, brand and material, its physical state and the related preservation copy, necessary for a correct preservation of the content and the contextual information. This work aims to extract and interpret the information from this wealth of data and metadata, which was collected and structured through the software PSKit. This may be useful for audio technicians, archives and policy makers to plan future preservation projects.

Volume: 2020- Pages: 394-398

Acoustic Experiences for Cultural Heritage Sites: A Pilot Experiment on Spontaneous Visitors’ Interest

Authors: Vitale Vincenzo Norman; Olivieri Marco; Origlia Antonio; Pretto Niccolo; Roda Antonio; Cutugno Francesco; Pretto Niccolò; Rodà Antonio

Journal: CULTURE AND COMPUTING, C&C 2020

Published: 2020

DOI: 10.1007/978-3-030-50267-6_23

Providing technologies to support the visiting experience in cultural venues of artistic value is an important issue that needs to be addressed by considering the delicate nature of the places. Architectural heritage and visual arts are two valuable examples: the most sensible choice for augmenting the comprehension and the experience concerning this kind of cultural heritage is through audio cues (e.g., using audio guides). This work describes a pilot experiment to evaluate the impact of soundscapes and environmental acoustics reconstruction in the visitors’ experience. We proposed, to the visitors, an audioguide integrated with an optional experience presenting a comparison between a choir recorded in an anechoic chamber and the same choir as it would have sounded in the Church of the San Martino Charterhouse, using a preliminary acoustic survey. Experiments were conducted with real visitors to maximise the ecology of the collected data and results show that people are significantly interested towards the proposed experience, motivating further efforts to improve the quality and depth of the contents provided with this strategy.

Volume: 12215 Pages: 300-311

Keywords: Audioguide; Cultural heritage experience; Environmental acoustic reconstruction; Impulse response measurement;

Lung Ultrasound and Pulmonary Congestion During Stress Echocardiography

Authors: ITA; JACC. CARDIOVASCULAR IMAGING###1936-878X; Goal 3: Good health and well-being###25122; DQI-2361-2022; KVY-1424-2024; U-6951-2017; ETC-8873-2022; GCA-8341-2022; CMN-3938-2022; GIW-5359-2022; ISP-6364-2023; CAI-9702-2022; FWA-4381-2022; ERV-6809-2022; GFD-2228-2022; K-8027-2015; PHE-2297-2026; DUV-6459-2022; G-8642-2013; DTG-0942-2022; GHH-7471-2022; ABA-3543-2020; J-7174-2017; FZO-7738-2022; ELN-1565-2022; FOS-3534-2022; FZO-3399-2022; FKP-3459-2022; CUF-4965-2022; CNF-9050-2022; ABG-9721-2021; CRF-7941-2022; CLV-7955-2022; FXW-2768-2022; DCD-1773-2022; AAB-2223-2019; DYN-2947-2022; N-2282-2014; AGF-4358-2022; DWN-2198-2022; MEO-7169-2025; HYQ-3823-2023; GLT-2362-2022; Z-1305-2019; ICT-2404-2023; AAC-5268-2022; P-4267-2015; AAJ-2842-2021; DIQ-9726-2022; CLI-6997-2022; FWW-6498-2022; DUY-1900-2022; H-6449-2018; GCC-0228-2022; CFW-1959-2022; MTF-3609-2025; DPH-7277-2022; E-8121-2016; GDS-4570-2022; KQD-0450-2024; DNL-4934-2022; ABB-3006-2020; HNW-1712-2023; DHX-4733-2022; N-8144-2016; K-5306-2018; CLZ-2225-2022; AAX-9367-2020; CIO-4605-2022; DDF-0100-2022; JCR-0697-2023; 55929478400; 22939399700; 6602299243; 55663049600; 55612687400; 57192979152; 22939707600; 35452933600; 12772110800; 57200859205; 57003143600; 6701355424; 57207720679; 6508290354; 36197477200; 55884115100; 23391267500; 57218480094; 6602887127; 6701757460; 15921921800; 37066931100; 57218481743; 57201675703; 57191339958; 56112672500; 6602478787; 6701803166; 11840443000; 56194369000; 7006690054; 7005850087; 36478990700; 57203882101; 7402581476; 36943779100; 56109055800; 55499164600; 7102315827; 55206815100; 55238465000; 57201099810; 6603190525; 6507133552; 22834420800; 24290552500; 55959365300; 6603491191; 14055534600; 35956952500; 25121768000; 6603038461; 44261600800; 59158416100; 6701802597; 34572650500; 59432908700; 57210067725; 56377673100; 7003610825; 6701858723; 57221823607; 57964799700; 6507042094; 7003660393; 7003751506; 7103408229; 7102408994

Journal: JACC-CARDIOVASCULAR IMAGING

Published: 2020

DOI: 10.1016/j.jcmg.2020.04.020

Objectives: The purpose of this study was to assess the functional and prognostic correlates of B-lines during stress echocardiography (SE). Background: B-profile detected by lung ultrasound (LUS) is a sign of pulmonary congestion during SE. Methods: The authors prospectively performed transthoracic echocardiography (TTE) and LUS in 2,145 patients referred for exercise (n = 1,012), vasodilator (n = 1,054), or dobutamine (n = 79) SE in 11 certified centers. B-lines were evaluated in a 4-site simplified scan (each site scored from 0: A-lines to 10: white lung for coalescing B-lines). During stress the following were also analyzed: stress-induced new regional wall motion abnormalities in 2 contiguous segments; reduced left ventricular contractile reserve (peak/rest based on force, ≤2.0 for exercise and dobutamine, ≤1.1 for vasodilators); and abnormal coronary flow velocity reserve ≤2.0, assessed by pulsed-wave Doppler sampling in left anterior descending coronary artery and abnormal heart rate reserve (peak/rest heart rate) ≤1.80 for exercise and dobutamine (≤1.22 for vasodilators). All patients completed follow-up. Results: According to B-lines at peak stress patients were divided into 4 different groups: group I, absence of stress B-lines (score: 0 to 1; n = 1,389; 64.7%); group II, mild B-lines (score: 2 to 4; n = 428; 20%); group III, moderate B-lines (score: 5 to 9; n = 209; 9.7%) and group IV, severe B-lines (score: ≥10; n = 119; 5.4%). During median follow-up of 15.2 months (interquartile range: 12 to 20 months) there were 38 deaths and 28 nonfatal myocardial infarctions in 64 patients. At multivariable analysis, severe stress B-lines (hazard ratio [HR]: 3.544; 95% confidence interval [CI]: 1.466 to 8.687; p = 0.006), abnormal heart rate reserve (HR: 2.276; 95% CI: 1.215 to 4.262; p = 0.010), abnormal coronary flow velocity reserve (HR: 2.178; 95% CI: 1.059 to 4.479; p = 0.034), and age (HR: 1.031; 95% CI: 1.002 to 1.062; p = 0.037) were independent predictors of death and nonfatal myocardial infarction. Conclusions: Severe stress B-lines predict death and nonfatal myocardial infarction. (Stress Echo 2020–The International Stress Echo Study [SE2020]; NCT03049995)

Volume: 13 Pages: 2085-2095

Keywords: coronary artery disease; heart failure; lung ultrasound; stress echocardiography;