Abstract
Due to the popularity and mobility of smart phones, phone-related pedestrian distracted behaviors, e.g., Texting, Game Playing, and Phone calls, have caused many traffic fatalities and accidents. As an advanced driver-assistance or autonomous-driving system, computer vision could be used to automatically detect distractions from cameras installed on the vehicle for useful safety intervention. The state-of-the-art method models this problem as a standard supervised learning method with a two-branch Convolutional Neural Network (CNN) followed by a voting on all image frames. In contrast, this paper proposes a new synthetic dataset named SYN-PPDB (448 synchronized video pairs of 53, 760 computer game images) for this research problem and models it as a transfer learning problem from synthetic data to real data. A new deep learning model embedded with spatial-temporal feature learning and pose-aware transfer learning is proposed. Experimental results show that we could improve the state-of-the-art overall recognition accuracy from 84.27% to 96.67%.
| Original language | English |
|---|---|
| Title of host publication | IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops |
| Place of Publication | usa |
| Publisher | IEEE Computer Society |
| Pages | 2975-2983 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781665448994 |
| DOIs | |
| State | Published - Jun 1 2021 |
| Event | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021 - Virtual, Online, United States Duration: Jun 19 2021 → Jun 25 2021 |
Conference
| Conference | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2021 |
|---|---|
| Country/Territory | United States |
| City | Virtual, Online |
| Period | 06/19/21 → 06/25/21 |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver