TY - GEN
T1 - HapticFormers
T2 - 28th IEEE Haptics Symposium, HAPTICS 2024
AU - Fahmy, Israa
AU - Hassan, Taimur
AU - Hussain, Irfan
AU - Werghi, Naoufel
AU - Seneviratne, Lakmal
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Detecting maturity of fruits and vegetables, especially avocados, is a critical task in modern agriculture and supply chain management. Moreover, the accurate assessment of maturity can improve the harvesting time and ensure consistent quality for consumers through the supply chain process. A key approach to achieving this is the non-destructive estimation of produce quality. Vision-Based Tactile Sensing (VBTS) technologies, which mimic human tactile perception, offer a novel approach to address this challenge. This paper focuses on the use of two notable VBTS technologies, GelSight and Facebook's DIGIT sensor. Using these technologies, we developed two novel datasets that assess the avocado maturity using the transformer models, marking a novel contribution in this area. We adapted several transformer architectures to the task, conducting experiments on both image classification and regression to estimate avocado firmness. Among the variants tested, the PoolFormer displayed notable results with accuracy of 92% in detecting avocado maturity level when used with tactile data. The datasets and code used in this study will be shared at this URL.
AB - Detecting maturity of fruits and vegetables, especially avocados, is a critical task in modern agriculture and supply chain management. Moreover, the accurate assessment of maturity can improve the harvesting time and ensure consistent quality for consumers through the supply chain process. A key approach to achieving this is the non-destructive estimation of produce quality. Vision-Based Tactile Sensing (VBTS) technologies, which mimic human tactile perception, offer a novel approach to address this challenge. This paper focuses on the use of two notable VBTS technologies, GelSight and Facebook's DIGIT sensor. Using these technologies, we developed two novel datasets that assess the avocado maturity using the transformer models, marking a novel contribution in this area. We adapted several transformer architectures to the task, conducting experiments on both image classification and regression to estimate avocado firmness. Among the variants tested, the PoolFormer displayed notable results with accuracy of 92% in detecting avocado maturity level when used with tactile data. The datasets and code used in this study will be shared at this URL.
KW - maturity classification
KW - self-attention block
KW - Vision Transformer (ViT)
KW - Vision-based tactile sensors (VBTS)
UR - http://www.scopus.com/inward/record.url?scp=85193541106&partnerID=8YFLogxK
U2 - 10.1109/HAPTICS59260.2024.10520836
DO - 10.1109/HAPTICS59260.2024.10520836
M3 - Conference contribution
AN - SCOPUS:85193541106
T3 - IEEE Haptics Symposium, HAPTICS
SP - 347
EP - 352
BT - 2024 IEEE Haptics Symposium, HAPTICS 2024
PB - IEEE Computer Society
Y2 - 7 April 2024 through 10 April 2024
ER -