Paper Reading AI Learner

Moir'eTac: A Dual-Mode Visuotactile Sensor for Multidimensional Perception Using Moir'e Pattern Amplification

2025-09-16 06:09:43
Kit-Wa Sou, Junhao Gong, Shoujie Li, Chuqiao Lyu, Ziwu Song, Shilong Mu, Wenbo Ding

Abstract

Visuotactile sensors typically employ sparse marker arrays that limit spatial resolution and lack clear analytical force-to-image relationships. To solve this problem, we present \textbf{MoiréTac}, a dual-mode sensor that generates dense interference patterns via overlapping micro-gratings within a transparent architecture. When two gratings overlap with misalignment, they create moiré patterns that amplify microscopic deformations. The design preserves optical clarity for vision tasks while producing continuous moiré fields for tactile sensing, enabling simultaneous 6-axis force/torque measurement, contact localization, and visual perception. We combine physics-based features (brightness, phase gradient, orientation, and period) from moiré patterns with deep spatial features. These are mapped to 6-axis force/torque measurements, enabling interpretable regression through end-to-end learning. Experimental results demonstrate three capabilities: force/torque measurement with R^2 > 0.98 across tested axes; sensitivity tuning through geometric parameters (threefold gain adjustment); and vision functionality for object classification despite moiré overlay. Finally, we integrate the sensor into a robotic arm for cap removal with coordinated force and torque control, validating its potential for dexterous manipulation.

Abstract (translated)

视觉触觉传感器通常使用稀疏的标记阵列,这限制了空间分辨率,并且缺乏明确的力到图像的关系。为了解决这个问题,我们提出了**MoiréTac**,这是一种双模式传感器,它通过透明架构内的微栅格重叠生成密集的干涉图案。当两个栅格以不对齐的方式重叠时,它们会产生莫尔条纹,放大微观变形。这种设计在保持光学清晰度的同时用于视觉任务,并产生连续的莫尔场用于触觉感应,从而能够同时进行六轴力/扭矩测量、接触定位和视觉感知。 我们结合了来自莫尔图案的基于物理特性的特征(亮度、相位梯度、方向和周期)与深度空间特性。这些特性被映射到六轴力/扭矩测量上,通过端到端学习实现了可解释的回归分析。实验结果展示了三个能力:在所有测试轴上的力/扭矩测量 R^2 > 0.98;通过几何参数(三倍增益调整)进行灵敏度调节;即使存在莫尔图案覆盖的情况下也能实现物体分类的视觉功能。 最后,我们将传感器集成到机器人手臂上用于瓶盖去除操作,并实现了协调的力和扭矩控制,验证了其在灵巧操作中的潜力。

URL

https://arxiv.org/abs/2509.12714

PDF

https://arxiv.org/pdf/2509.12714.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot