Papers - SUGIURA, Komei
-
Mobile Manipulation Instruction Generation From Multiple Images With Automatic Metric Enhancement
K Katsumata, M Kambara, D Yashima, R Korekata, K Sugiura
IEEE Robotics and Automation Letters 2025
-
R Korekata, K Kaneda, S Nagashima, Y Imai, K Sugiura
Advanced Robotics, 1-16 (arXiv:2408.07910) 2025
-
M Kambara, K Sugiura
2025 19th International Conference on Machine Vision and Applications (MVA), 1-5 2025
-
NaiLIA: Multimodal Retrieval of Nail Designs Based on a Relaxed Loss
雨宮佳音, 小松拓実, 八島大地, 是方諒介, 勝又圭, 杉浦孔明
Proceedings of the 39th Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2025), 2Win555 2025
-
Interactive robot action replanning using multimodal llm trained from human demonstration videos
C Hori, M Kambara, K Sugiura, K Ota, S Khurana, S Jain, R Corcodel, ...
ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and …, 1-5 2025
-
Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images
S Nagashima, K Sugiura
Proceedings of the IEEE/CVF International Conference on Computer Vision … (arXiv:2508.07847) 2025
-
Everyday Object Retrieval from Images Containing Text Based on Crosslingual Visual Prompt
戸倉健登, 是方諒介, 小松拓実, 今井悠人, 杉浦孔明
Proceedings of the 39th Annual Conference of the Japanese Society for Artificial Intelligence (JSAI2025), 1Win452 2025
-
Pre-manipulation alignment prediction with parallel deep state-space and transformer models
M Kambara, K Sugiura
Advanced Robotics 39 (13), 806-816 2025
-
Takayuki Nishimura, Katsuyuki Kuyo, Motonari Kambara, Komei Sugiura
IROS, 9549-9556 2024
-
Trimodal Navigable Region Segmentation Model: Grounding Navigation Instructions in Urban Areas
N Hosomi, S Hatanaka, Y Iioka, W Yang, K Kuyo, T Misu, K Yamada, ...
IEEE Robotics and Automation Letters 9 (5), 4162-4169 2024
-
Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine
Kanta Kaneda, Shunya Nagashima, Ryosuke Korekata, Motonari Kambara, Komei Sugiura
9 (3), 2088-2095 2024
-
Co-scale cross-attentional transformer for rearrangement target detection
H Matsuo, S Ishikawa, K Sugiura
Advanced Robotics 38 (18), 1277-1286 2024
-
Cooperative Control of Multiple CAs
T Nagai, T Nakamura, K Sugiura, T Taniguchi, Y Suzuki, M Hirata
Cybernetic Avatar, 151-207 2024
-
Deneb: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
K Matsuda, Y Wada, K Sugiura
Proceedings of the Asian Conference on Computer Vision, 3570-3586 2024
-
Layer-Wise Relevance Propagation with Conservation Property for ResNet
S Otsuki, T Iida, F Doublet, T Hirakawa, T Yamashita, H Fujiyoshi, ...
European Conference on Computer Vision, 349-364 2024
-
Mask-Attention A3C: Visual Explanation of Action-State Value in Deep Reinforcement Learning
H Itaya, T Hirakawa, T Yamashita, H Fujiyoshi, K Sugiura
IEEE Access 12, 86553-86571 2024
-
Multimodal Target Localization with Landmark-Aware Positioning for Urban Mobility
N Hosomi, Y Iioka, S Hatanaka, T Misu, K Yamada, N Tsukamoto, ...
IEEE Robotics and Automation Letters 10 (1), 716-723 2024
-
Nearest neighbor future captioning: generating descriptions for possible collisions in object placement tasks
T Komatsu, M Kambara, S Hatanaka, H Matsuo, T Hirakawa, T Yamashita, ...
Advanced Robotics 38 (18), 1265-1276 2024
-
Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Yuiga Wada, Kanta Kaneda, Daichi Saito, Komei Sugiura
CVPR, 13559-13568 2024
-
Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning With Dense Labeling
D Yashima, R Korekata, K Sugiura
IEEE Robotics and Automation Letters 2024