Akio Hayakawa

Research engineer at Sony Research

Links

Google Scholar / Github (private account) / Github (company account) / Email

Educations

M.S. in Information Science and Technology, the University of Tokyo (2016/04 - 2018/03)
B.S. in Engineering, the University of Tokyo (2012/04 - 2016/03)

Work Experiences

R&D Center, Sony Corp. (2018/04 - 2023/03)
Sony Research Inc. (2023/04 - )

Publications

Christian Simon, Masato Ishii, Akio Hayakawa, Zhi Zhong, Shusuke Takahashi, Takashi Shibuya, Yuki Mitsufuji, “TITAN-Guide: Taming Inference-Time Alignment for Guided Text-to-Video Diffusion Models,” The International Conference on Computer Vision (ICCV), 2025 [arXiv] [project]
Masato Ishii, Akio Hayakawa, Takashi Shibuya, Yuki Mitsufuji, “A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation”, The IEEE International Joint Conference on Neural Network (IJCNN), 2025 [arXiv] [code]
Ho Kei Cheng, Masato Ishii, Akio Hayakawa, Takashi Shibuya, Alexander Schwing, Yuki Mitsufuji, “Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis,” The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [arXiv] [code]
Akio Hayakawa, Masato Ishii, Takashi Shibuya, Yuki Mitsufuji, “MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation”, The International Conference on Learning Representation (ICLR), 2025 [arXiv] [code]
Hiromichi Kamata, Yuiko Sakuma, Akio Hayakawa, Masato Ishii, Takuya Narihira, “Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion”, The 26th Meeting on Image Recognition and Understanding (MIRU), 2023 [arXiv]
Naoki Matsunaga, Masato Ishii, Akio Hayakawa, Kenji Suzuki, Takuya Narihira, “Fine-grained Image Editing by Pixel-wise Guidance Using Diffusion Models”, AI for Content Creation workshop (CVPRW AI4CC), 2023 [arXiv]
Akio Hayakawa, Jun Nishikawa, Masato Ishii, “任意の画像生成モデルに対する汎用リファイナーとしての拡散確率モデルの応用”, The 25th Meeting on Image Recognition and Understanding (MIRU), 2022
Takuya Narihira, Javier Alonsogarcia, Fabien Cardinaux, Akio Hayakawa, Masato Ishii, Kazunori Iwaki, Thomas Kemp, Yoshiyuki Kobayashi, Lukas Mauch, Akira Nakamura, Yukio Obuchi, Andrew Shin, Kenji Suzuki, Stephen Tiedmann, Stefan Uhlich, Takuya Yashima, Kazuki Yoshiyama, “Neural Network Libraries: A Deep Learning Framework Designed from Engineers’ Perspectives”, arXiv, 2021 [arXiv]
Naofumi Akimoto, Akio Hayakawa, Andrew Shin, Takuya Narihira, “Reference-based video colorization with spatiotemporal correspondence”, The 24th Meeting on Image Recognition and Understanding (MIRU), 2021 [arXiv]
Akio Hayakawa, Takuya Narihira, “Out-of-core training for extremely large-scale neural networks with adaptive window-based scheduling”, The 24th Meeting on Image Recognition and Understanding (MIRU), 2021 [arXiv]
Akio Hayakawa, Yusuke Kurose, Kiyohito Tanaka, Kento Aida, Shin’ichi Satoh, Masaru Kitsuregawa, Tatsuya Harada, “Gastric cancer detection for gastroenterological endoscopy with local and multi-scale global information”, The International Conference of Computer Assisted Radiology and Surgery (CARS), 2019

Patents

Yoshiyuki Kobayashi, Andrew Shin, Akio Hayakawa, Takayoshi Takayanagi, Hirotaka Suzuki, “Bias adjustment device, information processing device, information processing method, and information processing program”, US Patent App. 17/771,051, 2022

Prizes

Mitou Creator selected by Information-technology Promotion Agency (IPA), Japan, 2017 [link]

Invited presentations

“Tutorial on Diffusion Models” in The 29th Symposium on Sensing via Image Information (SSII) 2023 [slides]
“Tutorial on Diffusion Models and Recent Trends” in The 37th Annual Conference of Japanese Society for Artificial Intelligence (JSAI) 2023