Akio Hayakawa
Research engineer at Sony Research
Links
Google Scholar /
Github (private account) /
Github (company account) /
Email
Educations
- M.S. in Information Science and Technology, the University of Tokyo (2016/04 - 2018/03)
- B.S. in Engineering, the University of Tokyo (2012/04 - 2016/03)
Work Experiences
- R&D Center, Sony Corp. (2018/04 - 2023/03)
- Sony Research Inc. (2023/04 - )
Publications
- Masato Ishii, Akio Hayakawa, Takashi Shibuya, Yuki Mitsufuji, “A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation”, The IEEE International Joint Conference on Neural Network (IJCNN), 2025 [arXiv] [code]
- Ho Kei Cheng, Masato Ishii, Akio Hayakawa, Takashi Shibuya, Alexander Schwing, Yuki Mitsufuji, “Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis,” The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [arXiv] [code]
- Akio Hayakawa, Masato Ishii, Takashi Shibuya, Yuki Mitsufuji, “MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation”, The International Conference on Learning Representation (ICLR), 2025 [arXiv] [code]
- Hiromichi Kamata, Yuiko Sakuma, Akio Hayakawa, Masato Ishii, Takuya Narihira, “Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion”, The 26th Meeting on Image Recognition and Understanding (MIRU), 2023 [arXiv]
- Naoki Matsunaga, Masato Ishii, Akio Hayakawa, Kenji Suzuki, Takuya Narihira, “Fine-grained Image Editing by Pixel-wise Guidance Using Diffusion Models”, AI for Content Creation workshop (CVPRW AI4CC), 2023 [arXiv]
- Akio Hayakawa, Jun Nishikawa, Masato Ishii, “任意の画像生成モデルに対する汎用リファイナーとしての拡散確率モデルの応用”, The 25th Meeting on Image Recognition and Understanding (MIRU), 2022
- Takuya Narihira, Javier Alonsogarcia, Fabien Cardinaux, Akio Hayakawa, Masato Ishii, Kazunori Iwaki, Thomas Kemp, Yoshiyuki Kobayashi, Lukas Mauch, Akira Nakamura, Yukio Obuchi, Andrew Shin, Kenji Suzuki, Stephen Tiedmann, Stefan Uhlich, Takuya Yashima, Kazuki Yoshiyama, “Neural Network Libraries: A Deep Learning Framework Designed from Engineers’ Perspectives”, arXiv, 2021 [arXiv]
- Naofumi Akimoto, Akio Hayakawa, Andrew Shin, Takuya Narihira, “Reference-based video colorization with spatiotemporal correspondence”, The 24th Meeting on Image Recognition and Understanding (MIRU), 2021 [arXiv]
- Akio Hayakawa, Takuya Narihira, “Out-of-core training for extremely large-scale neural networks with adaptive window-based scheduling”, The 24th Meeting on Image Recognition and Understanding (MIRU), 2021 [arXiv]
- Akio Hayakawa, Yusuke Kurose, Kiyohito Tanaka, Kento Aida, Shin’ichi Satoh, Masaru Kitsuregawa, Tatsuya Harada, “Gastric cancer detection for gastroenterological endoscopy with local and multi-scale global information”, The International Conference of Computer Assisted Radiology and Surgery (CARS), 2019
Patents
- Yoshiyuki Kobayashi, Andrew Shin, Akio Hayakawa, Takayoshi Takayanagi, Hirotaka Suzuki, “Bias adjustment device, information processing device, information processing method, and information processing program”, US Patent App. 17/771,051, 2022
Prizes
- Mitou Creator selected by Information-technology Promotion Agency (IPA), Japan, 2017 [link]
Invited presentations
- “Tutorial on Diffusion Models” in The 29th Symposium on Sensing via Image Information (SSII) 2023 [slides]
- “Tutorial on Diffusion Models and Recent trends” in The 37th Annual Conference of Japanese Society for Artificail Intelligence (JSAI) 2023