Publications

Thesis
Nelson Yalta, "Robot Audition Framework using Deep Learning Techniques", Doctoral Degree, ResearchGate, (Feb. 2020)
Nelson Yalta, "Enhancing Sound Source Localization and Separation Using Deep Learning Models for Robot Audition", Master Degree, (Feb. 2017)
Jorge Luis Gonzalez, Nelson Yalta, "Procesamiento Digital de Senales implementando HPRC (Digital Signal Processing implementing HPRC)", Bachelor Degree, (Feb. 2009)
Journals (Referred) | Peer-Reviewed
Nelson Yalta, Kazuhiro Nakadai, and Tetsuya Ogata, "Sound Source Localization Using Deep Learning Models", JRM Vol.29 No.1 (Feb. 20, 2017)
International Conference and Workshop (Referred) | Peer-Reviewed
Pin-Chu Yang, Mohammed Al-Sada, Chang-Chieh Chiu, Kevin Kuo, Tito Pradhono Tomo, Kanata Suzuki, Nelson Yalta, Kuo-Hao Shu, Tetsuya Ogata, "HATSUKI : An anime character like robot figure platform with anime-style expressions and imitation learning based action generation", Ro-MAN 2020. (accepted), arXiv
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Enrique Yalta Soplin, Tomoki Hayashi, and Shinji Watanabe, "ESPnet-ST: All-in-One Speech Translation Toolkit", Proc. ACL'20 (demo paper) (accepted), arXiv
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shnji Watanabe, Takenori Yoshimura, Wangyou Zhang, "A Comparative Study on Transformer vs RNN in Speech Applications", Proc. ASRU'19 (accepted), arXiv
Shigeki Karita, Nelson Yalta, Shinji Watanabe, Marc Delcroix, Atsunori Ogawa and Tomohiro Nakatani, "Improving Transformer Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration", Proc. Interspeech'19 (accepted), ISCA-Speech
Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata, "CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments", Proc. EUSIPCO'19 (accepted), arXiv, IEEEXplore
Nelson Yalta, Shinji Watanabe, Kazuhiro Nakadai and Tetsuya Ogata, "Weakly-Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation", IJCNN'19 (accepted), arXiv, IEEEXplore
Jaejin Cho, Murali Karthick Baskar, Ruizhi Li, Matthew Wiesner, Sri Harish Mallidi, Nelson Yalta, Martin Karafiat, Shinji Watanabe and Takaaki Hori, "Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling", Proc. SLT'18 (accepted), arXiv, IEEEXplore
Naoyuki Kanda, Rintaro Ikeshita, Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu, Xiaofei Wang, Vimal Manohar, Nelson Enrique Yalta Soplin, Matthew Maciejewski, Szu-Jui Chen, Aswin Shanmugam Subramanian, Ruizhi Li, Zhiqi Wang, Jason Naradowsky, L Paola Garcia-Perera, Gregory Sell, "The Hitachi/JHU CHiME-5 system: Advances in speech recognition for everyday home environments using multiple microphone arrays", The 5th International Workshop on Speech Processing in Everyday Environments (CHiME 2018), Interspeech'18, CHiME-5
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala and Tsubasa Ochiai, "ESPnet: End-to-End Speech Processing Toolkit", Proc. Interspeech'18 (accepted), arXiv, ISCA-Speech
Domestic Conference and Publications
Nelson Yalta, Takashi Sumiyoshi, Yohei Kawaguchi, "THE HITACHI DCASE 2021 TASK 3 SYSTEM: HANDLING DIRECTIVE INTERFERENCE WITH SELF ATTENTION LAYERS", DCASE (2021)
Shota Horiguchi, Nelson Yalta, Paola Garcia, Yuki Takashima, Yawen Xue, Desh Raj, Zili Huang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur, "The Hitachi-JHU DIHARD III system: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap", CoRR abs/2102.01363 (2021)
Nelson Yalta, Kazuhiro Nakadai, Tetsuya Ogata, "Sequential Deep Learning for Dancing Motion Generation", SIG-Challenge (2016)