Computation and Language · Computer Science
Streaming Punctuation for Long-form Dictation with Transformers
Piyush Behre, Sharman Tan, Padma Varadharajan, Shuangyu Chang
2022-12-07
Computation and Language · Computer Science
Streaming Punctuation: A Novel Punctuation Technique Leveraging Bidirectional Context for Continuous Speech Recognition
Piyush Behre, Sharman Tan, Padma Varadharajan, Shuangyu Chang
2023-01-11
Computation and Language · Computer Science
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers
Jian Xue, Peidong Wang, Jinyu Li, Matt Post +1
2022-07-05
Computation and Language · Computer Science
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Chengwei Qin +3
2024-05-24
Computation and Language · Computer Science
Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement
Tuan-Nam Nguyen, Ngoc-Quan Pham, Seymanur Akti, Alexander Waibel
2025-06-23
Computation and Language · Computer Science
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding
Yuchen Liu, Jiajun Zhang, Hao Xiong, Long Zhou +4
2019-12-17
Computation and Language · Computer Science
Reducing the gap between streaming and non-streaming Transducer-based ASR by adaptive two-stage knowledge distillation
Haitao Tang, Yu Fu, Lei Sun, Jiabin Xue +7
2023-06-28
Audio and Speech Processing · Electrical Eng. & Systems
Two-Pass End-to-End ASR Model Compression
Nauman Dawalatabad, Tushar Vatsal, Ashutosh Gupta, Sungsoo Kim +3
2022-01-11
Computer Vision and Pattern Recognition · Computer Science
STAR: Scale-wise Text-conditioned AutoRegressive image generation
Xiaoxiao Ma, Mohan Zhou, Tao Liang, Yalong Bai +4
2025-02-20
Audio and Speech Processing · Electrical Eng. & Systems
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong Huang, Wenchao Hu, Yu Ting Yeung, Xiao Chen
2020-08-14
Audio and Speech Processing · Electrical Eng. & Systems
Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition
Niko Moritz, Ruiming Xie, Yashesh Gaur, Ke Li +4
2024-12-23
Computation and Language · Computer Science
Streaming Simultaneous Speech Translation with Augmented Memory Transformer
Xutai Ma, Yongqiang Wang, Mohammad Javad Dousti, Philipp Koehn +1
2020-11-03
Machine Learning · Computer Science
Transformers from Compressed Representations
Juan C. Leon Alcazar, Mattia Soldan, Mohammad Saatialsoruji, Alejandro Pardo +3
2025-10-30
Computation and Language · Computer Science
Segmentation-Free Streaming Machine Translation
Javier Iranzo-Sánchez, Jorge Iranzo-Sánchez, Adrià Giménez, Jorge Civera +1
2024-05-29
Computer Vision and Pattern Recognition · Computer Science
Spanning Tree Autoregressive Visual Generation
Sangkyu Lee, Changho Lee, Janghoon Han, Hosung Song +5
2025-11-24
Audio and Speech Processing · Electrical Eng. & Systems
Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR
Xilai Li, Goeric Huybrechts, Srikanth Ronanki, Jeff Farris +1
2023-04-27
Audio and Speech Processing · Electrical Eng. & Systems
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation
Muhammad Shakeel, Yui Sudo, Yifan Peng, Shinji Watanabe
2024-09-12
Computation and Language · Computer Science
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue +3
2023-10-24
Computation and Language · Computer Science
Streaming Models for Joint Speech Recognition and Translation
Orion Weller, Matthias Sperber, Christian Gollan, Joris Kluivers
2021-01-25