English
Related papers

Related papers: Representation Learning for Stack Overflow Posts: …

200 papers

Active learning has been shown to be an effective way to alleviate some of the effort required in utilising large collections of unlabelled data for machine learning tasks without needing to fully label them. The representation mechanism…

Information Retrieval · Computer Science 2020-04-29 Jinghui Lu , Brian MacNamee

Context: Navigating the knowledge of Stack Overflow (SO) remains challenging. To make the posts vivid to users, SO allows users to write and edit posts with Markdown or HTML so that users can leverage various formatting styles (e.g., bold,…

Computation and Language · Computer Science 2024-04-29 Shahla Shaan Ahmed , Shaowei Wang , Yuan Tian , Tse-Hsun , Chen , Haoxiang Zhang

Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through~\textit{prod2vec}. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to…

Computation and Language · Computer Science 2021-06-24 Federico Bianchi , Bingqing Yu , Jacopo Tagliabue

BERT (Bidirectional Encoder Representations from Transformers) and related pre-trained Transformers have provided large gains across many language understanding tasks, achieving a new state-of-the-art (SOTA). BERT is pre-trained on two…

Computation and Language · Computer Science 2020-06-22 Michael Glass , Alfio Gliozzo , Rishav Chakravarti , Anthony Ferritto , Lin Pan , G P Shrivatsa Bhargav , Dinesh Garg , Avirup Sil

Recently, pre-trained models have been the dominant paradigm in natural language processing. They achieved remarkable state-of-the-art performance across a wide range of related tasks, such as textual entailment, natural language inference,…

Computation and Language · Computer Science 2019-05-21 Dongfang Li , Yifei Yu , Qingcai Chen , Xinyu Li

Context: Stack Overflow is very helpful for software developers who are seeking answers to programming problems. Previous studies have shown that a growing number of questions are of low quality and thus obtain less attention from potential…

Computation and Language · Computer Science 2022-08-26 Fengji Zhang , Xiao Yu , Jacky Keung , Fuyang Li , Zhiwen Xie , Zhen Yang , Caoyuan Ma , Zhimin Zhang

Stack Overflow is often viewed as the most influential Software Question Answer (SQA) website with millions of programming-related questions and answers. Tags play a critical role in efficiently structuring the contents in Stack Overflow…

Software Engineering · Computer Science 2022-05-12 Junda He , Bowen Xu , Zhou Yang , DongGyun Han , Chengran Yang , David Lo

Manually labelling large collections of text data is a time-consuming, expensive, and laborious task, but one that is necessary to support machine learning based on text datasets. Active learning has been shown to be an effective way to…

Computation and Language · Computer Science 2019-10-11 Jinghui Lu , Maeve Henchion , Brian Mac Namee

Pre-trained transformer models shine in many natural language processing tasks and therefore are expected to bear the representation of the input sentence or text meaning. These sentence-level embeddings are also important in…

Computation and Language · Computer Science 2025-02-21 Lukas Stankevičius , Mantas Lukoševičius

Stack Overflow is one of the most influential Software Question & Answer (SQA) websites, hosting millions of programming-related questions and answers. Tags play a critical role in efficiently organizing the contents in Stack Overflow and…

Software Engineering · Computer Science 2024-08-06 Junda He , Bowen Xu , Zhou Yang , DongGyun Han , Chengran Yang , Jiakun Liu , Zhipeng Zhao , David Lo

Software developers often resort to Stack Overflow (SO) to fill their programming needs. Given the abundance of relevant posts, navigating them and comparing different solutions is tedious and time-consuming. Recent work has proposed to…

Software Engineering · Computer Science 2023-05-29 Bonan Kou , Muhao Chen , Tianyi Zhang

Recent advancements in AI have sparked a trend in constructing large, generalist language models that handle a multitude of tasks, including many code-related ones. While these models are expensive to train and are often closed-source, they…

Computation and Language · Computer Science 2025-02-24 Manisha Mukherjee , Vincent J. Hellendoorn

User representation learning is vital to capture diverse user preferences, while it is also challenging as user intents are latent and scattered among complex and different modalities of user-generated data, thus, not directly measurable.…

Social and Information Networks · Computer Science 2019-12-03 Lin Gong , Lu Lin , Weihao Song , Hongning Wang

APIs (Application Programming Interfaces) are reusable software libraries and are building blocks for modern rapid software development. Previous research shows that programmers frequently share and search for reviews of APIs on the…

Software Engineering · Computer Science 2022-01-28 chengran Yang , Bowen Xu , Junaed younus Khan , Gias Uddin , Donggyun Han , Zhou Yang , David Lo

Representation learning is a critical ingredient for natural language processing systems. Recent Transformer language models like BERT learn powerful textual representations, but these models are targeted towards token- and sentence-level…

Computation and Language · Computer Science 2020-05-21 Arman Cohan , Sergey Feldman , Iz Beltagy , Doug Downey , Daniel S. Weld

Learning high-quality sentence representations benefits a wide range of natural language processing tasks. Though BERT-based pre-trained language models achieve high performance on many downstream tasks, the native derived sentence…

Computation and Language · Computer Science 2021-05-26 Yuanmeng Yan , Rumei Li , Sirui Wang , Fuzheng Zhang , Wei Wu , Weiran Xu

Text classification algorithms investigate the intricate relationships between words or phrases and attempt to deduce the document's interpretation. In the last few years, these algorithms have progressed tremendously. Transformer…

Computation and Language · Computer Science 2022-06-28 Snehal Khandve , Vedangi Wagh , Apurva Wani , Isha Joshi , Raviraj Joshi

Tremendous amounts of multimedia associated with speech information are driving an urgent need to develop efficient and effective automatic summarization methods. To this end, we have seen rapid progress in applying supervised deep neural…

Computation and Language · Computer Science 2020-06-03 Shi-Yan Weng , Tien-Hong Lo , Berlin Chen

Recent developments in unsupervised representation learning have successfully established the concept of transfer learning in NLP. Mainly three forces are driving the improvements in this area of research: More elaborated architectures are…

Computation and Language · Computer Science 2020-07-22 Matthias Aßenmacher , Christian Heumann

In this paper, we propose Stacked DeBERT, short for Stacked Denoising Bidirectional Encoder Representations from Transformers. This novel model improves robustness in incomplete data, when compared to existing systems, by designing a novel…

Computation and Language · Computer Science 2021-01-15 Gwenaelle Cunha Sergio , Minho Lee
‹ Prev 1 2 3 10 Next ›