Related papers: Sequence-to-Set Generative Models

Order Matters: Sequence to sequence for sets

Sequences have become first class citizens in supervised learning thanks to the resurgence of recurrent neural networks. Many complex tasks that require mapping from or to a sequence of observations can now be formulated with the…

Machine Learning · Statistics 2016-02-25 Oriol Vinyals, Samy Bengio, Manjunath Kudlur

Set2Model Networks: Learning Discriminatively To Learn Generative Models

We present a new "learning-to-learn"-type approach that enables rapid learning of concepts from small-to-medium sized training sets and is primarily designed for web-initialized image retrieval. At the core of our approach is a deep…

Computer Vision and Pattern Recognition · Computer Science 2017-10-30 A. Vakhitov, A. Kuzmin, V. Lempitsky

Seq2Slate: Re-ranking and Slate Optimization with RNNs

Ranking is a central task in machine learning and information retrieval. In this task, it is especially important to present the user with a slate of items that is appealing as a whole. This in turn requires taking into account interactions…

Information Retrieval · Computer Science 2019-03-21 Irwan Bello, Sayali Kulkarni, Sagar Jain, Craig Boutilier, Ed Chi, Elad Eban, Xiyang Luo, Alan Mackey, Ofer Meshi

A Sequence-to-Sequence&Set Model for Text-to-Table Generation

Recently, the text-to-table generation task has attracted increasing attention due to its wide applications. In this aspect, the dominant model formalizes this task as a sequence-to-sequence generation task and serializes each table into a…

Computation and Language · Computer Science 2023-06-02 Tong Li, Zhihao Wang, Liangying Shao, Xuling Zheng, Xiaoli Wang, Jinsong Su

Conditional set generation using Seq2seq models

Conditional set generation learns a mapping from an input sequence of tokens to a set. Several NLP tasks, such as entity typing and dialogue emotion tagging, are instances of set generation. Seq2Seq models, a popular choice for set…

Computation and Language · Computer Science 2022-10-25 Aman Madaan, Dheeraj Rajagopal, Niket Tandon, Yiming Yang, Antoine Bosselut

Plan, Attend, Generate: Planning for Sequence-to-Sequence Models

We investigate the integration of a planning mechanism into sequence-to-sequence models using attention. We develop a model which can plan ahead in the future when it computes its alignments between input and output sequences, constructing…

Machine Learning · Computer Science 2017-11-29 Francis Dutil, Caglar Gulcehre, Adam Trischler, Yoshua Bengio

Sequence-to-Sequence Learning as Beam-Search Optimization

Sequence-to-Sequence (seq2seq) modeling has rapidly become an important general-purpose NLP tool that has proven effective for many text-generation and sequence-labeling tasks. Seq2seq builds on deep neural language modeling and inherits…

Computation and Language · Computer Science 2016-11-11 Sam Wiseman, Alexander M. Rush

A Sequential Set Generation Method for Predicting Set-Valued Outputs

Consider a general machine learning setting where the output is a set of labels or sequences. This output set is unordered and its size varies with the input. Whereas multi-label classification methods seem a natural first resort, they are…

Machine Learning · Computer Science 2019-03-14 Tian Gao, Jie Chen, Vijil Chenthamarakshan, Michael Witbrock

Sequence Modeling with Unconstrained Generation Order

The dominant approach to sequence generation is to produce a sequence in some predefined order, e.g. left to right. In contrast, we propose a more general model that can generate the output sequence by inserting tokens in any arbitrary…

Computation and Language · Computer Science 2019-11-04 Dmitrii Emelianenko, Elena Voita, Pavel Serdyukov

Large batch size training of neural networks with adversarial training and second-order information

The most straightforward method to accelerate Stochastic Gradient Descent (SGD) computation is to distribute the randomly selected batch of inputs over multiple processors. To keep the distributed processors fully utilized requires…

Machine Learning · Computer Science 2020-01-06 Zhewei Yao, Amir Gholami, Daiyaan Arfeen, Richard Liaw, Joseph Gonzalez, Kurt Keutzer, Michael Mahoney

A Generative Re-ranking Model for List-level Multi-objective Optimization at Taobao

E-commerce recommendation systems aim to generate ordered lists of items for customers, optimizing multiple business objectives, such as clicks, conversions and Gross Merchandise Volume (GMV). Traditional multi-objective optimization…

Information Retrieval · Computer Science 2025-05-13 Yue Meng, Cheng Guo, Yi Cao, Tong Liu, Bo Zheng

Convolutional Sequence to Sequence Learning

The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to…

Computation and Language · Computer Science 2017-07-26 Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin

From Small to Large: A Graph Convolutional Network Approach for Solving Assortment Optimization Problems

Assortment optimization seeks to select a subset of substitutable products, subject to constraints, to maximize expected revenue. The problem is NP-hard due to its combinatorial and nonlinear nature and arises frequently in industries such…

Machine Learning · Computer Science 2025-11-18 Guokai Li, Pin Gao, Stefanus Jasin, Zizhuo Wang

Permutation Invariant Graph Generation via Score-Based Generative Modeling

Learning generative models for graph-structured data is challenging because graphs are discrete, combinatorial, and the underlying data distribution is invariant to the ordering of nodes. However, most of the existing generative models for…

Machine Learning · Computer Science 2020-03-03 Chenhao Niu, Yang Song, Jiaming Song, Shengjia Zhao, Aditya Grover, Stefano Ermon

Order Matters: Probabilistic Modeling of Node Sequence for Graph Generation

A graph generative model defines a distribution over graphs. One type of generative model is constructed by autoregressive neural networks, which sequentially add nodes and edges to generate a graph. However, the likelihood of a graph under…

Machine Learning · Statistics 2021-06-15 Xiaohui Chen, Xu Han, Jiajing Hu, Francisco J. R. Ruiz, Liping Liu

A Sequential Learning Procedure with Applications to Online Sales Examination

In this paper, we consider the problem of estimating parameters in a linear regression model. We propose a sequential learning procedure to determine the sample size for achieving a given small estimation risk, under the widely used…

Methodology · Statistics 2023-11-07 Jun Hu, Yan Zhuang, Shunan Zhao

Neighbor2Seq: Deep Learning on Massive Graphs by Transforming Neighbors to Sequences

Modern graph neural networks (GNNs) use a message passing scheme and have achieved great success in many fields. However, this recursive design inherently leads to excessive computation and memory requirements, making it not applicable to…

Machine Learning · Computer Science 2022-02-08 Meng Liu, Shuiwang Ji

Set Interdependence Transformer: Set-to-Sequence Neural Networks for Permutation Learning and Structure Prediction

The task of learning to map an input set onto a permuted sequence of its elements is challenging for neural networks. Set-to-sequence problems occur in natural language processing, computer vision and structure prediction, where…

Machine Learning · Computer Science 2022-06-09 Mateusz Jurewicz, Leon Derczynski

Time-Series Learning for Proactive Fault Prediction in Distributed Systems with Deep Neural Structures

This paper addresses the challenges of fault prediction and delayed response in distributed systems by proposing an intelligent prediction method based on temporal feature learning. The method takes multi-dimensional performance metric…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-05-28 Yang Wang, Wenxuan Zhu, Xuehui Quan, Heyi Wang, Chang Liu, Qiyuan Wu

Generative Model Selection Using a Scalable and Size-Independent Complex Network Classifier

Real networks exhibit nontrivial topological features such as heavy-tailed degree distribution, high clustering, and small-worldness. Researchers have developed several generative models for synthesizing artificial networks that are…

Social and Information Networks · Computer Science 2014-02-04 Sadegh Motallebi, Sadegh Aliakbary, Jafar Habibi