Related papers: Composer 2 Technical Report

200 papers

We introduce Composer's Assistant 2, a system for interactive human-computer composition in the REAPER digital audio workstation. Our work upgrades the Composer's Assistant system (which performs multi-track infilling of symbolic music at…

Sound · Computer Science 2024-07-23 Martin E. Malandro

High-fidelity text-to-music generation typically relies on massive proprietary datasets and immense computational resources. Existing models often struggle to generate coherent pure musical accompaniments and lack precise, localized…

Audio and Speech Processing · Electrical Eng. & Systems 2026-05-19 Huakang Chen, Wenkai Cheng, Guobin Ma, Chunbo Hao, Yuxuan Xia, Mengqi Wei, Zhixian Zhao, Pengcheng Zhu, Hanbing Zhang, Lei Xie

Compositional generalization is a basic and essential intellective capability of human beings, which allows us to recombine known parts readily. However, existing neural network based models have been proven to be extremely deficient in…

Artificial Intelligence · Computer Science 2020-10-27 Qian Liu, Shengnan An, Jian-Guang Lou, Bei Chen, Zeqi Lin, Yan Gao, Bin Zhou, Nanning Zheng, Dongmei Zhang

Advancing complex reasoning in large language models relies on high-quality, verifiable datasets, yet human annotation remains cost-prohibitive and difficult to scale. Current synthesis paradigms often face a recurring trade-off:…

Artificial Intelligence · Computer Science 2026-02-04 Zhengbo Jiao, Shaobo Wang, Zifan Zhang, Xuan Ren, Wei Wang, Bing Zhao, Hu Wei, Linfeng Zhang

Text-to-image generative models have achieved remarkable visual quality but still struggle with compositionality, that is, accurately capturing object relationships, attribute bindings, and fine-grained details in prompts. A key limitation is that…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Arman Zarei, Jiacheng Pan, Matthew Gwilliam, Soheil Feizi, Zhenheng Yang

Recent large-scale generative models learned on big data are capable of synthesizing incredible images yet suffer from limited controllability. This work offers a new generation paradigm that allows flexible control of the output image,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-23 Lianghua Huang, Di Chen, Yu Liu, Yujun Shen, Deli Zhao, Jingren Zhou

Typical schedulers in multi-tenancy environments make use of reactive, feedback-oriented mechanisms based on performance counters to avoid resource contention but suffer from detection lag and loss of performance. In this paper, we address…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-11-02 Girish Mururu, Sharjeel Khan, Bodhisatwa Chatterjee, Chao Chen, Chris Porter, Ada Gavrilovska, Santosh Pande

Hybrid model architectures that combine computational primitives (e.g., Attention, MLP) in different ratios have shown promising performance beyond Transformers. Some studies have shown that different interleavings of primitives can affect…

Existing generative models, such as diffusion and auto-regressive networks, are inherently static, relying on a fixed set of pretrained parameters to handle all inputs. In contrast, humans flexibly adapt their internal generative…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Minh-Tuan Tran, Xuan-May Le, Quan Hung Tran, Mehrtash Harandi, Dinh Phung, Trung Le

Systematic compositionality is an essential mechanism in human language, allowing the recombination of known parts to create novel expressions. However, existing neural models have been shown to lack this basic ability in learning symbolic…

Computation and Language · Computer Science 2021-10-01 Yichen Jiang, Mohit Bansal

When writing programs, people have the ability to tackle a new complex task by decomposing it into smaller and more familiar subtasks. While it is difficult to measure whether neural program synthesis methods have similar capabilities, what…

Machine Learning · Computer Science 2023-10-31 Kensen Shi, Joey Hong, Manzil Zaheer, Pengcheng Yin, Charles Sutton

The fine-tuning of deep pre-trained models has revealed compositional properties, with multiple specialized modules that can be arbitrarily composed into a single, multi-task model. However, identifying the conditions that promote…

Artificial Intelligence · Computer Science 2025-03-04 Angelo Porrello, Lorenzo Bonicelli, Pietro Buzzega, Monica Millunzi, Simone Calderara, Rita Cucchiara

Composition is a cornerstone of visual aesthetics, influencing the appeal of an image. While its principles operate independently of specific content, in practice, composition is often coupled with semantics. As a result, existing methods…

Computer Vision and Pattern Recognition · Computer Science 2026-05-07 Kai Zou, Zhiwei Zhao, Bin Liu, Nenghai Yu

Agents powered by large language models (LLMs) are increasingly adopted in the software industry, contributing code as collaborators or even autonomous developers. As their presence grows, it becomes important to assess the current…

Software Engineering · Computer Science 2026-02-12 Qixing Zhou, Jiacheng Zhang, Haiyang Wang, Rui Hao, Jiahe Wang, Minghao Han, Yuxue Yang, Shuzhe Wu, Feiyang Pan, Lue Fan, Dandan Tu, Zhaoxiang Zhang

Xmodel-2 is a 1.2-billion-parameter large language model designed specifically for reasoning tasks. Its architecture enables different model scales to share a unified set of hyperparameters, allowing for extensive experimentation on smaller…

Artificial Intelligence · Computer Science 2024-12-30 Wang Qun, Liu Yang, Lin Qingquan, Qu Zhijiu, Jiang Ling

Transformer-based language models for code have shown remarkable performance in various software analytics tasks, but their adoption is hindered by high computational costs, slow inference speeds, and substantial environmental impact. Model…

Software Engineering · Computer Science 2026-04-15 Md. Abdul Awal, Mrigank Rochan, Chanchal K. Roy

Large language models have been successfully applied to programming assistance tasks, such as code completion, code insertion, and instructional code editing. However, these applications remain insufficiently automated and struggle to…

Computation and Language · Computer Science 2025-05-14 Hao Jiang, Qi Liu, Rui Li, Shengyu Ye, Shijin Wang

Agentic repository-level code understanding is essential for automating complex software engineering tasks, yet the field lacks reliable benchmarks. Existing evaluations often overlook the long tail topics and rely on popular repositories…

The evolution of Large Language Models (LLMs) into autonomous agents has expanded the scope of AI coding from localized code generation to complex, repository-level, and execution-driven problem solving. However, current benchmarks…

Software Engineering · Computer Science 2026-01-19 Jie Yang, Honglin Guo, Li Ji, Jiazheng Zhou, Rui Zheng, Zhikai Lei, Shuo Zhang, Zhiheng Xi, Shichun Liu, Yuxin Wang, Bo Wang, Yining Zheng, Tao Gui, Xipeng Qiu