English
Related papers

Related papers: VecTrans: Enhancing Compiler Auto-Vectorization th…

200 papers

Vectorization via Single Instruction, Multiple Data (SIMD) architectures is a cornerstone of high-performance computing. To fully exploit hardware potential, developers often resort to explicit vectorization using intrinsics, as…

Computation and Language · Computer Science 2026-05-19 Shangzhan Li , Xinyu Yin , Xuanyu Jin , Ye He , Yuxin Zhou , Yuxuan Li , Xu Han , Wanxiang Che , Qi Shi , Ting Liu , Maosong Sun

Vectorization is a powerful optimization technique that significantly boosts the performance of high performance computing applications operating on large data arrays. Despite decades of research on auto-vectorization, compilers frequently…

Software Engineering · Computer Science 2024-06-10 Jubi Taneja , Avery Laird , Cong Yan , Madan Musuvathi , Shuvendu K. Lahiri

One of the key challenges arising when compilers vectorize loops for today's SIMD-compatible architectures is to decide if vectorization or interleaving is beneficial. Then, the compiler has to determine how many instructions to pack…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-07 Ameer Haj-Ali , Nesreen K. Ahmed , Ted Willke , Sophia Shao , Krste Asanovic , Ion Stoica

While model serving has unlocked unprecedented capabilities, the high cost of serving large-scale models continues to be a significant barrier to widespread accessibility and rapid innovation. Compiler optimizations have long driven…

Machine Learning · Computer Science 2026-02-05 Annabelle Sujun Tang , Christopher Priebe , Rohan Mahapatra , Lianhui Qin , Hadi Esmaeilzadeh

Large language models (LLMs) have shown remarkable in-context learning (ICL) capabilities on textual data. We explore whether these capabilities can be extended to continuous vectors from diverse domains, obtained from black-box pretrained…

Computation and Language · Computer Science 2025-02-21 Yufan Zhuang , Chandan Singh , Liyuan Liu , Jingbo Shang , Jianfeng Gao

The unprecedented advancements in Large Language Models (LLMs) have profoundly impacted natural language processing but have yet to fully embrace the realm of scalable vector graphics (SVG) generation. While LLMs encode partial knowledge of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Ximing Xing , Juncheng Hu , Guotao Liang , Jing Zhang , Dong Xu , Qian Yu

Modern processors increasingly rely on SIMD instruction sets, such as AVX and RVV, to significantly enhance parallelism and computational performance. However, production-ready compilers like LLVM and GCC often fail to fully exploit…

Programming Languages · Computer Science 2025-10-07 Shihan Fang , Wenxin Zheng

Recently, the use of large language models (LLMs) for software code generation, e.g., C/C++ and Python, has proven a great success. However, LLMs still suffer from low syntactic and functional correctness when it comes to the generation of…

Hardware Architecture · Computer Science 2024-07-29 Mingzhe Gao , Jieru Zhao , Zhe Lin , Wenchao Ding , Xiaofeng Hou , Yu Feng , Chao Li , Minyi Guo

Multimodal large language models (MLLMs) suffer from high computational costs due to excessive visual tokens, particularly in high-resolution and video-based scenarios. Existing token reduction methods typically focus on isolated pipeline…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Hanxun Yu , Wentong Li , Xuan Qu , Song Wang , Junbo Chen , Jianke Zhu

A current trend in HPC systems is the utilization of architectures with SIMD or vector extensions to exploit data parallelism. There are several ways to take advantage of such modern vector architectures, each with a different impact on the…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-05 Marc Blancafort , Roger Ferrer , Guillaume Houzeaux , Marta Garcia-Gasulla , Filippo Mantovani

Hardware accelerators, in particular accelerators for tensor processing, have many potential application domains. However, they currently lack the software infrastructure to support the majority of domains outside of deep learning.…

Hardware Architecture · Computer Science 2024-08-08 Charles Hong , Sahil Bhatia , Altan Haan , Shengjun Kris Dong , Dima Nikiforov , Alvin Cheung , Yakun Sophia Shao

Code translation tools (transpilers) are developed for automatic source-to-source translation. Although learning-based transpilers have shown impressive enhancement against rule-based counterparts, owing to their task-specific pre-training…

Software Engineering · Computer Science 2024-05-14 Zhen Yang , Fang Liu , Zhongxing Yu , Jacky Wai Keung , Jia Li , Shuo Liu , Yifan Hong , Xiaoxue Ma , Zhi Jin , Ge Li

Modern microprocessors are equipped with single instruction multiple data (SIMD) or vector instruction sets which allow compilers to exploit superword level parallelism (SLP), a type of fine-grained parallelism. Current SLP…

Programming Languages · Computer Science 2018-10-31 Charith Mendis , Saman Amarasinghe

Large Language Models (LLMs) have demonstrated remarkable capabilities across a variety of software engineering and coding tasks. However, their application in the domain of code and compiler optimization remains underexplored. Training…

Programming Languages · Computer Science 2024-07-04 Chris Cummins , Volker Seeker , Dejan Grubisic , Baptiste Roziere , Jonas Gehring , Gabriel Synnaeve , Hugh Leather

Although Large Vision Language Models (LVLMs) have demonstrated impressive multimodal reasoning capabilities, their scalability and deployment are constrained by massive computational requirements. In particular, the massive amount of…

Machine Learning · Computer Science 2026-04-14 Surendra Pathak , Bo Han

Vision-language models (VLMs) offer a promising paradigm for image classification by comparing the similarity between images and class embeddings. A critical challenge lies in crafting precise textual representations for class names. While…

Computer Vision and Pattern Recognition · Computer Science 2024-02-20 Songhao Han , Le Zhuo , Yue Liao , Si Liu

Multimodal large language models (MLLMs) have demonstrated remarkable potential for enhancing scene understanding in autonomous driving systems through powerful logical reasoning capabilities. However, the deployment of these models faces…

Computer Vision and Pattern Recognition · Computer Science 2024-09-18 Yunsheng Ma , Amr Abdelraouf , Rohit Gupta , Ziran Wang , Kyungtae Han

Automatically extracting vectorized building contours from remote sensing imagery is crucial for urban planning, population estimation, and disaster assessment. Current state-of-the-art methods rely on complex multi-stage pipelines…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Tao Zhang , Shiqing Wei , Shihao Chen , Wenling Yu , Muying Luo , Shunping Ji

Large Language Models (LLMs) have shown promise in the autonomous driving sector, particularly in generalization and interpretability. We introduce a unique object-level multimodal LLM architecture that merges vectorized numeric modalities…

In the realm of vision models, the primary mode of representation is using pixels to rasterize the visual world. Yet this is not always the best or unique way to represent visual content, especially for designers and artists who depict the…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Bocheng Zou , Mu Cai , Jianrui Zhang , Yong Jae Lee
‹ Prev 1 2 3 10 Next ›