Related papers: Scalable Stellar Parameter Inference Using Python-…

PISP: Projected-Space Inference of Stellar Parameters

To improve the accuracy and efficiency of high-dimensional stellar parameter inference in large spectroscopic datasets, we propose a projection-assisted parameter-inference framework -- Projected-Space Inference of Stellar Parameters…

Solar and Stellar Astrophysics · Physics 2026-04-20 Jun-Chao Liang , Yin-Bi Li , A-Li Luo , Shuo Li , Xiao-Xiao Ma , Hai-Ling Lu , Shu-Guo Ma , Ming-Hui Jia , Shuo Ye , Hao Zeng , Ke-Fei Wu , Zhi-Hua Zhong , Xiao Kong , Li-Li Wang , Hugh R. A. Jones

The LAMOST Stellar Parameter Pipeline at Peking University --- LSP3

We introduce the LAMOST Stellar Parameter Pipeline at Peking University --- LSP3, developed and implemented for the determinations of radial velocity $V_{\rm r}$ and stellar atmospheric parameters (effective temperature $T_{\rm eff}$,…

Astrophysics of Galaxies · Physics 2015-06-23 Maosheng Xiang , Xiaowei Liu , Haibo Yuan , Yang Huang , Zhiying Huo , Huawei Zhang , Bingqiu Chen , Huihua Zhang , Ningchen Sun , Chun Wang , Yongheng Zhao , Jianrong Shi , Ali Luo , Guoping Li , Yue Wu , Zongrui Bai , Yong Zhang , Yonghui Hou , Hailong Yuan , Guangwei Li

Automatic stellar spectral parameterization pipeline for LAMOST survey

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) project performed its five year formal survey since Sep. 2012, already fulfilled the pilot survey and the 1st two years general survey with an output - spectroscopic…

Instrumentation and Methods for Astrophysics · Physics 2014-07-09 Yue Wu , Ali Luo , Bing Du , Yongheng Zhao , Hailong Yuan

A comprehensive study on ILP acceleration accounting for sparsity, area, energy, data movement using near-memory architecture

Integer Linear Programming (ILP) is widely used for solving real-world optimization problems, including network routing, map routing, and traffic scheduling. However, ILP algorithms are sparse and branch-intensive, making them inefficient…

Hardware Architecture · Computer Science 2026-05-28 Siddhartha Raman Sundara Raman , Lizy K John , Jaydeep P. Kulkarni

Estimation of stellar atmospheric parameters from LAMOST DR8 low-resolution spectra with 20$\leq$SNR$<$30

The accuracy of the estimated stellar atmospheric parameter decreases evidently with the decreasing of spectral signal-to-noise ratio (SNR) and there are a huge amount of this kind observations, especially in case of SNR$<$30. Therefore, it…

Astrophysics of Galaxies · Physics 2023-12-27 Xiangru Li , Zhu Wang , Si Zeng , Caixiu Liao , Bing Du , X. Kong , Haining Li

High Performance Computing Applied to Logistic Regression: A CPU and GPU Implementation Comparison

We present a versatile GPU-based parallel version of Logistic Regression (LR), aiming to address the increasing demand for faster algorithms in binary classification due to large data sets. Our implementation is a direct translation of the…

Machine Learning · Computer Science 2023-08-22 Nechba Mohammed , Mouhajir Mohamed , Sedjari Yassine

Linear Attention Sequence Parallelism

Sequence parallelism (SP) serves as a prevalent strategy to handle long sequences that exceed the memory limit of a single device. However, for linear sequence modeling methods like linear attention, existing SP approaches do not take…

Machine Learning · Computer Science 2025-05-19 Weigao Sun , Zhen Qin , Dong Li , Xuyang Shen , Yu Qiao , Yiran Zhong

Ensemble Parameter Estimation for the Lumped Parameter Linear Superposition (LPLSP) Framework: A Rapid Approach to Reduced-Order Modeling for Transient Thermal Systems

This work introduces an ensemble parameter estimation framework that enables the Lumped Parameter Linear Superposition (LPLSP) method to generate reduced order thermal models from a single transient dataset. Unlike earlier implementations…

Numerical Analysis · Mathematics 2026-05-26 Neelakantan Padmanabhan

Accelerating Sparse Transformer Inference on GPU

Large language models (LLMs) are popular around the world due to their powerful understanding capabilities. As the core component of LLMs, accelerating Transformer through parallelization has gradually become a hot research topic. Mask…

Machine Learning · Computer Science 2026-05-29 Wenhao Dai , Haodong Deng , Mengfei Rong , Xinyu Yang , Hongyu Liu , Fangxin Liu , Hailong Yang , Qianwen Cao , Qingxiao Sun

EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization

Speculative decoding is an effective and lossless method for Large Language Model (LLM) inference acceleration. It employs a smaller model to generate a draft token sequence, which is then verified by the original base model. In multi-GPU…

Machine Learning · Computer Science 2025-12-09 Yize Wu , Ke Gao , Ling Li , Yanjun Wu

GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters

Sparse fine-tuning techniques adapt LLMs to downstream tasks by only tuning a sparse subset of model parameters. However, the effectiveness of sparse adaptation depends on optimally selecting the model parameters to be fine-tuned. In this…

Machine Learning · Computer Science 2025-10-23 Anand Choudhary , Yasser Sulaıman , Lukas Mauch , Ghouthi Boukli Hacene , Fabien Cardinaux , Antoine Bosselut

Accelerating scientific codes by performance and accuracy modeling

Scientific software is often driven by multiple parameters that affect both accuracy and performance. Since finding the optimal configuration of these parameters is a highly complex task, it extremely common that the software is used…

Computational Engineering, Finance, and Science · Computer Science 2016-08-17 Diego Fabregat-Traver , Ahmed E. Ismail , Paolo Bientinesi

Estimating Atmospheric Parameters from LAMOST Low-Resolution Spectra with Low SNR

Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) acquired tens of millions of low-resolution stellar spectra. The large amount of the spectra result in the urgency to explore automatic atmospheric parameter estimation…

Instrumentation and Methods for Astrophysics · Physics 2022-07-14 Xiangru Li , Si Zeng , Zhu Wang , Bing Du , Xiao Kong , Caixiu Liao

DuaLip-GPU Technical Report

Large-scale linear programs (LPs) arise in many decision systems, including ranking, allocation, and matching problems that must be solved repeatedly at massive scale. Prior work such as ECLIPSE and LinkedIn's open-source DuaLip showed that…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-06 Gregory Dexter , Aida Rahmattalabi , Sanjana Garg , Qinquan Song , Ruby Tu , Yuan Gao , Yi Zhang , Zhipeng Wang , Rahul Mazumder

Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity

Accelerating large language model (LLM) inference is critical for real-world deployments requiring high throughput and low latency. Contextual sparsity, where each token dynamically activates only a small subset of the model parameters,…

Machine Learning · Computer Science 2025-11-13 Susav Shrestha , Brad Settlemyer , Nikoli Dryden , Narasimha Reddy

Integrated Nested Laplace Approximations for Large-Scale Spatial-Temporal Bayesian Modeling

Bayesian inference tasks continue to pose a computational challenge. This especially holds for spatial-temporal modeling where high-dimensional latent parameter spaces are ubiquitous. The methodology of integrated nested Laplace…

Computation · Statistics 2023-03-28 Lisa Gaedke-Merzhäuser , Elias Krainski , Radim Janalik , Håvard Rue , Olaf Schenk

Practical offloading for fine-tuning LLM on commodity GPU via learned sparse projectors

Fine-tuning large language models (LLMs) requires significant memory, often exceeding the capacity of a single GPU. A common solution to this memory challenge is offloading compute and data from the GPU to the CPU. However, this approach is…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-11 Siyuan Chen , Zhuofeng Wang , Zelong Guan , Yudong Liu , Phillip B. Gibbons

Differentiable Particle Optimization for Fast Sequential Manipulation

Sequential robot manipulation tasks require finding collision-free trajectories that satisfy geometric constraints across multiple object interactions in potentially high-dimensional configuration spaces. Solving these problems in real-time…

Robotics · Computer Science 2025-10-14 Lucas Chen , Shrutheesh Raman Iyer , Zachary Kingston

ASPD: Unlocking Adaptive Serial-Parallel Decoding by Exploring Intrinsic Parallelism in LLMs

The increasing scale and complexity of large language models (LLMs) pose significant inference latency challenges, primarily due to their autoregressive decoding paradigm characterized by the sequential nature of next-token prediction. By…

Computation and Language · Computer Science 2025-08-15 Keyu Chen , Zhifeng Shen , Daohai Yu , Haoqian Wu , Wei Wen , Jianfeng He , Ruizhi Qiao , Xing Sun

PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation

Inference of Large Language Models (LLMs) across computer clusters has become a focal point of research in recent times, with many acceleration techniques taking inspiration from CPU speculative execution. These techniques reduce…

Computation and Language · Computer Science 2024-11-19 Branden Butler , Sixing Yu , Arya Mazaheri , Ali Jannesari