Jerry Chee — Scifaro

Datasets for Navigating Sensitive Topics in Recommendation Systems

Personalized AI systems, from recommendation systems to chatbots, are a prevalent method for distributing content to users based on their learned preferences. However, there is growing concern about the adverse effects of these systems,…

Information Retrieval · Computer Science 2025-09-10 Amelia Kovacs , Jerry Chee , Kimia Kazemian , Sarah Dean

DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory

Quantizing the weights of a neural network has two steps: (1) Finding a good low bit-complexity representation for weights (which we call the quantization grid) and (2) Rounding the original weights to values in the quantization grid. In…

Machine Learning · Computer Science 2025-01-14 Jerry Chee , Arturs Backurs , Rainie Heck , Li Zhang , Janardhan Kulkarni , Thomas Rothvoss , Sivakanth Gopi

Harm Mitigation in Recommender Systems under User Preference Dynamics

We consider a recommender system that takes into account the interplay between recommendations, the evolution of user interests, and harmful content. We model the impact of recommendations on user behavior, particularly the tendency to…

Information Retrieval · Computer Science 2024-06-17 Jerry Chee , Shankar Kalyanaraman , Sindhu Kiranmai Ernala , Udi Weinsberg , Sarah Dean , Stratis Ioannidis

QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks

Post-training quantization (PTQ) reduces the memory footprint of LLMs by quantizing their weights to low-precision. In this work, we introduce QuIP#, a weight-only PTQ method that achieves state-of-the-art results in extreme compression…

Machine Learning · Computer Science 2024-06-05 Albert Tseng , Jerry Chee , Qingyao Sun , Volodymyr Kuleshov , Christopher De Sa

QuIP: 2-Bit Quantization of Large Language Models With Guarantees

This work studies post-training parameter quantization in large language models (LLMs). We introduce quantization with incoherence processing (QuIP), a new method based on the insight that quantization benefits from $\textit{incoherent}$…

Machine Learning · Computer Science 2024-01-17 Jerry Chee , Yaohui Cai , Volodymyr Kuleshov , Christopher De Sa

Model Preserving Compression for Neural Networks

After training complex deep learning models, a common task is to compress the model to reduce compute and storage demands. When compressing, it is desirable to preserve the original model's per-example decisions (e.g., to go beyond top-1…

Machine Learning · Computer Science 2022-10-18 Jerry Chee , Megan Renz , Anil Damle , Christopher De Sa

How Low Can We Go: Trading Memory for Error in Low-Precision Training

Low-precision arithmetic trains deep learning models using less energy, less memory and less time. However, we pay a price for the savings: lower precision may yield larger round-off error and hence larger prediction error. As applications…

Machine Learning · Computer Science 2022-03-18 Chengrun Yang , Ziyang Wu , Jerry Chee , Christopher De Sa , Madeleine Udell

Performance optimizations on deep noise suppression models

We study the role of magnitude structured pruning as an architecture search to speed up the inference time of a deep noise suppression (DNS) model. While deep learning approaches have been remarkably successful in enhancing audio quality,…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-12 Jerry Chee , Sebastian Braun , Vishak Gopal , Ross Cutler

Understanding and Detecting Convergence for Stochastic Gradient Descent with Momentum

Convergence detection of iterative stochastic optimization methods is of great practical interest. This paper considers stochastic gradient descent (SGD) with a constant learning rate and momentum. We show that there exists a transient…

Machine Learning · Computer Science 2020-08-28 Jerry Chee , Ping Li

Convergence diagnostics for stochastic gradient descent with constant step size

Many iterative procedures in stochastic optimization exhibit a transient phase followed by a stationary phase. During the transient phase the procedure converges towards a region of interest, and during the stationary phase the procedure…

Machine Learning · Statistics 2018-02-26 Jerry Chee , Panos Toulis