Efficient Inference-time Control and Alignment

dc.contributor.author: Rashid, Ahmad
dc.date.accessioned: 2026-04-30T19:54:36Z
dc.date.available: 2026-04-30T19:54:36Z
dc.date.issued: 2026-04-30
dc.date.submitted: 2026-04-06
dc.description.abstract: Modern foundation models are typically trained in three broad stages. First, large-scale pre-training is performed using self-supervised learning on massive corpora. Second, models are adapted through mid-training using supervised fine-tuning or instruction tuning on labeled datasets. Finally, a post-training stage is often applied, using preference data and reinforcement learning to align the model and improve its safety, reliability, and usefulness. Although effective, post-training methods can be computationally expensive and inflexible once large models are deployed. This thesis explores an alternative paradigm: enforcing behavioral objectives at inference time rather than modifying model parameters during post-training. In this approach, smaller modular control models are combined with a base model to shape its predictions at decision time. Our aim is to design alignment mechanisms that are both mathematically grounded and empirically strong while remaining computationally efficient and easy to deploy. We apply this perspective of inference-time control to three problems. First, we address reliability in neural classifiers. We introduce PreLoad, an inference-time mechanism that mitigates arbitrarily high confidence on inputs outside the training support while preserving accuracy and training efficiency. Second, we study reward-guided text generation (RGTG) in large language models as a form of inference-time alignment. We show that stable reward-guided decoding requires carefully designed token-level reward models and propose two algorithms, PARGS and FaRMA, that enable effective reward-guided generation. Third, we address the computational cost of RGTG and propose an efficient algorithm that adds only minor overhead during inference while preserving the performance and benefits of reward-guided decoding. Together, these results demonstrate that inference-time control provides a flexible and computationally efficient framework for shaping the behavior of modern neural systems. By decoupling representation learning from decision-time objectives, this work introduces new tools for improving the reliability, alignment, and efficiency of large-scale machine learning models without retraining them.
dc.identifier.uri: https://hdl.handle.net/10012/23136
dc.language.iso: en
dc.pending: false
dc.publisher: University of Waterloo
dc.subject: Artificial Intelligence
dc.subject: Alignment
dc.subject: Large Language Models
dc.subject: Deep Learning
dc.subject: Reliable AI
dc.subject: Inference-time Control
dc.subject: Machine Learning
dc.subject: Natural Language Processing
dc.subject: Reinforcement Learning
dc.subject: Test-time Compute
dc.subject: Controlled Decoding
dc.subject: Algorithms
dc.subject: Efficient AI
dc.subject: Reward Models
dc.subject: Value Functions
dc.subject: Reward Guided Text Generation
dc.subject: OOD Detection
dc.subject: Out-of-Distribution
dc.title: Efficient Inference-time Control and Alignment
dc.type: Doctoral Thesis
uws-etd.degree: Doctor of Philosophy
uws-etd.degree.department: David R. Cheriton School of Computer Science
uws-etd.degree.discipline: Computer Science
uws-etd.degree.grantor: University of Waterloo
uws-etd.embargo.terms: 0
uws.contributor.advisor: Poupart, Pascal
uws.contributor.affiliation1: Faculty of Mathematics
uws.peerReviewStatus: Unreviewed
uws.published.city: Waterloo
uws.published.country: Canada
uws.published.province: Ontario
uws.scholarLevel: Graduate
uws.typeOfResource: Text
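
The reward-guided text generation (RGTG) setting summarized in the abstract can be made concrete with a short sketch. The Python below is a generic illustration of inference-time reward guidance, not the thesis's PreLoad, PARGS, or FaRMA methods: it assumes a hypothetical token-level reward model that scores every candidate next token, and reweights a frozen base model's next-token distribution as p_guided(x) proportional to p_base(x) * exp(beta * r(x)), so behavior is shaped at decoding time without retraining.

# Minimal sketch of inference-time reward-guided decoding. Generic
# illustration only; the token-level reward model is a stand-in for
# any per-token scorer, not a method defined in this thesis.
import torch

def reward_guided_step(base_logits: torch.Tensor,
                       token_rewards: torch.Tensor,
                       beta: float = 1.0) -> torch.Tensor:
    """Return the guided next-token distribution.

    base_logits   -- (vocab_size,) logits from the frozen base language model
    token_rewards -- (vocab_size,) scores from a small token-level reward model
    beta          -- strength of the reward guidance (beta = 0 recovers p_base)
    """
    log_p_base = torch.log_softmax(base_logits, dim=-1)
    # p_guided(x) is proportional to p_base(x) * exp(beta * r(x))
    return torch.softmax(log_p_base + beta * token_rewards, dim=-1)

# Toy usage: pick the next token greedily under the guided distribution.
vocab_size = 8
base_logits = torch.randn(vocab_size)    # stand-in for a real LM's logits
token_rewards = torch.randn(vocab_size)  # stand-in for a reward model's scores
p_guided = reward_guided_step(base_logits, token_rewards, beta=0.5)
next_token = int(torch.argmax(p_guided))

Setting beta = 0 recovers the base model exactly, which is the appeal of this form of guidance: the pre-trained distribution is left untouched except where the reward model intervenes, and no base-model parameters are updated.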

Files

Original bundle

Name: Rashid_Ahmad.pdf
Size: 1017.3 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 6.4 KB
Format: Item-specific license agreed upon to submission
