Calibrated Act–Ask–Abstain Gating for Agentic Language Models in Resource-Constrained Interactive Tasks

Dutta, Balram

doi:https://doi.org/10.55041/ijcope.v2i5.306

Volume 02, Issue 05

Published on: May 2026

CALIBRATED ACT–ASK–ABSTAIN GATING FOR AGENTIC LANGUAGE MODELS IN RESOURCE-CONSTRAINED INTERACTIVE TASKS

Balram Dutta

J.C. Bose University of Science and Technology YMCA Faridabad India

DOI:https://doi.org/10.55041/ijcope.v2i5.306

Article Status

Plagiarism Passed Peer Reviewed Open Access

Available Documents

Download PDF Review Report

Abstract

Agentic AI systems increasingly operate in interac- tive, multi-step environments where correct behavior demands not merely generating responses, but disciplining how and when to act, solicit clarification, or refrain altogether. Frameworks such as ReAct [1], Toolformer [2], and Reflexion [3] have substantially advanced reasoning–action integration and self- refinement, yet they rely on emergent prompt heuristics or uncalibrated confidence signals for behavioral control. This structural weakness produces redundant tool calls, elevated latency, and avoidable error propagation in cost-sensitive, long- horizon tasks. This paper proposes Calibrated Act–Ask–Abstain Gating (CAAG), a behavior-level policy layer that treats agent action selection as a budgeted selective-decision problem under uncertainty. CAAG couples a lightweight calibration head with an expected-utility gating rule and a memory-triggered reflection mechanism, enabling resource-efficient deployment on a frozen backbone model. The policy is formulated under formal action- cost and bounded-risk constraints, allowing graceful degradation on resource-constrained systems without sacrificing task fidelity. Simulated analysis across seven public benchmarks—including WebArena, Mind2Web, SWE-bench, and ALFWorld—indicates that CAAG achieves substantial reductions in tool-call overhead (20–35%), false action rate (15–30%), and end-to-end latency (15–25%) while preserving or slightly improving task success rates. CAAG positions basic behavioral calibration as a first-class optimization target in agentic AI, a missing design principle in contemporary agent architectures.

Index Terms—Agentic AI, uncertainty calibration, selective prediction, tool use, resource-efficient agents, act–ask–abstain policy, behavioral gating.

How to Cite this Paper

Dutta, B. (2026). Calibrated Act–Ask–Abstain Gating for Agentic Language Models in Resource-Constrained Interactive Tasks. International Journal of Creative and Open Research in Engineering and Management, <i>02</i>(05). https://doi.org/10.55041/ijcope.v2i5.306

Dutta, Balram. "Calibrated Act–Ask–Abstain Gating for Agentic Language Models in Resource-Constrained Interactive Tasks." International Journal of Creative and Open Research in Engineering and Management, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijcope.v2i5.306.

Dutta, Balram. "Calibrated Act–Ask–Abstain Gating for Agentic Language Models in Resource-Constrained Interactive Tasks." International Journal of Creative and Open Research in Engineering and Management 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijcope.v2i5.306.

Search & Index

References

[1]     S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, and Y. Cao, “ReAct: Synergizing Reasoning and Acting in Language Models,” in Proc. Int. Conf. Learn. Representations (ICLR), 2023. [Online]. Avail- able: https://openreview.net/forum?id=WE vluYUL-X

[2]     T. Schick, J. Dwivedi-Yu, R. Dessi, R. Raileanu, M. Lomeli, L. Zettle- moyer, N. Cancedda, and T. Scialom, “Toolformer: Language Models Can Teach Themselves to Use Tools,” in Proc. Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 36, 2023.

[3]     N. Shinn, F. Cassano, B. Labash, A. Gopinath, K. Narasimhan, andYao, “Reflexion: Language Agents with Verbal Reinforcement Learn- ing,” in Proc. Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 36, 2023. [arXiv:2303.11366].

[4]     S. Yao, D. Yu, J. Zhao, I. Shafran, T. L. Griffiths, Y. Cao, andR. Narasimhan, “Tree of Thoughts: Deliberate Problem Solving with Large Language Models,” in Proc. Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 36, 2023.

[5]     Y. Geifman and R. El-Yaniv, “SelectiveNet: A Deep Neural Network with an Integrated Reject Option,” in Proc. Int. Conf. Mach. Learn. (ICML), 2019, pp. 2151–2159.

[6]     H. Liu, Z.-Y. Dou, Y. Wang, N. Peng, and Y. Yue, “Uncertainty Calibration for Tool-Using Language Agents,” in Findings of the Assoc. Comput. Linguistics: EMNLP, 2024, pp. 16781–16805.

[7]     X. Liu et al., “AgentBench: Evaluating LLMs as Agents,” arXiv preprint arXiv:2308.03688, 2023.

[8]     S. Zhou et al., “WebArena: A Realistic Web Environment for Building Autonomous Agents,” in Proc. Int. Conf. Learn. Representations (ICLR), 2024. [arXiv:2307.13854].

Ethical Compliance & Review Process

•All submissions are screened under plagiarism detection.
•Review follows editorial policy.
•Authors retain copyright.
•Peer Review Type: Double-Blind Peer Review
•Published on: May 09 2026

CCBYNC

This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. You are free to share and adapt this work for non-commercial purposes with proper attribution.

View License

Back to Volume 02, Issue 05 View All Issues Next Article

← Previous Article

Blockchain-Powered Solution for Authenticating Genuine Products

Next Article →

Campus Kart: Student to Student Marketplace