IJCOPE Journal

UGC Logo DOI / ISO Logo

International Journal of Creative and Open Research in Engineering and Management

A Peer-Reviewed, Open-Access International Journal Supporting Multidisciplinary Research, Digital Publishing Standards, DOI Registration, and Academic Indexing.
Journal Information
ISSN: 3108-1754 (Online)
Crossref DOI: Available
ISO Certification: 9001:2015
Publication Fee: 599/- INR
Compliance: UGC Journal Norms
License: CC BY 4.0
Peer Review: Double Blind
Volume 02, Issue 05

Published on: May 2026

CALIBRATED ACT–ASK–ABSTAIN GATING FOR AGENTIC LANGUAGE MODELS IN RESOURCE-CONSTRAINED INTERACTIVE TASKS

Balram Dutta

J.C. Bose University of Science and Technology YMCA Faridabad India

Article Status

Plagiarism Passed Peer Reviewed Open Access

Available Documents

Abstract

Agentic AI systems increasingly operate in interac- tive, multi-step environments where correct behavior demands not merely generating responses, but disciplining how and when to act, solicit clarification, or refrain altogether. Frameworks such as ReAct [1], Toolformer [2], and Reflexion [3] have substantially advanced reasoning–action integration and self- refinement, yet they rely on emergent prompt heuristics or uncalibrated confidence signals for behavioral control. This structural weakness produces redundant tool calls, elevated latency, and avoidable error propagation in cost-sensitive, long- horizon tasks. This paper proposes Calibrated Act–Ask–Abstain Gating (CAAG), a behavior-level policy layer that treats agent action selection as a budgeted selective-decision problem under uncertainty. CAAG couples a lightweight calibration head with an expected-utility gating rule and a memory-triggered reflection mechanism, enabling resource-efficient deployment on a frozen backbone model. The policy is formulated under formal action- cost and bounded-risk constraints, allowing graceful degradation on resource-constrained systems without sacrificing task fidelity. Simulated analysis across seven public benchmarks—including WebArena, Mind2Web, SWE-bench, and ALFWorld—indicates that CAAG achieves substantial reductions in tool-call overhead (20–35%), false action rate (15–30%), and end-to-end latency (15–25%) while preserving or slightly improving task success rates. CAAG positions basic behavioral calibration as a first-class optimization target in agentic AI, a missing design principle in contemporary agent architectures.

Index Terms—Agentic AI, uncertainty calibration, selective prediction, tool use, resource-efficient agents, act–ask–abstain policy, behavioral gating.

How to Cite this Paper

Dutta, B. (2026). Calibrated Act–Ask–Abstain Gating for Agentic Language Models in Resource-Constrained Interactive Tasks. International Journal of Creative and Open Research in Engineering and Management, <i>02</i>(05). https://doi.org/10.55041/ijcope.v2i5.306

Dutta, Balram. "Calibrated Act–Ask–Abstain Gating for Agentic Language Models in Resource-Constrained Interactive Tasks." International Journal of Creative and Open Research in Engineering and Management, vol. 02, no. 05, 2026, pp. . doi:https://doi.org/10.55041/ijcope.v2i5.306.

Dutta, Balram. "Calibrated Act–Ask–Abstain Gating for Agentic Language Models in Resource-Constrained Interactive Tasks." International Journal of Creative and Open Research in Engineering and Management 02, no. 05 (2026). https://doi.org/https://doi.org/10.55041/ijcope.v2i5.306.

Search & Index

References

[1]     S. Yao, J. Zhao, D. Yu, N. Du, I. Shafran, K. R. Narasimhan, and Y. Cao, “ReAct: Synergizing Reasoning and Acting in Language Models,” in Proc. Int. Conf. Learn. Representations (ICLR), 2023. [Online]. Avail- able: https://openreview.net/forum?id=WE vluYUL-X

[2]     T. Schick, J. Dwivedi-Yu, R. Dessi, R. Raileanu, M. Lomeli, L. Zettle- moyer, N. Cancedda, and T. Scialom, “Toolformer: Language Models Can Teach Themselves to Use Tools,” in Proc. Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 36, 2023.

[3]     N. Shinn, F. Cassano, B. Labash, A. Gopinath, K. Narasimhan, andYao, “Reflexion: Language Agents with Verbal Reinforcement Learn- ing,” in Proc. Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 36, 2023. [arXiv:2303.11366].

[4]     S. Yao, D. Yu, J. Zhao, I. Shafran, T. L. Griffiths, Y. Cao, andR. Narasimhan, “Tree of Thoughts: Deliberate Problem Solving with Large Language Models,” in Proc. Adv. Neural Inf. Process. Syst. (NeurIPS), vol. 36, 2023.

[5]     Y. Geifman and R. El-Yaniv, “SelectiveNet: A Deep Neural Network with an Integrated Reject Option,” in Proc. Int. Conf. Mach. Learn. (ICML), 2019, pp. 2151–2159.

[6]     H. Liu, Z.-Y. Dou, Y. Wang, N. Peng, and Y. Yue, “Uncertainty Calibration for Tool-Using Language Agents,” in Findings of the Assoc. Comput. Linguistics: EMNLP, 2024, pp. 16781–16805.

[7]     X. Liu et al., “AgentBench: Evaluating LLMs as Agents,” arXiv preprint arXiv:2308.03688, 2023.

[8]     S. Zhou et al., “WebArena: A Realistic Web Environment for Building Autonomous Agents,” in Proc. Int. Conf. Learn. Representations (ICLR), 2024. [arXiv:2307.13854].

Ethical Compliance & Review Process

  • All submissions are screened under plagiarism detection.
  • Review follows editorial policy.
  • Authors retain copyright.
  • Peer Review Type: Double-Blind Peer Review
  • Published on: May 09 2026
CCBYNC

This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. You are free to share and adapt this work for non-commercial purposes with proper attribution.

View License
Scroll to Top