具有k期逆向有限理性者的交替议价谈判博弈

doi:10.12005/orms.2025.0371

运筹与管理 ›› 2025, Vol. 34 ›› Issue (12): 25-30.DOI: 10.12005/orms.2025.0371

具有k期逆向有限理性者的交替议价谈判博弈

杨光敬¹, 侯东爽², 孙攀飞²

1.西安电子科技大学经济与管理学院,陕西西安 710126;
2. 西北工业大学数学与统计学院,陕西西安 710072

收稿日期:2024-05-17 出版日期:2025-12-25 发布日期:2026-04-29
通讯作者: 杨光敬(1993-),男,湖北襄阳人,博士,副教授,硕士生导师,研究方向:博弈论及其应用。Email: gjyang@xidian.edu.cn。
基金资助:
国家自然科学基金资助项目(72201200,72271199,72001172);陕西省社会科学基金项目(2023R009);陕西省创新能力支撑计划项目(2024ZC-YBXM-190);中央高校基本科研业务费专项资金项目(KYFZ24020)

Alternating Offers Bargaining Games with k-period Backward Bounded Rational Players

YANG Guangjing¹, HOU Dongshuang², SUN Panfei²

1. School of Economics and Management, Xidian University, Xi’an 710126, China;
2. School of Mathematics and Statistics, Northwestern Polytechnical University, Xi’an 710072, China

Received:2024-05-17 Online:2025-12-25 Published:2026-04-29

摘要/Abstract

摘要： 在有限期交替议价谈判博弈中,参与者通过逆向归纳法寻找子博弈完美纳什均衡策略。这种方法假定参与者具备完美理性和无限计算能力,然而实际中,参与者的认知限制使得他们无法达到这种理想状态。为了更真实地描述现实中的谈判行为,本文引入k期逆向有限理性者的概念,这类参与者在博弈的最后k期内能够完美制定策略,但在更早的阶段则可能由于计算能力或记忆力的限制,无法完全遵循理性路径。本文分析了这类参与者在有限期交替议价谈判博弈过程中的策略,并探讨了当对手为完全理性参与者时,双方的期望效用如何受到影响。研究表明:(1)当k期逆向有限理性者首先提议时,完全理性者的期望效用始终高于k期逆向有限理性者,且随着贴现因子的增加而逐渐获得全部谈判价值; (2)当完全理性者首先提议时,其期望效用并不总是高于k期逆向有限理性者; (3)当完全理性者首先提议时,若k期逆向有限理性者的非理性时期数足够大,且贴现因子趋近于1,则完全理性者可获得全部谈判价值。通过数值算例,本研究验证了理论分析的准确性。

关键词: 交替议价谈判博弈, 有限理性, 逆向归纳法, 子博弈完美纳什均衡

Abstract: Bargaining games have long been a central focus in game theory research due to their wide applicability in representing various economic, political and social interactions. One prominent example is the alternating offers bargaining game, initially proposed by Ariel Rubinstein in 1982. In Rubinstein’s model, two players take turns to offer terms of agreement to each other indefinitely until an agreement is reached. The players discount future payoffs, which provides an incentive to reach an agreement sooner. The analysis of subgame perfect equilibrium in Rubinstein’s model of alternating offers utilizes backward induction, which is a form of reasoning used extensively in game theory, especially in finite extensive form games. To find the subgame perfect equilibrium of the game, it is assumed that the players make decisions that maximize their own payoff, taking into account the anticipated decisions of other players. This method assumes that each player anticipates that all players will act rationally in the future and employs this anticipation to determine their current optimal action. Hence, players are assumed to be fully rational, while also assuming that they believe all other players are fully rational.
However, these assumptions may not hold in many real-world situations. Players might not be entirely rational due to cognitive limitations, lack of information, limited memory capacity, or other factors. They may also doubt that other players are fully rational. Following Herbert Simon’s introduction of bounded rationality, scholars began to explore how such cognitive limitations affect decision-making processes. Most literature to date abstracts bounded rationality as a limitation in participants’ forward-looking capabilities. Although they provide important perspectives on how bounded rationality influences decision-making processes, these theories show certain limitations when studying finite-horizon alternating bargaining games. In such games, when the number of bargaining periods is fixed, rational participants typically use backward induction to devise strategies, rather than merely observe future outcomes. Typically, participants with bounded rationality, after being able to deduce the optimal strategies for several periods through backward reasoning, may deviate from rational behavior due to changes in circumstances, computational limitations, or memory constraints, thus leading to irrational behaviors. In practical game scenarios, this manifestation of bounded rationality is usually observed when participants are able to make fully rational decisions only in smaller-scale subgames.
In light of these complexities and the limitations, our research aims to explore the dynamics of a finite-period alternating offers bargaining game, where players exhibit this specific form of bounded rationality, namely k-period backward bounded rationality. This refers to players who can only perform backward induction reasoning up to k-periods. We construct a model that captures the essence of k-period backward bounded rationality within a finite alternating offers bargaining game. A comprehensive formalization and characterization of k-period backward bounded rationality is then presented. Moreover, we turn our attention to a bargaining game that pairs a fully rational player with a player who is only rational for the final k periods of the game, where k is less than the total number of periods m. The fully rational participant is designated as player 1 and the k-period backward bounded rational player is designated as player 2. For player 2, we posit that their behavior in the initial m-k periods is characterized by unpredictability, with both proposals and acceptances distributed uniformly at random.
Our main results depend on which player is the initial proposer. Specifically, when player 2 is the initial proposer, we find that the expected utility of player 1 is always above 1/2 and increases with the discount factor. Player 2 has an expected utility that remains below 1/2. This indicates an inherent disadvantage for the player with bounded rationality when making the initial offer. On the flip side, when player 1 is the initial proposer, the expected utilities of both players trend towards 1/4 as the discount factor approaches 0. In the opposing limit, as the discount factor approaches 1 and the number of periods of irrationality m-k for the player 2 is sufficiently large, the expected utility of player 1 asymptotically approaches 1, while that of player 2 trends towards 0. Finally, the main results are validated through computer numerical simulations to ensure the accuracy of the theoretical analysis. Surprisingly, numerical simulations reveal that, in certain scenarios, the expected utility of the k-period backward bounded rational player surpasses that of the fully rational player, a phenomenon that needs further investigation.

Key words: alternating offers bargaining games, bounded rationality, backward induction, subgame perfect equilibrium

中图分类号:

O225

杨光敬, 侯东爽, 孙攀飞. 具有k期逆向有限理性者的交替议价谈判博弈[J]. 运筹与管理, 2025, 34(12): 25-30.

YANG Guangjing, HOU Dongshuang, SUN Panfei. Alternating Offers Bargaining Games with k-period Backward Bounded Rational Players[J]. Operations Research and Management Science, 2025, 34(12): 25-30.

[1]	陈玲娟, 赵成花, 马东方, 柳祖鹏. MaaS模式下城市路网异质出行者逐日流量分配模型[J]. 运筹与管理, 2024, 33(7): 137-143.
[2]	王明涛, 孙西明. 交易者有限理性、信息噪声关系与金融市场信息效率[J]. 运筹与管理, 2024, 33(7): 180-186.
[3]	裘江南, 张方芳, 尤玥, 李梦洁, 高双燕. 人类理性程度与人机协作绩效:基于计算实验的演化分析[J]. 运筹与管理, 2024, 33(11): 37-43.
[4]	江泽武, 赵晓波, 薛超, 朱万山, 谢金星. 基于随机反应均衡的供应链激励机制设计[J]. 运筹与管理, 2023, 32(8): 1-8.
[5]	张钰, 王金亭. 面对有限理性顾客的体验式服务系统最优定价与服务速度研究[J]. 运筹与管理, 2023, 32(3): 85-91.
[6]	刘雯雯, 胡振华. 市场自治与低碳认证情形下企业低碳生产行为研究[J]. 运筹与管理, 2022, 31(4): 232-239.
[7]	吴新林, 肖海燕. 有限注意力下一类满意启发式决策的建模与理性分析[J]. 运筹与管理, 2022, 31(3): 57-62.
[8]	韩瑜, 徐海燕, 陈璐, 赵士南. 基于序列稳定性的冲突当局者行为分析模型[J]. 运筹与管理, 2022, 31(12): 38-45.
[9]	蒋惠凤, 刘益平. 基于演化博弈的大运河文化带城市间合作发展策略研究[J]. 运筹与管理, 2021, 30(10): 175-182.
[10]	陈刚, 付江月, 何美玲. 考虑居民选择行为的应急避难场所选址问题研究[J]. 运筹与管理, 2019, 28(9): 6-14.
[11]	严杰,刘人境. 基于Agent仿真的个体有限理性与众包绩效的关系研究[J]. 运筹与管理, 2018, 27(12): 115-124.
[12]	张广, 邬冬华, 唐剑雄. 不确定性下的一主多从博弈及其ε-平衡稳定性分析[J]. 运筹与管理, 2018, 27(1): 23-30.
[13]	李健, 李琳琳, 史浩. 考虑消费者有限理性退货决策的两阶段C2B电子商务供应链回购策略优化研究[J]. 运筹与管理, 2017, 26(12): 53-60.
[14]	丁占文, 葛栋梁, 蒋书敏. 基于多期有限理性的可再生资源寡头博弈模型与分析[J]. 运筹与管理, 2013, 22(5): 140-145.

具有k期逆向有限理性者的交替议价谈判博弈

Alternating Offers Bargaining Games with k-period Backward Bounded Rational Players

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 14

编辑推荐

Metrics