中文题名: | 剥削和冲突关系下的博弈行为研究 |
姓名: | |
保密级别: | 公开 |
论文语种: | 中文 |
学科代码: | 071101 |
学科专业: | |
学生类型: | 硕士 |
学位: | 理学硕士 |
学位类型: | |
学位年度: | 2020 |
校区: | |
学院: | |
研究方向: | 社会复杂系统 |
第一导师姓名: | |
第一导师单位: | |
提交日期: | 2020-06-18 |
答辩日期: | 2020-05-31 |
外文题名: | Research on Game Behavior under Exploitation and Conflict |
中文关键词: | |
外文关键词: | Behavioral experiment ; group decision-making ; evolutionary game ; exploitation ; intergroup conflict |
中文摘要: |
人类社会是一个典型的复杂系统。人类社会中很多复杂现象的涌现和演化都与人的决策行为密切相关。研究人的决策行为有助于理解人类社会复杂现象的背后机制与原理,对于提高社会管理效率具有重要的指导意义。本文主要围绕复杂系统下的群体博弈行为,通过剥削与贡献博弈、攻击与防守博弈的行为实验以及相应的理论建模,探究群体之内个体行为原则以及群体行为的成因。
剥削与贡献博弈是类比社会与自然界中剥削与贡献的博弈行为。通过不同群体规模的对照行为实验,我们发现当剥削者与贡献者之间的关系稳定时,贡献者会趋向合作,剥削者会趋向公平,双方合作共赢占据重要地位;当群体规模增加,tit-for-tat(TFT) 策略的惩罚会被群体规模分摊,导致惩罚力度下降。因此,随着群体规模的增加,贡献者的惩罚措施对剥削者没有明显作用,剥削者选择的分配比例越来越低,最终贡献者选择忍受剥削,也就是合作。接着,我们借助演化博弈理论,构建出基础的剥削与贡献博弈的收益矩阵,并加入了 TFT 策略、声誉进行拓展。根据拓展的收益矩阵,我们分别用 Moran 过程和加入“突变率”的复制动态方程进行数值模拟。数值模拟结果很好的验证了行为实验结论。
对于攻击与防守博弈实验,我们探究了催产素在群体间竞争和冲突中的作用,以及团队成员在群体间竞争和冲突中如何依赖历史信息来协调集体行动。我们发现催产素能增强个体的合作水平,显著提高攻击组成员的协调性。此外,决策顺序也会影响个体的行为决策。顺序决策时后置位的成员会根据前置位成员的贡献值来判断本轮次是否全力进攻或防守。同时决策时,小组内部成员贡献排名非常稳定,攻击者主要关注组内成员上一轮的贡献值,而防守者主要关心对手组上一轮的贡献值以及组内贡献最高的成员上一轮的贡献值。在攻击与防守实验中,防守者的贡献值普遍高于攻击者。为了解释角色差异带来的行为差异,我们利用最优反应理论构建理论模型。最优反应模型是在实验设计的基础上,假设本轮情况与上一轮相同时应该如何决策达到最优。我们发现对于防守者来说,赢得比赛的收益远高于输掉比赛,因此他们会更加积极的贡献。而对于攻击者来说,小组的输和赢与个人的利益存在冲突性,有时候不贡献输掉比赛反而个人利益更高,因此攻击者需要更多的权衡输赢和个人利益。
|
外文摘要: |
Human society is a typical complex system. The emergence and evolution of many complex phenomena in human society are closely related to human decision-making behavior. Studying human decision-making behavior is helpful to understand the mechanism and principle behind the complex phenomena of human society, and it is of great guiding significance to improve the efficiency of social management. Although individuals in complex systems may seem to have various characteristics, they have internal connections. And some seemingly random changes follow simple rules. This paper mainly focuses on the group behavior under the complex system, through the Exploitation-Contribution game experiment, Attacker-Defender Contest game experiment and the corresponding theoretical modelings, to explore the individual behavior principle steam within the group and the causes of group behavior. The Exploitation-Contribution game is the analogy to the behavior of exploitation and contribution in society and nature. Through behavior experiments under different group sizes, we found that when the relationship between exploiters and contributors is stable, contributors tend to cooperate and exploiters tend to be fair. Win-win cooperation between the two sides holds the dominant position. When the group size increases, the punishment of the TFT strategy is apportioned by the group size, resulting in a decrease in the penalty. Thus, as the size of the group increases, contributors' punitive measures have no obvious effect on exploiters, the proportion of distribution chosen by exploiters is getting lower and lower, and ultimately contributors have to cooperate and endure exploitation. Then, with the help of evolutionary game theory, we construct the payoff matrix of the basic exploitation and contribution game and join the TFT strategy and reputation to expand. Based on the expanded payoff matrix, we use the Moran process to perform numerical simulations using the Moran process and the replication dynamic equations that add "mutation rates". The numerical simulation results are a good validation of the behavioral experiment conclusions. For the Attacker-Defender Contest game experiment, we explore the role of oxytocin in group conflict, and how team members rely on historical information to coordinate collective action in group conflict. We found that oxytocin enhances the level of cooperation of individuals and significantly improves the coordination of attack group members. Besides, the order of decision-making can affect individual behavioral decisions. In the simultaneous decision-protocol, the rear-positioned member determines whether to attack based on the contribution value of the leading member. At the same time, the contribution ranking of the intra-group members is very stable, attackers sit primarily on the contribution value of the group member in the last round, while defenders focus on the contribution value of the previous round of the opposing group and the contribution value of the member who contributed the most in the group in the previous round. In the experiments, we observed that the defenders' contribution is generally higher than the attackers'. To explain the behavioral differences brought about by the role differences, we use the best response theory to construct the theoretical model. The best response model is based on the experimental design, assuming that the people will make decisions according to the information of the previous round. We find that for defenders, the benefits of winning games are much higher than losing games, so they contribute more positively. For attackers, the winning or losing of the group conflicts with personal interests. Sometimes losing the game will get higher personal interests, so the attacker needs more trade-offs between win-loss and personal interests. |
参考文献总数: | 50 |
馆藏号: | 硕071101/20009 |
开放日期: | 2021-06-18 |