查看论文信息

查看全文

查看论文信息

中文题名：	非对称重复博弈中的策略研究
姓名：	王年翼
保密级别：	公开
论文语种：	chi
学科代码：	070104
学科专业：	应用数学
学生类型：	硕士
学位：	理学硕士
学位类型：	学术学位
学位年度：	2023
校区：	北京校区培养
学院：	数学科学学院
研究方向：	演化博弈论
第一导师姓名：	张博宇
第一导师单位：	数学科学学院
提交日期：	2023-06-07
答辩日期：	2023-06-02
外文题名：	Strategy Research in Asymmetric Repeated Games
中文关键词：	重复囚徒困境博弈 ; 随机环境 ; 策略竞赛 ; 进化稳定策略
外文关键词：	repeated prisoner’s dilemma ; random environment ; Axelrod’s tournament ; evolutionary stable strategy
中文摘要：	︿本文研究了两种不同的重复博弈：非对称囚徒困境及收益随机环境下的博弈。在非对称囚徒困境研究中，非对称性体现为博弈双方选择合作的收益-代价之比不同。Axelrod 策略竞赛被用于检验不同非对称参数下不同策略的收益，最终得到了优势方更加倾向于合作的结论。在收益具有随机性的博弈中，收益矩阵中的每一项被设定为一个均值为 0 的随机变量，参与者根据一段时间的累计收益被决定在演化过程中死亡或复制自身策略以产生新策略。仅考虑响应式策略时的进化稳定策略被计算出来，并对两种极端情况下参与人会选择的策略做了理论解释。﹀
外文摘要：	︿ We investigates two different asymmetric repeated games: asymmetric prisoner’s dilemma and games in a random payoff environment. In the study of asymmetric prisoner’s dilemma, the asymmetry is represented by the different ratios of payoffs-costs incurred by the two players. The Axelrod strategy competition is used to test the payoffs of different strategies under different asym-metric parameters, and the conclusion is that the dominant player is more inclined to cooperate. In the game with random payoffs, each item in the payoff matrix is set as a random variable with a mean of zero. Participants are determined to die or replicate their strategy in the evolutionary process based on the cumulative payoff over a period of time. Evolutionarily stable strategies are calculated under settings of reactive strategies, and the strategies that participants will choose in two extreme situations are theoretically explained. ﹀
参考文献总数：	12
馆藏号：	硕070104/23007
开放日期：	2024-06-12

附件下载