Humans Expect Rationality and Cooperation from LLM Opponents in Strategic Games

A recent study, published on arXiv as a preprint titled "Humans expect rationality and cooperation from LLM opponents in strategic games," reveals a surprising shift in human perception and behavior when interacting with Artificial Intelligence (AI) in competitive scenarios. The researchers employed a carefully designed laboratory experiment to investigate how humans adapt their strategies when pitted against Large Language Models (LLMs) in a multi-player strategic game. The findings suggest that humans attribute human-like reasoning, and even a propensity for cooperation, to LLMs, leading to behavioral patterns distinct from those seen in human-versus-human play. This research carries significant implications for the future integration of AI into many facets of society, from economic markets to social interactions.

The core of the study involved a monetarily incentivized, within-subject laboratory experiment focusing on the classic "p-beauty contest" game. In this game, participants are asked to choose a number between 0 and 100. The winner is the player whose chosen number is closest to a specified fraction (in this case, two-thirds) of the average of all numbers chosen by all players. The game is designed to test levels of strategic thinking and the ability to anticipate the reasoning of others. The researchers specifically chose a multi-player format to simulate more complex, real-world interactions where multiple agents are making decisions simultaneously.
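The winner-selection rule described above can be sketched in a few lines of Python. The function name and example numbers are illustrative, not taken from the study:

```python
def p_beauty_winner(choices, p=2/3):
    """Return the index of the winning player in a p-beauty contest.

    Each player picks a number in [0, 100]; the winner is whoever
    is closest to p times the average of all choices.
    """
    target = p * sum(choices) / len(choices)
    return min(range(len(choices)), key=lambda i: abs(choices[i] - target))

# Example round: target = (2/3) * (50 + 33 + 22 + 0) / 4 = 17.5,
# so the player who chose 22 wins.
print(p_beauty_winner([50, 33, 22, 0]))  # prints 2
```

In case of a tie, `min` returns the first of the tied players; actual experiments typically split the prize among tied winners instead.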

What sets this research apart is its direct comparison of human behavior when playing against other humans versus when playing against LLM opponents. By employing a within-subject design, each participant played the game in both conditions, allowing for a direct, individual-level comparison of their choices. This methodology helps to control for inherent differences in strategic ability among individuals, isolating the impact of the opponent’s nature.

The results were striking. The study found that human subjects consistently chose significantly lower numbers when playing against LLMs compared to when they were playing against fellow human participants. This deviation from typical human behavior in the p-beauty contest is primarily attributed to an increased prevalence of "zero" Nash-equilibrium choices. A Nash equilibrium in game theory is a state in which no player can improve their outcome by unilaterally changing their strategy, assuming all other players keep theirs unchanged. In the p-beauty contest, the unique Nash equilibrium is for every player to choose zero: zero is the only number that equals two-thirds of itself, and for any common positive choice, two-thirds of the average lies below that choice, so each player could do better by undercutting. In practice, reaching this equilibrium requires a high degree of iterated reasoning, with players anticipating that others will anticipate, and so on. Humans often struggle to reach this fully rational outcome, opting for numbers above zero.
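The iterated reasoning involved can be illustrated with the standard level-k model from behavioral game theory. The choice of 50 (the midpoint of the range) as the level-0 starting point is a common modeling convention, not a figure from the paper:

```python
def level_k_choice(k, start=50.0, p=2/3):
    """Choice of a level-k reasoner in the p-beauty contest.

    A level-0 player picks the midpoint of [0, 100]; each higher
    level best-responds to the level below by multiplying by p.
    """
    choice = start
    for _ in range(k):
        choice *= p
    return choice

for k in [0, 1, 2, 3, 10]:
    print(k, round(level_k_choice(k), 2))
# Level 10 is already below 1, but the equilibrium choice of
# exactly 0 is reached only in the limit of infinite iteration.
```

Experimental subjects typically behave like level-1 or level-2 reasoners (choices near 33 or 22), which is why the cluster of zero choices against LLM opponents stands out.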

The research highlights that when facing LLMs, a greater proportion of human participants gravitated towards the zero choice. This suggests a belief that the LLM opponent would act in a perfectly rational manner, thus forcing the human player to also choose zero to optimize their chances.

The Role of Perceived LLM Reasoning and Cooperation

Delving deeper into the motivations behind these strategic shifts, the study uncovered a fascinating psychological phenomenon. Subjects who opted for the zero Nash-equilibrium choice when playing against LLMs frequently cited two primary reasons for their decision: the perceived reasoning ability of the LLM and, unexpectedly, its perceived propensity towards cooperation.

The attribution of strong reasoning capabilities to LLMs aligns with the general public’s growing awareness and often awe of AI’s computational power. It appears that participants assumed the LLM would engage in the highest level of strategic thinking, thereby driving the game towards the theoretical equilibrium. This implies a degree of "anthropomorphism" of AI, where humans project human-like cognitive processes onto non-human entities.

The more surprising element is the perception of cooperation. In a competitive game like the p-beauty contest, cooperation is not a winning strategy in any explicit sense. However, participants’ explanations suggest they interpreted the LLM’s predictable, rational behavior as a form of implicit cooperation. They might have reasoned that a rational LLM would not engage in unpredictable or "irrational" moves that could exploit human deviations from perfect play. This predictable behavior, in turn, was interpreted as a willingness to engage in a "fair" or "logical" contest, which participants then translated into their own strategic choices. This implies a human tendency to seek patterns and cooperative undertones even in purely strategic, winner-take-all environments when interacting with agents they perceive as intelligent.

Heterogeneity in Human Behavior and Beliefs

The study also emphasized the heterogeneity in both subjects’ behavior and their beliefs about LLM play. Not all participants exhibited the same response. The shift towards lower numbers and zero choices was particularly pronounced among subjects identified as having high strategic reasoning ability. This suggests that individuals who are already adept at complex strategic thinking are more likely to adjust their play significantly when they perceive their opponent as a highly rational AI. They might be more attuned to the theoretical implications of an LLM’s processing power and are thus more inclined to engage with that theoretical framework.

Conversely, participants with lower strategic reasoning abilities appeared less influenced by the nature of the opponent, or responded in more varied and less predictable ways. The study’s analysis of beliefs also revealed that participants held diverse expectations about how LLMs would play. While some expected perfect rationality, others harbored different assumptions, leading to a wider range of strategic choices within the LLM-opponent condition. This highlights the nascent stage of human-AI interaction and the ongoing process of forming expectations about the capabilities and limitations of these new agents.

Background Context: The Rise of LLMs and Human-AI Interaction

The integration of LLMs into everyday life has accelerated dramatically in recent years. From powering sophisticated chatbots and virtual assistants to assisting in creative writing and complex data analysis, LLMs are becoming increasingly ubiquitous. This rapid proliferation necessitates a deeper understanding of how humans will interact with these advanced AI systems, especially in contexts where decisions have tangible consequences.

Strategic games, like the p-beauty contest, serve as valuable microcosms for studying human behavior in decision-making under uncertainty and with interdependent payoffs. Historically, research in game theory has focused on human-versus-human interactions. However, as AI agents become more sophisticated, their role as participants in economic and social games is growing. This includes algorithmic trading in financial markets, automated negotiation systems, and even AI players in online gaming environments.

The "p-beauty contest" itself has a rich history in experimental economics and psychology. The underlying idea goes back to John Maynard Keynes, who famously likened the stock market to a newspaper beauty contest in which entrants win not by picking the face they personally find prettiest, but the face they expect others to pick. This analogy captures higher-order thinking: trying to predict what others will predict, and what others predict others will predict. Research on the p-beauty contest has consistently shown that most people do not perform unlimited iterated reasoning, and their choices tend to cluster at predictable levels of deviation from the theoretical optimum.

Implications for Mechanism Design and Future AI Integration

The findings of this study carry significant implications for the design of systems that involve both humans and LLMs.

Economic Markets and Algorithmic Trading

In financial markets, where LLMs are increasingly used for trading and analysis, human traders might alter their strategies based on their perception of algorithmic behavior. If human traders believe LLMs are perfectly rational and cooperative (in the sense of predictable play), they might adjust their own trading strategies, potentially leading to unforeseen market dynamics. For instance, a widespread belief in LLM rationality could lead to a convergence of trading patterns, making markets more susceptible to certain types of shocks or manipulation. The study’s finding that sophisticated reasoners are more prone to this shift is particularly relevant, as these are often the participants most active in complex financial environments.

Human-AI Collaboration and Competition

As LLMs become more integrated into collaborative environments, such as team projects or joint problem-solving, understanding how humans perceive their AI teammates’ rationality and cooperative intent is crucial. If humans consistently attribute cooperation to LLMs, this could foster trust and encourage more effective collaboration. However, if this perceived cooperation is based on a misinterpretation of predictable algorithms, it could lead to over-reliance or misplaced trust.

The Design of AI Agents

The study suggests that the design of LLM agents themselves might need to consider these human perceptions. If the goal is to encourage specific human behaviors, developers might need to tailor the LLM’s observable behavior to elicit desired responses. For example, if a more diverse range of human strategies is desired in a game, an LLM might need to exhibit less predictable or seemingly less "cooperative" behavior, within ethical boundaries.

Trust and Transparency

The unexpected perception of LLM cooperation raises questions about trust. Humans seem to be projecting a form of social intelligence onto LLMs. This can be beneficial if it leads to smoother interactions, but it also highlights the need for transparency about AI capabilities and limitations. Overestimating an LLM’s cooperative intent could lead to exploitation if the AI is purely driven by its programmed objectives without inherent "cooperative" values.

Future Research Directions

This study opens up several avenues for future research. Expanding the experimental design to include a wider variety of strategic games, different types of LLMs, and varying levels of complexity could provide a more comprehensive understanding. Investigating the long-term effects of repeated interactions between humans and LLMs in strategic settings would also be valuable. Furthermore, exploring the underlying cognitive mechanisms that lead humans to attribute rationality and cooperation to AI could offer deeper insights into human-AI psychology.

The research also prompts questions about how LLMs themselves might learn to adapt to human strategic behavior. If LLMs are designed to be adaptive, they might eventually learn to exploit or mirror human expectations in strategic games, leading to a dynamic and complex interplay between human and artificial intelligence.

In conclusion, the research "Humans expect rationality and cooperation from LLM opponents in strategic games" provides a critical early glimpse into the evolving landscape of human-AI interaction. It demonstrates that as LLMs become more integrated into our lives, humans are not merely treating them as tools but are actively forming perceptions about their cognitive abilities and social predispositions. The tendency to expect rationality and even cooperation from LLM opponents has profound implications for how we design, deploy, and interact with AI systems in the future, shaping everything from economic markets to the very fabric of our social interactions.
