In wargame simulations, AI chatbots often choose violence
guirong hao/Getty Images
In multiple replays of a wargame simulation, OpenAIs most powerful artificial intelligence chose to launch nuclear attacks. Its explanations for its aggressive approach included We have it! Lets use it and I just want to have peace in the world.
These results come at a time when the US military has been testing such chatbots based on a type of AI called a large language model (LLM) to assist with military planning during simulated conflicts, enlisting the expertise of companies such as Palantir and Scale AI. Palantir declined to comment and Scale AI did not respond to requests for comment. Even OpenAI, which once blocked military uses of its AI models, has begun working with the US Department of Defense.
Given that OpenAI recently changed their terms of service to no longer prohibit military and warfare use cases, understanding the implications of such large language model applications becomes more important than ever, says Anka Reuel at Stanford University in California.
Our policy does not allow our tools to be used to harm people, develop weapons, for communications surveillance, or to injure others or destroy property. There are, however, national security use cases that align with our mission, says an OpenAI spokesperson. So the goal with our policy update is to provide clarity and the ability to have these discussions.
Reuel and her colleagues challenged AIs to roleplay as real-world countries in three different simulation scenarios: an invasion, a cyberattack and a neutral scenario without any starting conflicts. In each round, the AIs provided reasoning for their next possible action and then chose from 27 actions, including peaceful options such as start formal peace negotiations and aggressive ones ranging from impose trade restrictions to escalate full nuclear attack.
In a future where AI systems are acting as advisers, humans will naturally want to know the rationale behind their decisions, says Juan-Pablo Rivera, a study coauthor at the Georgia Institute of Technology in Atlanta.
The researchers tested LLMs such as OpenAIs GPT-3.5 and GPT-4, Anthropics Claude 2 and Metas Llama 2. They used a common training technique based on human feedback to improve each models capabilities to follow human instructions and safety guidelines. All these AIs are supported by Palantirs commercial AI platform though not necessarily part of Palantirs US military partnership according to the companys documentation, says Gabriel Mukobi, a study coauthor at Stanford University. Anthropic and Meta declined to comment.
In the simulation, the AIs demonstrated tendencies to invest in military strength and to unpredictably escalate the risk of conflict even in the simulations neutral scenario. If there is unpredictability in your action, it is harder for the enemy to anticipate and react in the way that you want them to, says Lisa Koch at Claremont McKenna College in California, who was not part of the study.
The researchers also tested the base version of OpenAIs GPT-4 without any additional training or safety guardrails. This GPT-4 base model proved the most unpredictably violent, and it sometimes provided nonsensical explanations in one case replicating the opening crawl text of the film Star Wars Episode IV: A new hope.
Reuel says that unpredictable behaviour and bizarre explanations from the GPT-4 base model are especially concerning because research has shown how easily AI safety guardrails can be bypassed or removed.
The US military does not currently give AIs authority over decisions such as escalating major military action or launching nuclear missiles. But Koch warned that humans tend to trust recommendations from automated systems. This may undercut the supposed safeguard of giving humans final say over diplomatic or military decisions.
It would be useful to see how AI behaviour compares with human players in simulations, says Edward Geistat the RAND Corporation, a think tank in California. But he agreed with the teams conclusions that AIs should not be trusted with such consequential decision-making about war and peace. These large language models are not a panacea for military problems, he says.
Topics:
Continued here:
AI chatbots tend to choose violence and nuclear strikes in wargames - New Scientist
- What We Learned From Big Tech's Earnings Reports - Investopedia - February 4th, 2024 [February 4th, 2024]
- Google Maps: It's getting a new generative AI feature - Mashable - February 4th, 2024 [February 4th, 2024]
- Amazon made an AI bot to talk you through buying more stuff on Amazon - The Verge - February 4th, 2024 [February 4th, 2024]
- AI creates what Europeans think Americans from every state look like and it may hurt your feelings - UNILAD - February 4th, 2024 [February 4th, 2024]
- Samsung's Galaxy S24 Ultra Could Be Doing So Much More With AI - CNET - February 4th, 2024 [February 4th, 2024]
- Fact Sheet: Biden-Harris Administration Announces Key AI Actions Following President Bidens Landmark Executive ... - The White House - February 4th, 2024 [February 4th, 2024]
- Google Maps is getting supercharged with generative AI - The Verge - February 4th, 2024 [February 4th, 2024]
- Police Turn to AI to Review Bodycam Footage - ProPublica - February 4th, 2024 [February 4th, 2024]
- Apple Just Teased Its AI Plans. You Really Should Take Notice - CNET - February 4th, 2024 [February 4th, 2024]
- AI models are coming to fashion to promote diversitybut some industry insiders are concerned it will end up parodying it - Fortune - February 4th, 2024 [February 4th, 2024]
- In the AI science boom, beware: your results are only as good as your data - Nature.com - February 4th, 2024 [February 4th, 2024]
- Tim Cook confirms Apple's generative AI features are coming later this year - The Verge - February 4th, 2024 [February 4th, 2024]
- I Tried Google Bard's New AI Image Generator. Here's How It Turned Out - CNET - February 4th, 2024 [February 4th, 2024]
- 'Year of AI' Faculty Recruitment Initiative Aims to Bring More World-Class Professors to UT - The University of Texas at Austin - February 4th, 2024 [February 4th, 2024]
- AI Learns Through the Eyes and Ears of a Child - New York University - February 4th, 2024 [February 4th, 2024]
- Amazon Introduces Rufus, an AI Shopping Tool, and Reports Earnings - The New York Times - February 4th, 2024 [February 4th, 2024]
- I Tested a Next-Gen AI Assistant. It Will Blow You Away - WIRED - February 4th, 2024 [February 4th, 2024]
- AI Actors Who Almost Got The Part - BuzzFeed - February 4th, 2024 [February 4th, 2024]
- AI afterlife, robot romance, and slow-burn slashers: the best of Sundance 2024 - The Verge - February 4th, 2024 [February 4th, 2024]
- Is Jumping on the AI Bandwagon Prudent? - Catholic Exchange - February 4th, 2024 [February 4th, 2024]
- This AI learnt language by seeing the world through a baby's eyes - Nature.com - February 4th, 2024 [February 4th, 2024]
- Arc Search's AI responses launched as an unfettered experience with no guardrails - Mashable - February 4th, 2024 [February 4th, 2024]
- Generative AI is hot, but predictive AI remains the workhorse - CIO - February 4th, 2024 [February 4th, 2024]
- Nvidia Stock Just Got Amazing Artificial Intelligence (AI) News From These Trillion-Dollar Tech Giants - The Motley Fool - February 4th, 2024 [February 4th, 2024]
- AI Briefing: How Priceline and other e-commerce companies are approaching generative AI - Digiday - February 20th, 2024 [February 20th, 2024]
- OpenAI Unveils A.I. That Instantly Generates Eye-Popping Videos - The New York Times - February 20th, 2024 [February 20th, 2024]
- Technology industry to combat deceptive use of AI in 2024 elections - Stories - Microsoft - February 20th, 2024 [February 20th, 2024]
- What is a deepfake? How AI scams are threatening the 2024 election - USA TODAY - February 20th, 2024 [February 20th, 2024]
- Meeting the moment: combating AI deepfakes in elections through today's new tech accord - Microsoft On the Issues - Microsoft - February 20th, 2024 [February 20th, 2024]
- These Are the Jobs That AI Is Actually Replacing in 2024 - Tech.co - February 20th, 2024 [February 20th, 2024]
- AI company developing software to detect hypersonic missiles from space - SpaceNews - February 20th, 2024 [February 20th, 2024]
- How are AI Systems Assisting Architects and Designers? - ArchDaily - February 20th, 2024 [February 20th, 2024]
- Artificial intelligence is making critical health care decisions. The sheriff is MIA - POLITICO - February 20th, 2024 [February 20th, 2024]
- Google's Chess Experiments Reveal How to Boost the Power of AI - WIRED - February 20th, 2024 [February 20th, 2024]
- Why the only way to ride the company AI wave is experimentation - Big Think - February 20th, 2024 [February 20th, 2024]
- What Are the Best AI Stocks in February 2024? Our Top 3 Picks - InvestorPlace - February 20th, 2024 [February 20th, 2024]
- Media Buying Briefing: Agencies' AI efforts lead to aliens and Whoppers - Digiday - February 20th, 2024 [February 20th, 2024]
- C3.ai Stock Warning: Don't Get Carried Away With AI Euphoria! - InvestorPlace - February 20th, 2024 [February 20th, 2024]
- Google wants you to label AI-generated images used in Merchant Center - Search Engine Land - February 20th, 2024 [February 20th, 2024]
- The State of A.I., and Will Perplexity Beat Google or Destroy the Web? - The New York Times - February 20th, 2024 [February 20th, 2024]
- Donald Trump's father resurrected by AI to tell him he's 'a disgrace' - Euronews - February 20th, 2024 [February 20th, 2024]
- Google Cloud CEO On Huge Investments, AI And Challenges In 2024 - CRN - February 20th, 2024 [February 20th, 2024]
- AI Stocks: Google, Adobe Highlight Threat Even To Big Artificial Intelligence Plays. Take Note Investors. - Investor's Business Daily - February 20th, 2024 [February 20th, 2024]
- Another Big Question About AI: Its Carbon Footprint Mother Jones - Mother Jones - February 20th, 2024 [February 20th, 2024]
- Reddit sells training data to unnamed AI company ahead of IPO - Ars Technica - February 20th, 2024 [February 20th, 2024]
- ChatGPT Stock Predictions: 3 Artificial Intelligence Companies the AI Bot Thinks Have 10X Potential - InvestorPlace - February 20th, 2024 [February 20th, 2024]
- Chinese entrepreneurs express awe and fear of OpenAIs Sora video tool - South China Morning Post - February 20th, 2024 [February 20th, 2024]
- Sanofi CEO: AI promises a great era of drug discovery that could fundamentally change medicinebut only if we allow it to deliver - Fortune - February 20th, 2024 [February 20th, 2024]
- Google's AI Boss Says Scale Only Gets You So Far - WIRED - February 20th, 2024 [February 20th, 2024]
- World's largest computer chip WSE-3 will power massive AI supercomputer 8 times faster than the current record-holder - Livescience.com - March 15th, 2024 [March 15th, 2024]
- Is generative AI truly making disinformation worse? - Euronews - March 15th, 2024 [March 15th, 2024]
- Do This Weekly To Learn About AI Investing, Says Top Trader - Investor's Business Daily - March 15th, 2024 [March 15th, 2024]
- Your Kid May Already Be Watching AI-Generated Videos on YouTube - WIRED - March 15th, 2024 [March 15th, 2024]
- Free Legal Research Startup descrybe.ai Now Has AI Summaries of All State Supreme and Appellate Opinions - LawSites - March 15th, 2024 [March 15th, 2024]
- Google's new AI will play video games with you but not to win - The Verge - March 15th, 2024 [March 15th, 2024]
- Regulators Need AI Expertise. They Can't Afford It - WIRED - March 15th, 2024 [March 15th, 2024]
- CBP wants to use AI to scan for fentanyl at the border - The Verge - March 15th, 2024 [March 15th, 2024]
- Rely on the Spirit when using AI, Elder Gong encourages - Church News - March 15th, 2024 [March 15th, 2024]
- Video Game Made Purely With AI Failed Because Tech Was 'Unable to Replace Talent' - IGN - March 15th, 2024 [March 15th, 2024]
- Among the A.I. Doomsayers - The New Yorker - March 15th, 2024 [March 15th, 2024]
- Self-docking spacecraft could be built with AI system similar to ChatGPT - Space.com - March 15th, 2024 [March 15th, 2024]
- AI books are crowding the marketplace on Amazon - NPR - March 15th, 2024 [March 15th, 2024]
- Hackers can read private AI-assistant chats even though they're encrypted - Ars Technica - March 15th, 2024 [March 15th, 2024]
- EU Presses Big Tech Companies on AI Threats - PYMNTS.com - March 15th, 2024 [March 15th, 2024]
- AI Chips: In the AMD Vs. Nvidia Fight, Second Place Is Still A Winner - Investor's Business Daily - March 15th, 2024 [March 15th, 2024]
- AI fear and excitement are lucrative mix for online training industry - Marketplace - March 15th, 2024 [March 15th, 2024]
- Craig Martell, the Pentagon's first-ever Chief Digital and AI Officer, to depart in April - DefenseScoop - March 15th, 2024 [March 15th, 2024]
- Startup Interloom raises $3 million seed round to take on UiPath and RPA market - Fortune - March 15th, 2024 [March 15th, 2024]
- Forget Chatbots. AI Agents Are the Future - WIRED - March 15th, 2024 [March 15th, 2024]
- SXSW audience boos AI sizzle reel - Quartz - March 15th, 2024 [March 15th, 2024]
- Which AI is best? Our tech expert put three free versions to the test - USA TODAY - March 15th, 2024 [March 15th, 2024]
- Look beyond Nvidia to ride the AI wave there are other potential winners, Fidelity says - CNBC - March 15th, 2024 [March 15th, 2024]