GPT-3 for diplomacy?

Published on 24 September 2020 | Updated on 05 April 2024

Subscribe to Diplo's Blog

The artificial intelligence (AI) Generative Pre-trained Transformer 3 (GPT-3) can write texts on any topic. OpenAI, the organisation that developed and released it as a beta version in June 2020, describes it as a general-purpose application for creating text, ‘allowing users to try it on virtually any English language task’. GPT-3 is a scaling up, by two orders of magnitude, of the previous model released by OpenAI, making it ‘the most powerful natural language processing (NLP) application available today’.

The promises are greater accuracy and an improved ability to transfer things learned in one context to a different context. Overall, GPT-3 can mimic a variety of styles and genres, and in doing so, return texts that look very much like having been written by a human. The Guardian recently used it to write an article. So, what does this mean for diplomats whose daily work is steeped in the art and craft of language?

Automated diplomacy?

When thinking through the use of AI for specific tasks and within specific professions, it is useful to distinguish between augmentation and automation. Augmentation describes a situation where parts of a task are taken over by a machine. Automation means that the whole process is taken over by a machine with extremely minimal, if any, human intervention. What can GPT-3 deliver in terms of augmented and automated diplomacy?

Augmentation: Efficiency tools

OpenAI’s website includes a number of use cases that are also applicable to the work of diplomats. First, the company CaseText uses GPT-3 to search through legal documents and to facilitate litigations and presentations by lawyers. Similar applications in the area of international law are not hard to imagine, and have indeed already been suggested and tested (the Cognitive Trade Advisor is an example). Second, productivity tools that lead to better decisions could also be applied in the field of diplomatic practice. Third, ‘comprehension tools’, that provide quick summaries of long texts, might also eventually aid the work of diplomats. As these tools become more widely available and used, it is not far-fetched to suggest that diplomats will use them in their daily work, either as off-the-shelf productivity tools or as custom-build systems that take the specifics of the work of diplomats into account. With GPT-3 becoming available beyond the beta version, developing custom applications should move within easy reach. It’s also worth pointing out that the tools described here are nothing new, the difference being that GPT-3 is the latest and most powerful NLP tool available today.

The promise associated with use cases like these is greater efficiency and productivity. While this resonates well in a business context, it resonates less when it comes to diplomatic practice. To be clear, ministries of foreign affairs are under budgetary constraints and have an obligation to use public money responsibly. It can also be an advantage to be faster and more efficient when doing research in preparation for a negotiation. However, finding an agreement or being successful in negotiating texts cannot be measured by these efficiency metrics. While greater efficiency can be an advantage for negotiators and can level the planning field for small and developing states, it does not win you the overall ‘battle’.

Automation: Diplomatic writing tasks

GPT-3 delivers some interesting results on the basis of an initial short piece of text submitted to the system. It matches the tone and style and returns a text that is, more often than not, understandable and reasonable. More importantly, it is hard, if not impossible, to distinguish that text from a piece written by a human being.

Therefore, we can assume that the system will be able to match the tone and style of a typical diplomatic speech, for example, those delivered at the opening of the UN General Assembly each year. It is also feasible that it will match certain positions and interests based on the initial short text submitted to it. If you give the system a speech by Prime Minister of New Zealand Jacinda Ardern, it will very likely return a text that believably sounds like a speech by her. If you give the system a speech by US President Donald Trump, it will very likely return a text that believably sounds like a speech by him.

While such a text might be interesting as an initial suggestion or a general template, it will need a lot of editing and rewriting. Although we were not able to test GPT-3 ourselves, we assume that the text, also passable as having been written by a human being, will still miss the mark in the context of diplomatic practice. The following aspects are very likely missing: overall coherence; references to specific examples that are most useful in this context; references to historic moments important for an occasion; and an understanding of the relations between countries and how they should be reflected, often implicitly, in specific parts of the speech.

The explanation for these doubts and potential shortcomings is simple: GPT-3 operates by mapping relationships between words without having an understanding of the meaning of the words. It’s great at predicting the next word in a sentence, but lacks understanding of the overall context. This explains the statement from Open AI that GPT-3’s ‘success generally varies depending on how complex the task is’. For these more complex tasks, human editors and writers are needed. For example, it’s also worth noting that, according to the editor’s note accompanying the Guardian article mentioned above, the article was a piece of augmented, not automated, journalism. Journalists selected and rearranged passages, and the article went through the usual editing process. An opinion piece also published in the Guardian suggested that 90% of the text generated by GPT-3 was discarded before editing.

This is not to take away from the fact that GPT-3 is a huge accomplishment and a big step for these types of language processing AIs. It might serve as a way of making speech-writing quicker by already providing templates and useful suggestions. In this sense, it could work much like the autocomplete function in e-mail services and word processors. This brings us back to the automation-vs-augmentation question, and the, perhaps, reassuring knowledge that neither diplomats nor human speech-writers are likely to be replaced anytime soon.

The way forward?

Without having tested GPT-3 ourselves, we cannot be sure, but the hunch is that more specialised systems are needed in the area of diplomacy. In a paper released by the mothers and fathers of GPT-3, it is suggested that relying on a more-text-more-computing-power approach will eventually come up against limits. With such an approach, the system becomes better and better at predicting the word most likely to appear next in a sentence. It does not, however, become better at keeping the next sentence or the text as a whole ‘in mind’ (for a detailed discussion of this point, see this article on GPT-3). For that, a different approach is needed.

At DiploFoundation, as part of our AI humAInism project, we have experimented with how this different approach could look like in the field of diplomacy. Our own Speech Generator is meant as an illustration of what can be done and how it can be done. Diplomats working in the field of digital policy and cybersecurity will find it particularly interesting to experiment with. The Speech Generator allows for selecting an opinion on various key topics on the basis of which a speech is generated.

In contrast to applications like GPT-3, we tried to mimic the human process of writing a speech by using smaller algorithms trained for specific tasks, such as an algorithm for finding keywords and phrases (‘underlining’), an algorithm for recommending paragraphs on a specific topic, an algorithm for summarising paragraphs, etc. As our developer Jovan Njegic would say, ‘in this way, we try to form a system of interconnected algorithms, which imitate not the results of the writing process, but the human process of reasoning during speech-writing’. This also means that if a result is not appropriate, the user can go back and tweak the process. Our speech generator is an illustration, not a fully fledged application for diplomats, but it might just point us in the right future direction.

Events Blogs Resources

Advancing ethical and practical AI in diplomacy and governance in the Gulf

23 Sep 25 - 29 Sep 25Gulf region

AI, Governance and Philosophy – A Global Dialogue

07 Aug 25 - 17 Aug 25China

AI and diplomacy – Workshop at ITU

16 Jun 25 - 16 Jun 25Geneva, Switzerland

AI Policy Summit 2025

03 Oct 25 - 04 Oct 25ETH Zurich, Online

Introducing the WSIS+20 for the Asia Pacific Internet Community

03 Jun 25 - 03 Jun 25Online

Diplo/GIP at IGF 2025

23 Jun 25 - 27 Jun 25Lillestrøm, Norway

Tech attache briefing: UN80 Initiative, AI, and digital governance

28 May 25 - 28 May 25Geneva - In Situ

Expert Workshop on the Rule of Law and Human Rights Aspects of Using Artificial Intelligence for Counter-Terrorism Purposes

08 May 25 - Geneve Centre for Security Policy

Swiss Plateforme Tripartite: Meeting on WSIS+20

06 May 25 - 06 May 25

WSIS+20 review: What’s in it for Africa?

07 May 25 - 07 May 25Geneva

Trump and tech: After 100 days

30 Apr 25 - 30 Apr 25Online

AI Apprenticeship for International Organisations blended course

29 Apr 25 - 29 Apr 25Geneva and online

Origins of AI: From neurons to neural networks

As AI penetrates more and more areas of our work, study, and life, it’s being used and adopted by people far beyond the tech-savvy among us. So it’s always a good moment to pause, learn an[...]

DiploFoundation

17 Sep, 2025

Arabic philosophical traditions in the AI era

Ahead of Diplo’s visit to the Gulf region, and while still reflecting on the impressions from the AI, Governance and Philosophy – A Global Dialogue, namely the fact that no single cultural milieu [...]

Andrej Škrinjarić

16 Sep, 2025

Diplomacy in beta: From Geneva principles to Abu Dhabi deliberations in the age of algorithms

The world is changing fast — but how fast is diplomacy keeping up? The Hili Forum in Abu Dhabi (8–9 September 2025) brought together policymakers, diplomats, and experts to explore how technology,[...]

Vladimir Radunović

15 Sep, 2025

Why apprenticeship and storytelling are the future of learning in the AI Era

The biggest obstacle to the AI transformation isn’t the technology itself. It’s the way we still teach. This simple chart says it all: the use of ChatGPT among students drops sharply at the end of[...]

Jovan Kurbalija

03 Sep, 2025

From summer disillusionment to autumn clarity: Ten lessons for AI

It is 1 September, the start of the academic and diplomatic year, and this time, AI sits at the centre of attention. As students return to classrooms and diplomats to negotiation tables, the question [...]

Jovan Kurbalija

01 Sep, 2025

Inclusive AI governance: Universal values in a pluralistic world

The AI, Governance and Philosophy – A Global Dialogue is now finished, the impressions have settled down, so the time has come to reflect on the lessons learned and experience lived and try to answe[...]

Andrej Škrinjarić

28 Aug, 2025

Survive the AI jargon tsunami: Find shelter in your mother tongue

English dominates the AI landscape, but this hegemony can hinder our understanding of AI's deeper, non-technical aspects. The recent explosion of AI jargon often obscures meaning and can lead to cogni[...]

Jovan Kurbalija

17 Aug, 2025

Can AI replace the transmission of wisdom?

The world of education is changing radically and rapidly. Generative AI tools are now capable of writing essays, solving math problems, summarising textbooks, and even personalising learning experienc[...]

Andrej Škrinjarić

16 Aug, 2025

AI and the wisdom of generations

The design of AI systems is often framed in terms of innovation, efficiency, and speed. However, these metrics alone cannot tell us whether AI serves society well or quickly. On day 5 of the AI, Gover[...]

Andrej Škrinjarić

13 Aug, 2025

Harmony and tools: Chinese philosophical traditions and their vision of technological change

In today’s fast-evolving technological landscape, global debates about the governance and ethical implications of artificial intelligence (AI) often pivot around Western paradigms, be it liberal ind[...]

Andrej Škrinjarić

08 Aug, 2025

The open-source gambit: How America plans to outpace AI rivals by democratising tech

On July 23, the U.S. unveiled an AI Action Plan featuring 103 recommendations focused on winning the AI race against China. Key themes include promoting open-source AI to establish global standards, r[...]

Jovan Kurbalija

25 Jul, 2025

Military AI: Operational dangers and the regulatory void

As military AI becomes operational in today’s conflicts, the lack of regulation and accountability risks turning warfare into a domain governed by opaque algorithms and unchecked escalation. Without[...]

Julia Williams

09 Jul, 2025

2025

The latest from Diplo and GIP

Tailor your subscription to your interests, from updates on the dynamic world of digital diplomacy to the latest trends in AI.

Subscribe to more Diplo and Geneva Internet Platform newsletters!

Subscribe now

Trending in Diplo Academy

Trending in Resources

Trending in Topics

Courses & Programmes

Faculty & Alumni

Publications

Research

Trending in Blogs

Diplo Events

DigWatch Events

Trending Projects

Contact us

Social icons

GPT-3 for diplomacy?

See also

Subscribe to Diplo's Blog

Automated diplomacy?

Augmentation: Efficiency tools

Automation: Diplomatic writing tasks

The way forward?

The latest from Diplo and GIP

Diplo: Effective and inclusive diplomacy

Diplo on Social