Can AI understand our language?

Published on 14 June 2023 | Updated on 19 March 2024


Transcription software is now crucial for many businesses, journalists, researchers, and other professionals who need to convert audio or video recordings into text. These programs automate the transcription process, which saves time and effort and can improve accuracy. Many of them support multiple languages and offer editing options, speaker identification, and timestamps. In general, transcription software is a valuable resource for producing efficient and precise transcripts for activities such as research, content development, and documentation. Several kinds of transcription software are available, such as automatic, professional, and speech-to-text software.

Over 60 AI-based transcription applications use automatic speech recognition (ASR). These programs are often bundled with meeting platforms like Zoom or are available as standalone apps, and they are becoming more powerful and gaining new capabilities such as meeting summarisation. At Diplo, we use these applications extensively, but our primary challenge is the diverse accents and dialects of English spoken by our professors and students, who come from Asia, Africa, the Balkans, Latin America, and elsewhere. Since ASR systems are mostly trained on datasets of native speakers, they may struggle to accurately transcribe accented or dialectal speech. Differences in the speed of speech and the use of professional or technical language can also affect transcription accuracy. This is commonly the case at Diplo, which deals with digital diplomacy, governance, and more. Our comparative research looked at two types of speeches: general diplomatic speeches and specialised internet governance speeches, spoken in English with accents or dialects such as Indian, Chinese, African (Kenyan and Nigerian), Russian, Balkan (Serbian), German, French, Spanish (Spain and Mexico), Portuguese (Portugal and Brazil), and others.

After transcribing video and audio content from multiple speakers with diverse dialects of English, we obtained the following outcomes.

After conducting a detailed analysis, we recommend Otter and Grain as the best software options for an organisation like Diplo. Both applications can recognise different dialects, which is crucial for our organisation. Furthermore, they both offer fast transcription with an impressive accuracy rate of 99%. The only difference between the two is that Grain supports more languages than Otter.
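For readers who want to run a similar comparison on their own recordings, one common way to quantify transcription accuracy is the word error rate (WER): the number of word substitutions, deletions, and insertions needed to turn the software's output into a human-checked reference transcript, divided by the number of words in the reference. The sketch below is a minimal illustration of that metric, not a description of how the accuracy figure above was measured. The function name and the sample sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: simple lowercasing and whitespace tokenisation; real
    evaluations usually also normalise punctuation and numbers.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # turning i reference words into nothing takes i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from output
                curr[j - 1] + 1,     # extra word in output
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: a human-checked reference and a tool's output.
reference = "the internet governance forum meets in geneva"
hypothesis = "the internet government forum meets geneva"
wer = word_error_rate(reference, hypothesis)
print(f"WER: {wer:.2%}, word accuracy: {1 - wer:.2%}")
```

Under this definition, an accuracy of 99% corresponds to roughly one wrong, missing, or extra word per hundred reference words. Computing the score separately for each accent group and for each type of speech (general diplomatic versus specialised internet governance) would show where a given tool struggles.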
