language AI Archives - CLEAR Global https://clearglobal.org/tag/language-ai/ Tue, 02 Dec 2025 13:01:26 +0000 en-US hourly 1 https://clearglobal.org/wp-content/uploads/2021/05/cropped-CLEAR-SM-Logos_Blue-1-32x32.png language AI Archives - CLEAR Global https://clearglobal.org/tag/language-ai/ 32 32 The future of language technology and why industry partnership matters https://clearglobal.org/the-future-of-language-technology-and-why-industry-partnership-matters/ https://clearglobal.org/the-future-of-language-technology-and-why-industry-partnership-matters/#respond Wed, 03 Dec 2025 13:29:00 +0000 https://clearglobal.org/?p=76174 By Britta Aagaard The digital divide runs along linguistic lines. While billions of people now carry powerful computing devices in their pockets, most of […]

The post The future of language technology and why industry partnership matters appeared first on CLEAR Global.

]]>

By Britta Aagaard

The digital divide runs along linguistic lines. While billions of people now carry powerful computing devices in their pockets, most of the world’s 7,000+ languages remain locked out of the digital revolution. The technology that powers our voice assistants, translation apps, and search engines works beautifully for English, Mandarin, and maybe a hundred other widely-spoken languages. But for the rest, there’s a gap.

I’ve spent 25 years in the language and localization industry, watching this gap widen despite our best efforts. The truth is that no single company, government, or organization can solve this problem alone. The scale is simply too vast, the linguistic diversity too complex, and the resources required too substantial for any one entity to tackle effectively.

Developing robust language technology for a single low-resource language requires extensive text corpora, audio recordings, linguistic expertise, computational resources, and years of iterative development. Multiply that by thousands of languages, and the challenge becomes clear. We need a fundamentally different approach.

Community leadership is the answer to sustainability

The most sustainable language technology comes from communities themselves. At CLEAR Global, I’ve witnessed community-driven projects outlast and outperform top-down initiatives. When speakers of Aymara, Kanuri or Marma lead the development of solutions for their own languages, something remarkable happens. The technology reflects actual usage patterns, incorporates cultural nuances that outsiders miss, and enjoys immediate adoption because it solves real problems that the community itself identified.

This is a pragmatic strategy. Community ownership ensures long-term maintenance and evolution of language tools. When people shape the initiatives that affect them, they don’t just use the resulting technology; they champion it, improve it, and adapt it to changing needs. A translation app developed by and for a language’s own speakers will always be more accurate, more culturally relevant, and more widely used than one created by well-meaning technologists who don’t speak the language.

Community enthusiasm alone isn’t enough, though. These projects need fuel: funding, technical infrastructure, and expert support. Strategic investment transforms potential into reality. A small nonprofit working with low-resource language communities can accomplish far more with proper backing than a Fortune 500 company launching a standalone commercial product. The nonprofit builds capacity within the community itself, creating sustainable ecosystems rather than dependencies.

Strategic funding multiplies impact

When you fund technical support for one community language project, you help far more than that single language. You create knowledge, tools, and training that can be adapted by plenty of other communities. You build networks of practitioners who share knowledge and resources. You demonstrate what’s possible, inspiring similar efforts worldwide. This networked, supported approach delivers exponentially greater impact than fragmented efforts or ventures that serve only the most profitable language markets.

Our baseline challenge: language data

But none of this works without high-quality data. Language technology runs on data: text, audio, grammars, linguistic annotations, and existing translations. For most of the world’s languages, we either don’t have this data in digital form or it exists in quantities far too small to train effective AI models. We desperately need sustained funding for research that addresses this fundamental bottleneck. This means supporting linguists working on documentation, funding digitization of existing materials, and developing new methodologies for efficient data collection in resource-constrained environments, or alternative routes such as synthetic language data.

The research challenges are significant but not insurmountable. How do we build effective language models with limited training data? What ethical frameworks should govern data collection and technology deployment in affected communities? These questions require serious investigation and solution design work, and that research needs funding commitments that extend beyond single grant cycles. Funding research that increases the availability and quality of low-resource language data is essential.

Moving forward with a collaborative approach

The language technology landscape stands at a crossroad. We can continue on the current path, where a handful of languages enjoy cutting-edge tools while thousands of others fade into digital obscurity. Or we can embrace a collaborative, community-centered model that distributes both the work and the benefits across the full spectrum of human linguistic diversity.

This vision requires resources. CLEAR Global is a nonprofit organization working to make this vision real by developing language technology with low-resource language speakers as the primary focus, not an afterthought. Our work prioritizes community needs, builds local capacity, and creates tools that actually serve the people who speak these languages. But we cannot do this critical work without support.

If you believe that language technology should serve all of humanity, not just the wealthy and numerous, I urge you to contribute to CLEAR Global. Individual donations directly fund the development of tools, resources, and infrastructure for languages that the commercial market ignores. For corporations in the technology, localization, and communications sectors, sponsoring CLEAR Global offers an opportunity to demonstrate genuine commitment to linguistic diversity while supporting work that benefits the entire industry. In supporting CLEAR Global, whether as an individual donor or corporate sponsor, you’re investing in linguistic justice and a future where nobody’s words matter less because of the language they speak. 

 

 

Britta Aagaard is a CLEAR Global Board member, Chief Business Officer at Semantix and a part of the executive leadership team of TransPerfect.

The post The future of language technology and why industry partnership matters appeared first on CLEAR Global.

]]>
https://clearglobal.org/the-future-of-language-technology-and-why-industry-partnership-matters/feed/ 0
Protecting human expertise in an age of automation and budget cuts https://clearglobal.org/protecting-human-expertise-in-an-age-of-automation-and-budget-cuts/ https://clearglobal.org/protecting-human-expertise-in-an-age-of-automation-and-budget-cuts/#comments Tue, 30 Sep 2025 00:00:00 +0000 https://clearglobal.org/?p=75112 By Aimee Ansari and Catherine Fox Both the language services industry and the humanitarian and development aid sector have been going through huge and […]

The post Protecting human expertise in an age of automation and budget cuts appeared first on CLEAR Global.

]]>

By Aimee Ansari and Catherine Fox

Both the language services industry and the humanitarian and development aid sector have been going through huge and painful changes recently. Language services are facing severe pressure from the impacts of machine translation and AI, which threaten the prospects and viability of freelancers and industry professionals. Meanwhile, changes in the aid industry have been dramatic with drastic and rapid cuts to US and other government budgets leading to significant staff reductions and profound uncertainty within organisations dedicated to serving vulnerable communities in the most insecure locations. These dramatic changes dovetail to imply significant change in the area where these industries intersect, leading to severe uncertainty for the individuals working in these industries. In this moment of change and unpredictability, it is critical to protect human expertise to limit long-term loss of skills, to safeguard the wellbeing of people working in these fields, and to maintain the quality of translation and interpreting services in an industry where communications are very often high-stakes.

The intense pressure on both industries is already leading to the loss of experienced and highly qualified staff. More experienced colleagues, both in the aid sector and the language services industry, are quitting their jobs in increasing numbers due to disillusionment, moral injury, and lack of financial viability. Translation and interpreting face three particular challenges: prices are falling due to market pressure, the platform economy is damaging worker rights, and clients increasingly opt for machine translation post-editing or AI over human expertise. These trends are eroding job satisfaction and professional sustainability.

Potential new recruits have to some extent seen the writing on the wall, with fewer students signing up for language studies and some universities closing their modern language departments altogether. With dramatic cuts and reorganisations in the aid sector, organisations may increasingly rely on unpaid internships to fill staffing gaps. To make matters worse, these unpaid positions will exclude candidates who cannot afford to work without pay, worsening inequality in the sector. This crunch in resources risks damaging long-term human capital, with lasting effects for years to come.

Sustaining expertise through sector upheaval

So, what can be done to prevent irreversible damage to the industries and to protect the people working within them? This crisis puts enormous strain on individual workers who are fighting to protect their jobs, wellbeing, and family livelihoods. The situation demands strong leadership from industry and associations. The language services and aid sectors must support emerging talent while retaining experienced professionals who can pass on knowledge and skills.

Mentoring schemes can connect different areas of the workforce and encourage mutual learning. Employers should value skills over years of experience, especially in fast-changing industries where a long time on the job is not necessarily a good predictor of success. Industry and associations need to work with academia and training providers to attract new talent. Meanwhile, the machine translation community is starting to acknowledge that technology assessments need to expand beyond performance metrics to include environmental and human impact.

And what can individual workers do to stay well and continue to find fulfillment at work? Given the high proportion of freelancers in the industry collaboration and networking are essential. Sharing challenges and solutions can help to counter the feelings of isolation and hopelessness that these changes bring about. Active associations and their networks, such as the Institute of Translation and Interpreting, the Association of Translation Companies, and the International Federation of Translators are worth their weight in gold in these difficult times. Freelancers and industry professionals would do well to invest their time in building and contributing to these bodies. 

Continuing professional development is essential, not only to gain new skills, but to remain agile and open to new ideas and learning. The current state of these industries demands a new generation of translators, interpreters, and aid workers who can use tech and can teach themselves to keep up-to-date with tools that are constantly changing. They can achieve this through paid learning programs or by tapping into the extensive range of available open-source resources out there. 

Workers must ultimately protect their wellbeing, ride the storms faced by these industries and remain flexible. While market conditions shift, human skills in these areas will always be needed. Professionals will continue to serve as the ‘human in the loop’ or oversee the work of automated systems. All these changes are exhausting, so we need to pace ourselves mindfully, find new sources of motivation and prevent the burnout that is a real threat to human expertise in both industries. 

 


 

Aimee Ansari is CLEAR Global’s CEO.

Catherine Fox is a freelance translator working from French and Spanish into English, after over a decade of experience in the humanitarian and development sector. She recently completed a Master’s in specialised translation at the University of Geneva. Her research interests include translation and emotion, translator self-care and translators adapting to new technology.

The post Protecting human expertise in an age of automation and budget cuts appeared first on CLEAR Global.

]]>
https://clearglobal.org/protecting-human-expertise-in-an-age-of-automation-and-budget-cuts/feed/ 1
Our impact on language AI: a strategic evaluation and insight for the road ahead https://clearglobal.org/our-impact-on-language-ai-a-strategic-evaluation-and-insight-for-the-road-ahead/ https://clearglobal.org/our-impact-on-language-ai-a-strategic-evaluation-and-insight-for-the-road-ahead/#respond Wed, 20 Aug 2025 12:35:15 +0000 https://clearglobal.org/?p=74331 Access to technology still depends on the language a person speaks. While voice assistants, transcription tools, and text-to-speech systems are becoming common in major […]

The post Our impact on language AI: a strategic evaluation and insight for the road ahead appeared first on CLEAR Global.

]]>

Access to technology still depends on the language a person speaks. While voice assistants, transcription tools, and text-to-speech systems are becoming common in major global languages, millions of people who speak marginalized or low-resource languages remain excluded.

To better understand CLEAR Global’s impact as an organization focused on low-resource language technology, and to guide our future direction, in June 2025 we commissioned an independent evaluation of our language technology efforts.

Key findings

The evaluation provided valuable, practical insights, recognizing CLEAR Global’s strengths while identifying areas for sharper focus.

What CLEAR Global is doing well

  • High-quality language data collection: CLEAR Global’s ability to collect, curate, and release reliable language datasets for under-resourced languages is widely recognized, with partners like Digital Umuganda highlighting the difference our tools make. 
  • Global reach and trusted brand: with more than 100,000 linguists connected to the TWB (Translators without Borders) Platform, CLEAR Global has an exceptional global network and reputation that helps us drive complex, multi-partner, and multi-language projects.
  • Project management and coordination: the team excels in securing funding and organizing effective collaborations, ensuring language technology projects come to fruition. 

Where CLEAR Global needs to refocus

  • Leveraging industry-standard models for training, while contributing high-quality datasets from CLEAR Global to fine-tune and enhance their performance.
  • Concentrate technical capacity on core competencies: taking on areas like app development, model training, and server hosting has spread resources thin and distracted the teams from core strengths.
  • Think about sustainability ahead of development: many technology projects have ended after the pilot phase, often due to funding priorities and limited pathways for handover to local partners.

Strategic direction

The evaluation has helped clarify the next steps. CLEAR Global will focus on where we add the most unique value while building partnerships where others excel.

Priorities moving forward
  • Data leadership: we will double down on producing high-quality, open, ethical, and well-documented datasets for low-resource languages, optimized for platforms like Hugging Face.
  • Expanding TWB Voice: by making our data collection tool, TWB Voice, available to external organizational partners or local communities, offering tools needed to engage their communities and activities, we provide an opportunity for others to autonomously collect data and develop their own language AI.
  • Empowering the TWB Community: CLEAR Global will aim to provide communities with more opportunities to engage in language AI development through tasks such as evaluation, rating, and validation. These activities are essential to ensuring that AI tools are inclusive, fair, and representative of marginalized language speakers.
Why it matters
  • As AI and language technology advance, there’s a real risk that communities speaking under-resourced languages will be left behind. Many of the world’s most widely used models simply don’t serve these languages well or at all.
  • CLEAR Global has a unique role to play not by competing with major tech companies but by providing the essential data, expertise, and community engagement needed to build inclusive, ethical, impactful language technologies.
  • By focusing on our strengths and working in partnership with organizations that can scale solutions, we help ensure that everyone, regardless of language, can be part of global conversations.

     

The opportunity ahead

This strategic realignment positions us to solidify our role as a global leader in data for low-resource language technology, while driving new partnerships with academic institutions, social impact actors, and technology companies. It will open doors to new funding streams that prioritize ethical AI and inclusion, and it will reinvigorate our linguist community by providing meaningful, empowering, and ongoing opportunities to collaborate and contribute.

This evaluation affirms that we are progressing toward our intended goals. CLEAR Global’s greatest impact lies in leveraging our global community and expertise in data collection to help close the language technology gap.

With a focused strategy, strong partnerships, and an empowered linguist community, we can help make language technology work for everyone, in every language.

Together, we can build a future where every language and every voice are heard.

For partnerships or more information, contact us at info@clearglobal.org.

The post Our impact on language AI: a strategic evaluation and insight for the road ahead appeared first on CLEAR Global.

]]>
https://clearglobal.org/our-impact-on-language-ai-a-strategic-evaluation-and-insight-for-the-road-ahead/feed/ 0
We have a plan to expand language AI for social impact: join us! https://clearglobal.org/we-have-a-plan-to-expand-language-ai-for-social-impact-join-us/ https://clearglobal.org/we-have-a-plan-to-expand-language-ai-for-social-impact-join-us/#comments Wed, 29 Jan 2025 17:18:57 +0000 https://clearglobal.org/?p=71311 CLEAR Global has a plan to radically expand the availability of language AI for speakers of marginalized languages, in partnership with other social impact […]

The post We have a plan to expand language AI for social impact: join us! appeared first on CLEAR Global.

]]>

CLEAR Global has a plan to radically expand the availability of language AI for speakers of marginalized languages, in partnership with other social impact organizations.

We have a record of pioneering and advocating for the development of language technology for marginalized communities – opening the way for others. 

  • Our early automatic speech recognition and machine translation for Kurdish and Tigrinya outperformed the models available at the time.
  • Our offline information kiosk in Bihar, India answered farmers’ spoken questions on climate adaptation in their own language when literacy and connectivity were problematic.
  • We built chatbots using conversational AI to answer people’s questions on Covid-19 in Lingala, Congolese Swahili, Hausa and Kanuri when most Covid chatbots were menu-based.

 

Working with other language tech experts in Africa and South Asia in particular, we have shown how language technology can be made accessible and useful for more of the 4 billion people worldwide excluded by language from information, services and conversations.

You can help make that change.

Advances in technology and untapped potential in the social impact sector present a unique opportunity to accelerate that progress and ensure it benefits speakers of less powerful languages. We would love to work with you on making that happen.

What needs to happen?

Our analysis is that the biggest bottlenecks at present are:

  • A shortage of diverse, good quality language data (especially voice) for marginalized languages
  • A lack of awareness among social impact organizations about what is possible
  • A lack of information on what communities need and want

Very little voice data exists for most of the world’s languages, and what does, is fairly undiverse: young male voices from the cities predominate, as do the accents and dialects of more prosperous regions. That means that if the aim is to communicate with women farmers in rural areas, for instance, the application is likely to fail. 

While language AI is bringing change to the social impact sector as it is elsewhere, many of the organizations involved are unfamiliar with language AI or  lack the capacity to use or help build it. 

We aim to:

  • Build diverse voice datasets and support other social impact organizations to do the same – tapping into a largely unmined potential of language data for marginalized language speakers, in ways that reflect their needs and wishes and are durably safe for the individuals concerned
  • Collaborate on building voice models, integrating them into useful applications, and documenting their impact
  • Research and collaborate on and advocate for centering language technology development on the needs of people excluded by language

How you can be part of it

  • Explore the potential of language technology for reaching and hearing from marginalized language speakers: integrate it into your programs and services and contribute to improving and promoting its impact. 
  • Help build a consensus on and support for safe and ethical pooling of language data for marginalized languages and for using it to expand access to services, information and conversations.
  • Collect and share language data safely and ethically.
  • Work with us to understand what marginalized communities need from language tech and how best to consult them.

How CLEAR Global can help

  • To learn more, read about and share information on our work and learning to date, or contact us to set up a call.
  • To help build language data, contact us about using our TWB Voice platform or getting our support to share text or voice data safely in other ways.
  • To integrate language tech for marginalized language speakers into your work, get in touch to discuss how we can support you with information, capacity building and language models.
  • To get a better understanding of community needs, we can support you with research and data collection.

About TWB Voice

TWB Voice is CLEAR Global’s latest contribution to building voice data by facilitating quality-controlled voice recordings in any language to meet the needs of automated speech recognition (ASR) and text-to-speech (TTS) technology. 

-A platform for quality-controlled voice data collection

-A growing repository of open-source voice data

-Access to the 100,000-strong TWB Community of linguists

-Voices accurately classified by age group and gender to aid appropriate use

How you can use TWB Voice

If you are looking to expand into new languages, we can work with you to build the voice datasets and develop the models needed for speech recognition and text-to-speech. We can also advise on integrating them into your existing services if needed.

Because we aim to expand the reach of language AI for marginalized languages, the goal is to publish datasets as open-source whenever possible, with the full consent of contributors and in compliance with ethical data management and AI use. If that won’t work for your datasets, coordinate with our experts on the needs of your language data project.

Other language tech support

If you are interested in building language datasets over time from your own digital communication with communities, we can advise and support on setting this up.

We also offer user research services to help tailor technology-enabled solutions to the needs of your intended users.

Please get in touch!

We work with a wide range of technologists, civil society organizations, international aid providers and governments to build language AI and other solutions to language exclusion. We want to hear from you if you have needs we can help address, and if you have capacity, ideas or learning we can build on together.

The post We have a plan to expand language AI for social impact: join us! appeared first on CLEAR Global.

]]>
https://clearglobal.org/we-have-a-plan-to-expand-language-ai-for-social-impact-join-us/feed/ 2