Funds for Companies

Grants and Resources for Sustainability

Apply for the Natural Language Processing (NLP) Program

Deadline: 23 August 2024

The Lacuna Fund is inviting applications for the Natural Language Processing (NLP) Program to support efforts to develop open and accessible datasets for machine learning applications related to Natural Language Processing (NLP) for low-resource languages and cultures in Africa and Latin America.

The ability to communicate and be understood in one’s own language variety and cultural context is fundamental to digital and societal inclusion. Natural language processing techniques have the potential to enable AI applications that facilitate digital inclusion and improvements in education, finance, healthcare, agriculture, communication, and responses to natural hazards, among others. Many advances in both fundamental and applied NLP have stemmed from openly licensed and publicly available datasets.

However, such datasets are scarce or non-existent for many African and Latin American languages, which excludes these populations from the benefits of NLP. Many current machine learning (ML) models are trained on Anglo-centric and/or translated datasets that lack culturally relevant nuances, producing biased or unusable models for communities in Africa and Latin America. Where relevant datasets do exist, they are often based on older religious or judicial texts, leading to outdated language and bias. There is a need for openly accessible datasets that facilitate NLP technologies for low-resource languages in Africa and Latin America, and for support to develop robust and culturally appropriate language datasets that cater to the specific needs of underrepresented communities.

Funding Information

  • The total pool available is approximately $1 million USD. Lacuna Fund would like to fund projects in each of the target regions (Africa and Latin America) and anticipates supporting 6-8 smaller projects with budgets of up to $100k USD and 2-3 larger, more complex projects with budgets ranging from $100k to $250k USD.
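
As a rough illustration of how the figures above interact, the following sketch checks the anticipated project counts against the pool. Only the dollar amounts and project counts come from the call text; the function names and the idea of averaging the remainder across smaller projects are assumptions made for illustration, not part of Lacuna Fund's selection process.

```python
# Figures quoted in the call text (USD). The pool is "approximately" this size.
POOL_USD = 1_000_000
SMALL_MAX_USD = 100_000          # budget ceiling for a smaller project
LARGE_MAX_USD = 250_000          # budget ceiling for a larger project


def max_commitment(n_small: int, n_large: int) -> int:
    """Total requested if every funded project asked for its budget ceiling."""
    return n_small * SMALL_MAX_USD + n_large * LARGE_MAX_USD


def average_small_budget(n_small: int, n_large: int, large_budget: int) -> float:
    """Hypothetical helper: average amount left per smaller project
    if each larger project receives `large_budget`."""
    remaining = POOL_USD - n_large * large_budget
    return remaining / n_small


if __name__ == "__main__":
    # Fewest and most anticipated projects, all at their ceilings.
    print(max_commitment(6, 2), max_commitment(8, 3))
    # Eight smaller projects alongside three larger ones at the $100k floor.
    print(average_small_budget(8, 3, 100_000))
```

Under these assumptions, even the smallest anticipated portfolio (6 smaller and 2 larger projects) would exceed the pool if every project requested its ceiling, so applicants may want to treat the stated maximums as upper bounds rather than expected award sizes.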

Need

  • Lacuna Fund seeks proposals from qualified, multidisciplinary teams to develop open and accessible training and evaluation datasets for machine learning applications for NLP in low-resource languages and underrepresented cultures in Africa and Latin America.
  • Proposals may include, but are not limited to:
    • Collecting and/or annotating new data;
    • Annotating or releasing existing data;
    • Augmenting existing datasets from diverse sources to fill gaps in local ground truth data, decrease bias (such as geographic bias, gender gaps or other types of bias or discrimination), or increase the usability of data and technology related to NLP in low- and middle-income contexts;
    • Linking and harmonizing existing datasets (such as across regions, time, and linguistic varieties, as well as domain-specific datasets such as historical, health and education data).
  • The Technical Advisory Panel (TAP) sees a need for training and evaluation datasets that account for the linguistic diversity and cultural nuances in Africa and Latin America. This includes datasets on regional slang, idiomatic expressions, local linguistic varieties or dialects, and culturally relevant data. Such datasets are crucial for developing more inclusive and effective natural language processing tools that can serve the unique needs of culturally diverse linguistic communities.
  • They seek datasets identified by local experts and designed to address locally identified needs. The following examples are illustrative only; datasets may include, but are not limited to:
    • Labeled and unlabeled datasets for low-resource NLP tasks, supporting the development of accurate and effective machine learning models. Downstream tasks for labeled datasets might include, but are not limited to: question answering and conversational AI; sentiment analysis; social bias detection; hate speech detection and counter speech; misinformation and disinformation detection; automatic text summarization or other natural language understanding and generation tasks; or resources to support NLP education in collaboration with communities. Unlabeled datasets include text corpora that can be used to support the training and evaluation of speech models.
    • Speech corpora, including datasets to enable automatic speech recognition (ASR) that allows illiterate or otherwise underprivileged groups of persons to access information and/or services in low-resource languages.
    • Datasets for text-generation tasks, particularly tasks other than machine translation.
    • Multimodal and other innovative datasets, such as video or audio captioning, visual question-answering or other image-text interactions.
    • Datasets supporting knowledge-intensive tasks, such as question answering (QA) and Retrieval Augmented Generation (RAG).
    • Datasets related to dialectal variation corpora and code-switched text and speech, including capturing linguistic variations (regional slang, idiomatic expressions, culturally relevant data) in dialect-rich low-resource languages and in linguistic communities where code-switching is common.
    • Domain-specific creation or augmentation of text and speech datasets, in areas such as healthcare, place names, agriculture or education, that enable applications with significant social impact. This may include exploring generative data augmentation frameworks to incorporate domain-specialized vocabulary, semantics, morphology, and syntax.
    • Datasets supporting machine learning for linguistics, for the preservation and revitalization of marginalized cultures and of aspects of underrepresented languages that these cultures consider important for their health, dignity, environment, and well-being. These datasets may include phonetic, morphological, and syntactic annotations, and automated tools to perform these tasks if sought by the social group involved.
    • Across all datasets: gender-responsiveness and inclusion of key vulnerable groups, including bias mitigation for those living in humanitarian and conflict settings, as well as those at the intersections of more than one socio-economic group (e.g., disability, gender, age, minorities). Please refer to the ‘Risks, including Ethics and Privacy’ paragraph in the Proposal narrative section of the call document and carefully consider the ethics of data collection.

Eligibility Criteria

  • Lacuna Fund aims to make its funding accessible to as many organizations as possible in the AI for social good space and cultivate capacity and emerging organizations in the field.
  • To be eligible for funding, organizations must:
    • Be either a non-profit entity, research institution, for-profit social enterprise, or a team of such organizations. Individuals must apply through an institutional sponsor. Partnerships are strongly encouraged as a way to strengthen collaboration and maximize the benefits derived from the use of the datasets, but only the lead applicant will receive funds.
    • Have a mission supporting societal good, broadly defined.
    • Be headquartered in the country or region where data will be collected. The geographic focus of this call is Africa and Latin America. Institutions based in other countries or regions can apply as partners of the lead institution. As stated above, only the lead applicant will receive funds.
    • Have all necessary national or other approvals to conduct the proposed research. The approval process may be conducted in parallel with the grant application, if necessary. Approval costs, if any, are the responsibility of the applicant.
    • Have the technical capacity – or the ability to build this capacity through a partnership described in the proposal – to conduct dataset labeling, creation, aggregation, expansion, and/or maintenance, including the ability to apply best practices and established standards in the specific domain (e.g., natural language processing) so that multiple entities can perform high-quality AI/ML analytics.

For more information, visit Lacuna Fund.

©FUNDSFORNGOS LLC.   fundsforngos.org, fundsforngos.ai, and fundsforngospremium.com domains and their subdomains are the property of FUNDSFORNGOS, LLC 140 Broadway 46th Floor, New York, NY 10005, United States.   Unless otherwise specified, this website is not affiliated with the abovementioned organizations. The material provided here is solely for informational purposes and without any warranty. Visitors are advised to use it at their discretion. Read the full disclaimer here. Privacy Policy. Cookie Policy.
