Alternative, Evaluation and Language

Evaluating speech synthesis in many languages with SQuId

Google Research AI blog

JUNE 7, 2023

Posted by Thibault Sellam, Research Scientist, Google Previously, we presented the 1,000 languages initiative and the Universal Speech Model with the goal of making speech and language technologies available to billions of users around the world. Such evaluation is a major bottleneck in the development of multilingual speech systems.

Evaluation

Evaluation Language Local Training

Visual language maps for robot navigation

Google Research AI blog

MARCH 23, 2023

Building robots that are proficient at navigation requires an interconnected understanding of (a) vision and natural language (to associate landmarks or follow instructions), and (b) spatial reasoning (to connect a map representing an environment to the true spatial distribution of objects).

Map

Map Language Environment Instruction

The Ultimate Guide to Accounting Software for Nonprofits

Nonprofit Tech for Good

FEBRUARY 6, 2022

For international organizations, you may face additional complexity such as handling multiple currencies and multiple languages. To find the right product for your needs, the best place to begin is with requirements to help you evaluate alternatives. Support multiple languages. High-Level Requirements. Financial Tool Kit.

Software

Software Guide Nonprofit Award

Webinars

The Everyday Donor: Unlocking Prospecting Segments Through Behavior Analysis

MORE WEBINARS

Accelerating Text Generation with Confident Adaptive Language Modeling (CALM)

Google Research AI blog

DECEMBER 16, 2022

Posted by Tal Schuster, Research Scientist, Google Research Language models (LMs) are the driving force behind many recent breakthroughs in natural language processing. Models like T5 , LaMDA , GPT-3 , and PaLM have demonstrated impressive performance on various language tasks. The encoder reads the input text (e.g.,

Language

Language Model Generation Local

ReAct: Synergizing Reasoning and Acting in Language Models

Google Research AI blog

NOVEMBER 8, 2022

Posted by Shunyu Yao, Student Researcher, and Yuan Cao, Research Scientist, Google Research, Brain Team Recent advances have expanded the applicability of language models (LM) to downstream tasks. On the other hand, recent work uses pre-trained language models for planning and acting in various interactive environments (e.g.,

Language

Language Model Sample Wikipedia

The most innovative companies in artificial intelligence for 2025

Fast Company Tech

MARCH 18, 2025

Google DeepMind broke through with a family of natively multi-modal models called Gemini that understand imagery and audio as well as they do language. Mistral released impressive new small language models that can run on laptops and even phones with its Ministral 3B and Ministral 8B, as did Microsoft with its Phi-3 and Phi-4 models.

Companies

Companies Model Train Training

74 Free or Low-Cost Tools and Resources for Nonprofits

Nonprofit Tech for Good

JUNE 29, 2021

The analytics tools will also evaluate your posts to deduce the best possible times to share your content. Dulingo provides access to free online language learning tools. For nonprofit social media managers that work internationally, Dulingo’s design and gamification make it fun to learn the basics of a new language.

Tools

Tools Resource Free Nonprofit

12 Ways to Use ChatGPT and Other AI Tools for Fundraising

Nonprofit Tech for Good

APRIL 2, 2023

6) Sharing Impact With just a few details about your organization, ChatGPT can write boilerplate language about your impact, mission, and programs. This technology is particularly valuable for mission work, such as monitoring changes in wildlife populations or evaluating the effectiveness of conservation efforts.

Tools

Tools Fundraising Donor Analytics

Writer deploys home-cooked large language models to power up enterprise copy

TechCrunch

FEBRUARY 13, 2023

Writer is such a one, and it just announced a new trio of large language models to power its enterprise copy assistant. More than just catching typos and recommending the preferred word, Writer’s new models can evaluate style and write content themselves, even doing a bit of fact-checking when they’re done.

Language

Language Model Generation Reddit

The 10 most innovative data science companies of 2025

Fast Company Tech

MARCH 18, 2025

By combining the worlds largest manufacturing data foundation with proprietary algorithms, the company claims to deliver real-time lifecycle assessments 100 times faster than the best available alternatives. Air Force, Varda Space Industries, and others, continue to transform testing and evaluation.

Data

Data Companies Analysis Analytics

94 Free or Low-Cost Tools and Resources for Nonprofits

Nonprofit Tech for Good

SEPTEMBER 6, 2022

The analytics tool evaluates the effectiveness of your posts and provides the best times to share your content. Rote is a web-based grant writing tool that saves your past language in one place – organized, filterable, and at your fingertips when you need it to write a first draft of a new proposal. Buffer :: buffer.com.

Tools

Tools Resource Free Nonprofit

50+ Year End Fundraiser Email Subject Lines

CauseVox

DECEMBER 9, 2024

Alternatively, 69 percent of email recipients flag an email as spam based on the subject line. Since it sounds a bit disingenuous, its best to remove or replace them with more accessible language. To knock your subject line out of the park, Mailmeteor will also provide a list of alternative subject lines based on the one you entered.

email

email Fundraising Spam Donor

Resolving code review comments with ML

Google Research AI blog

MAY 23, 2023

As part of this process, the reviewer inspects the proposed code and asks the author for code changes through comments written in natural language. As part of this, we explored different user experience (UX) alternatives through a series of user studies. We then refined the feature based on insights from an internal beta (i.e.,

Comment

Comment Review Model Authoring

Reimagining equity with AI: What philanthropy can do

Candid

JANUARY 21, 2025

In the nonprofit space, large language models like ChatGPT can support overworked staff that operate with limited resources. Nonprofits may lack the technical expertise and staff capacity to evaluate or adopt AI effectively. This reframing can reduce skepticism and encourage broader adoption. Build AI literacy. Foster collaboration.

Philanthropy

Philanthropy Literacy Adopt Grant

LMS Security and Compliance: Steps for Protection and Adherence

Gyrus

JULY 25, 2024

Disseminates automated notifications for policy updates and acknowledgment in employees’ preferred languages, ensuring everyone is up-to-date. Generates reports in various languages to cater to diverse workforces and ensure clear communication across all levels of the organization.

Training

Training Train Measure Data

How to Spot Misleading Charts, a Checklist

Tableau

NOVEMBER 14, 2023

Alberto Cairo, data visualization expert and author of How Charts Lie Whether you are reading a social post, news article or business report, it’s important to know and evaluate the source of the data and charts that you view. When viewing summary numbers, evaluate if the summary number is appropriate.

Chart

Chart Badge Comparison Data

Pre-training generalist agents using offline reinforcement learning

Google Research AI blog

FEBRUARY 23, 2023

Pre-training on diverse datasets has proven to enable data-efficient fine-tuning for individual downstream tasks in natural language processing (NLP) and vision problems. However, running RL algorithms in the real world requires expensive active data collection. only data from highly suboptimal policies).

Offline

Offline Train Training Learning

How to Effectively Communicate With Donors When Fundraising Online

CauseVox

JUNE 23, 2022

We evaluated the power of “why” questions for your donors in a recent webinar. Desired Action vs. Alternative Action : Your value proposition applies at every step along the donor mountain, not just with donations. give to you”) and an alternative action (e.g. Check it out ! With each step, there will be a desired action (e.g.

Communication

Communication Donor Effective Fundraising

Drivetrain is the “Google Maps for business growth”

TechCrunch

OCTOBER 18, 2022

Businesses usually plot their growth strategies on spreadsheets, but Drivetrain wants to provide a faster alternative for financial planning and decision-making. During his six years at the firm, Goel evaluated hundreds of SaaS companies and served on many of their boards.

Map

Map Business Google Metrics

Robots That Write Their Own Code

Google Research AI blog

NOVEMBER 2, 2022

It turns out that the latest generation of language models, such as PaLM , are capable of complex reasoning and have also been trained on millions of lines of code. To explore this possibility, we developed Code as Policies (CaP), a robot-centric formulation of language model-generated programs executed on physical systems.

Instruction

Instruction Instructional Language API

What Do Website Users Want From Nonprofits? What the Data Says

Greater Giving

DECEMBER 19, 2024

Kanopis nonprofit website maintenance guide recommends taking a continuous improvement approachthis involves regularly evaluating your websites analytics and user feedback and implementing the insights you learn. Review your website regularly to ensure a positive visitor experience from the very beginning of the user journey.

Websites

Websites Data Nonprofit Mobile

Measure What You Value: Designing a Values-based Performance Appraisal System

Blue Avocado

JULY 16, 2024

Improve how your nonprofit evaluates, recognizes, and motivates its employees. If you would like to significantly improve how your nonprofit evaluates, recognizes, and motivates its employees, there are a few strategies that you might implement to help guarantee success. So How Do We Achieve This Gold Standard of Evaluation?

Measure

Measure System Design Demonstration

Technical interview platform Byteboard spins out of Google’s Area 120, takes on new funding

TechCrunch

OCTOBER 5, 2021

Byteboard , a service designed to replace the pre-onsite technical interview part of a company’s hiring process with a web-based alternative, will be spinning out of Google, TechCrunch learned and Google confirmed. And this evaluation is handled anonymously, with the aim of taking the bias out of the process.

Interview

Interview Fund Platform Google

How To Think Like An Instructional Designer for Your Nonprofit Trainings

Beth's Blog: How Nonprofits Can Use Social Media

JANUARY 27, 2014

” ADDIE is an instructional design method that stands for Analysis, Design, Development, Implementation, and Evaluation. Sometimes you don’t have the ability to do a survey before, especially if it is an online webinar or a conference session. There are alternative ways to do research. This is evaluation.

Instructional Design

Instructional Design Instruction Instructional Train

The Nubank EC-1

TechCrunch

JUNE 14, 2021

The country’s financial system is volatile and often leaves its citizens with few or no alternatives. Yet, it’s a startup with a CEO and co-founder who isn’t Brazilian, didn’t speak the local language of Portuguese, hadn’t started a company before, and didn’t really know a lot about banking to begin with.

Brazil

Brazil Colombia Mexico Miami

Google at ICLR 2023

Google Research AI blog

APRIL 30, 2023

Morcos , Dhruv Batra Offline Q-Learning on Diverse Multi-task Data Both Scales and Generalizes (see blog post ) Aviral Kumar , Rishabh Agarwal , Xingyang Geng , George Tucker , Sergey Levine ReAct: Synergizing Reasoning and Acting in Language Models (see blog post ) Shunyu Yao *, Jeffrey Zhao , Dian Yu , Nan Du , Izhak Shafran , Karthik R.

Google

Google Language Model Jing

Deciphering Clinical Abbreviations with Privacy Protecting ML

Google Research AI blog

JANUARY 24, 2023

However, clinical notes are hard to understand because of the specialized language that clinicians use, which contains unfamiliar shorthand and abbreviations. Coming up with this translation is tough for laypeople and computers because some abbreviations are uncommon in everyday language (e.g., “lbp”

Literacy

Literacy Technique Model Evaluation

Revising Stages-Oversight Reveals Greater Situational Awareness in LLMs

The AI Alignment Forum

MARCH 12, 2025

Published on March 12, 2025 5:56 PM GMT Summary The Stages-Oversight benchmark from the Situational Awareness Dataset tests whether large language models (LLMs) can distinguish between evaluation prompts (such as benchmark questions) and deployment prompts (real-world user inputs).

Awareness

Awareness Evaluation Sample Benchmark

AVFormer: Injecting vision into frozen speech models for zero-shot AV-ASR

Google Research AI blog

JUNE 2, 2023

AVFormer injects visual embeddings into a frozen ASR model (similar to how Flamingo injects visual information into large language models for vision-text tasks) using lightweight trainable adaptors that can be trained on a small amount of weakly labeled video data with minimum additional training time and parameters.

Model

Model Avatar Audio Phase

Sectorness: What does "nonprofit" do for us?

ASU Lodestar Center

AUGUST 9, 2011

Better maybe, for the different pieces to keep to themselves, with their own methods, language, and professional development programs? The alternative is fragmentation and balkanization, which is no alternative at all. Is that really a reason to study , talk about , and define a professional identity of a "nonprofit sector?"

Nonprofit

Nonprofit Museum Professional Advocacy

Deci lands $25M for tech that makes AI models more efficient

TechCrunch

JULY 13, 2022

NAS, which is difficult to evaluate , can be expensive and time-consuming.) Due to the growth in Deci’s business and the product expansion opportunities into additional domains such as natural language processing, among others, our existing investors decided to double down to support that growth,” Geifman said. ” .

Model

Model Tech Images Poll

Unifying image-caption and image-classification datasets with prefix conditioning

Google Research AI blog

JUNE 27, 2023

Posted by Kuniaki Saito, Student Researcher, Cloud AI Team, and Kihyuk Sohn, Research Scientist, Perception Team Pre-training visual language (VL) models on web-scale image-caption datasets has recently emerged as a powerful alternative to traditional pre-training on image classification data. classification vs. caption).

Images

Images Language Training Train

Anthropic launches Claude, a chatbot to rival OpenAI’s ChatGPT

TechCrunch

MARCH 14, 2023

. “We use Claude to evaluate particular parts of a contract, and to suggest new, alternative language that’s more friendly to our customers,” Robin CEO Richard Robinson said in an emailed statement. “We’ve Modern chatbots are notoriously prone to toxic, biased and otherwise offensive language.

Language

Language Instruction Instructional Model

Going Virtual – An Alternative Volunteer Event Guide

Connection Cafe

MARCH 26, 2020

In light of that trend and all the recent travel upheaval due to COVID – 19, with increasing restrictions being placed on companies and their global workforce, we wanted to address creative alternatives to traditional volunteer events. . There’s an app for that!

Volunteer

Volunteer Virtual Alternative Disaster

Google Research, 2022 & beyond: Algorithmic advances

Google Research AI blog

FEBRUARY 10, 2023

In “ Robust Routing Using Electrical Flows ”, we presented a recent paper that proposed a Google Maps solution to efficiently compute alternate paths in road networks that are resistant to failures (e.g., The clients evaluate these suggestions and return measurements. closures, incidents).

Research

Research Google Technique Model

Thoughts on the Future of Open Source and Nonprofits

NTEN

OCTOBER 6, 2010

In this resource-strapped environment, it's arguable that not considering open source options seriously when evaluating your software toolbox is a disservice to your organization, board, constituents, and clients. . The new wave includes CRM (SugarCRM, CiviCRM) and desktop applications like Open Office and GIMP (a Photoshop alternative). .

Open Source

Open Source Open Nonprofit Application

Putting the AI in Education: Stepping Toward Generative Artificial Intelligences

sgEngage

OCTOBER 9, 2023

Imagine empowering a teacher with generative AI to improve question-building workflows for online assessments and open-book evaluations. Imagine being able to ask your AI to review all your saved topics and suggest alternatives and improvements based on related assessments and grades.

Education

Education Generation Blackbaud Student

Data Dirtiness Score

Towards Data Science

MARCH 2, 2024

The evaluation of each phase typically relies on comparing a dirty dataset against a clean (ground truth) version, using classification metrics like recall, precision, and F1-score for error detection (see for example Can Foundation Models Wrangle Your Data? or Large Language Models as Data Preprocessors ). 102 instead of 12).

Data

Data Student Issue Language

The best foldable phones for 2025

Engadget

FEBRUARY 21, 2025

A note on durability Best foldable phones for 2025 How we test foldable phones When evaluating new foldable phones, we consider the same general criteria as we do when were judging the best smartphones. Table of contents Best foldable phones for 2025 How we test foldable phones Are foldable phones worth it?

Phone

Phone North America Instruction Instructional

AdaTape: Foundation model with adaptive computation and dynamic read-and-write

Google Research AI blog

AUGUST 8, 2023

AdaTape provides helpful inductive bias We evaluate AdaTape on parity, a very challenging task for the standard Transformer, to study the effect of inductive biases in AdaTape. Parity is the simplest non-counter-free or periodic regular language , but perhaps surprisingly, the task is unsolvable by the standard Transformer.

Model

Model Foundation Evaluation Sample

NTC Summary, and Nonprofit Technology Consulting 2.0

Zen and the Art of Nonprofit Technology

APRIL 8, 2007

From the stories I’ve heard this week, nonprofits of the size that I’m most familiar with (small to medium-sized) still don’t have in-house technology expertise to make evaluations about what directions to go in. The previous sentence is a clumsy attempt to use langage from both parties in the conversation.

Consultant

Consultant Summary NTC Technology

Research directions Open Phil wants to fund in technical AI safety

The AI Alignment Forum

FEBRUARY 7, 2025

We think this adversarial style of evaluation and iteration is necessary to ensure an AI system has a low probability of catastrophic failure. Wed like to support more such evaluations, especially on scalable oversight protocols like AI debate. and Which rules are LLM agents happy to break, and which are they more committed to? .

Research

Research Fund Open Technique

We’re 39 percent similar; how can we be exponentially better?

Candid

NOVEMBER 17, 2021

Data Handling, Overview, Measurement, Evaluation and Reporting (4 percent). Alternative Supports (<1 percent). Miscellaneous (3 percent). Corporate Delegation and Oversight, Organizational Structure (5 percent). Project Demographics/Orientation/Status (2 percent). How Did You Hear of Us (<1 percent).

Grant

Grant Application Contact Analysis

How might we safely pass the buck to AI?

The AI Alignment Forum

FEBRUARY 19, 2025

Alternatively, the developer can continue to rely on human oversight to assure safety. The developer also runs targeted evaluations of M_1 , for example, removing AI safety research from 2024 from its training data and asking it to re-discover 2024 AI safety research results. One method is to perform a holistic control evaluation.

Evaluation

Evaluation Research Measure Develop

Evaluating speech synthesis in many languages with SQuId

Visual language maps for robot navigation

Webinars

Trending Sources

The Ultimate Guide to Accounting Software for Nonprofits

Webinars

Accelerating Text Generation with Confident Adaptive Language Modeling (CALM)

ReAct: Synergizing Reasoning and Acting in Language Models

The most innovative companies in artificial intelligence for 2025

74 Free or Low-Cost Tools and Resources for Nonprofits

12 Ways to Use ChatGPT and Other AI Tools for Fundraising

Writer deploys home-cooked large language models to power up enterprise copy

The 10 most innovative data science companies of 2025

94 Free or Low-Cost Tools and Resources for Nonprofits

50+ Year End Fundraiser Email Subject Lines

Resolving code review comments with ML

Reimagining equity with AI: What philanthropy can do

LMS Security and Compliance: Steps for Protection and Adherence

How to Spot Misleading Charts, a Checklist

Pre-training generalist agents using offline reinforcement learning

How to Effectively Communicate With Donors When Fundraising Online

Drivetrain is the “Google Maps for business growth”

Robots That Write Their Own Code

What Do Website Users Want From Nonprofits? What the Data Says

Measure What You Value: Designing a Values-based Performance Appraisal System

Technical interview platform Byteboard spins out of Google’s Area 120, takes on new funding

How To Think Like An Instructional Designer for Your Nonprofit Trainings

The Nubank EC-1

Google at ICLR 2023

Deciphering Clinical Abbreviations with Privacy Protecting ML

Revising Stages-Oversight Reveals Greater Situational Awareness in LLMs

AVFormer: Injecting vision into frozen speech models for zero-shot AV-ASR

Sectorness: What does "nonprofit" do for us?

Deci lands $25M for tech that makes AI models more efficient

Unifying image-caption and image-classification datasets with prefix conditioning

Anthropic launches Claude, a chatbot to rival OpenAI’s ChatGPT

Going Virtual – An Alternative Volunteer Event Guide

Google Research, 2022 & beyond: Algorithmic advances

Thoughts on the Future of Open Source and Nonprofits

Putting the AI in Education: Stepping Toward Generative Artificial Intelligences

Data Dirtiness Score

The best foldable phones for 2025

AdaTape: Foundation model with adaptive computation and dynamic read-and-write

NTC Summary, and Nonprofit Technology Consulting 2.0

Research directions Open Phil wants to fund in technical AI safety

We’re 39 percent similar; how can we be exponentially better?

How might we safely pass the buck to AI?

Stay Connected