Evaluation and Test - Nonprofit Technology

6 Innovative QR Code Strategies for Nonprofits

Nonprofit Tech for Good

FEBRUARY 2, 2025

” Test Thoroughly: Test QR codes across devices and lighting conditions to ensure functionality. Evaluate which materials generate the most engagement. Resource: QR Code Chimp 6) Video QR Codes: Highlight specific programs, tell your nonprofit’s story, or showcase donor impact by linking to video content.

Strategy

Strategy Nonprofit Fundraising Volunteer

23andMe files for bankruptcy, CEO steps down

Mashable Tech

MARCH 24, 2025

The DNA testing company said it would use the money from the sale to "resolve all outstanding legal liabilities stemming from the previously disclosed October 2023 cyber incident." The company agreed to pay a $30 million settlement over a massive data breach that affected 6.9 million users in October 2023. "Any

Files

Files Missouri Press Benefit

From Train-Test to Cross-Validation: Advancing Your Model’s Evaluation

Machine Learning Mastery

AUGUST 7, 2024

Many beginners will initially rely on the train-test method to evaluate their models. In this blog, we’ll discuss why it’s important […] The post From Train-Test to Cross-Validation: Advancing Your Model’s Evaluation appeared first on MachineLearningMastery.com.

Evaluation

Evaluation Training Train Test

Webinars

The Everyday Donor: Unlocking Prospecting Segments Through Behavior Analysis

MORE WEBINARS

Tesla’s self-driving capabilities are now a Looney Tunes cartoon joke

Fast Company Tech

MARCH 18, 2025

He created a polystyrene wall with an image of a road printed on it and placed it in the middle of a real street to evaluate the reaction of the sensors in his own Tesla Model Y, which relies only on cameras. For comparison, Rober also tested a Lexus RX equipped with Lidar under the same conditions. But the reality was different.

Camera

Camera Test Environment System

Autism rate rises slightly; RFK Jr. claims he’ll “have answers by September“

Ars Technica

APRIL 15, 2025

The report's authorsresearchers at the CDC and academic institutions across the country suggest that the slight uptick is likely due to improved access to evaluations in underserved groups, including Black, Hispanic, and low-income communities. The rate of autism in a group of 8-year-olds in the US rose from 2.76

Rate

Rate Journal Children Evaluation

The crucial role of software testing in your AMS implementation

Nimble AMS

JULY 17, 2024

As you prepare your organization for new technology, you can alleviate some of these implementation concerns by thoroughly testing your platform. Keep reading to learn how software testing will prepare your association for a smooth implementation. What is software testing?

Test

Test Software Roles Associations

The 5-star rating system is terrible for gig workers. Here’s a simple solution

Fast Company Tech

MARCH 24, 2025

A small gap with huge consequences Existing research has shown that when customers submit evaluations, individual workers from ethnic minority groups are more likely to be negatively evaluated, even if their performance and quality is the same. In contrast, upvote/downvote ratings directly ask if the service met customer standards.

Rate

Rate System Evaluation Review

Easy blood test can predict a woman's heart disease risk decades in advance

TechSpot

SEPTEMBER 2, 2024

A routine blood test can determine the long-term risk of a woman's propensity for heart disease, according to a study published in the New England Journal of Medicine. Traditionally, cardiovascular risk has been evaluated through cholesterol levels, especially low-density lipoprotein (LDL), or "bad" cholesterol as it is sometimes called.

Test

Test Journal Evaluation Studies

5 Things to do When Evaluating ELT/ETL Tools

Towards Data Science

MAY 7, 2024

A list to make evaluating ELT/ETL tools a bit less daunting Photo by Volodymyr Hryshchenko on Unsplash We’ve all been there: you’ve attended (many!) Now you have to decide what sorts of things to test in order to figure out definitively if the tool is the right commitment for you and the team.

Evaluation

Evaluation Tools Integration Reddit

Imagen Editor and EditBench: Advancing and evaluating text-guided image inpainting

Google Research AI blog

JUNE 9, 2023

EditBench The EditBench dataset for text-guided image inpainting evaluation contains 240 images, with 120 generated and 120 natural images. To provide insight into the relative strengths and weaknesses of different models, EditBench prompts are designed to test fine-grained details along three categories: (1) attributes (e.g.,

Evaluation

Evaluation Images Guide Model

Apple Mac Studio M4 Max review: A creative powerhouse

Engadget

MARCH 13, 2025

All M4 Max models start with a decent 36GB of unified memory, though my test unit came with the maximum 128GB in a $3,699 configuration. It falls just below the Mac Studio with M2 Ultra on the multicore Geekbench 6 test. These specs align pretty closely with the MacBook Pro M4 Max but at a lower price, by the way.

Review

Review Test Comparison Model

AI for good: How you can help Candid Labs empower nonprofits

Candid

NOVEMBER 11, 2024

As of May 2024, the MMMU benchmark , which evaluates responses to college-level questions, scored GPT-4o at 60%, compared with an 83% human average. Now, we’re inviting the public to test drive the tool on Candid Labs. There are more prototypes in the works, and you can continue to help us by testing them as they’re released.

Help

Help Nonprofit Benchmark Grant

Accessibility Widgets and Plugins: What Your Nonprofit Needs to Know

Nonprofit Tech for Good

AUGUST 12, 2022

However, since they try to fix accessibility problems automatically without actually testing or changing your specific website, they often cause more problems than they solve. You can check a lot of this (and more) with WAVE , a free accessibility testing tool. WAVE—Web Accessibility Evaluation Tool. In theory, it sounds great!

Plugin

Plugin Widget Accessibility Nonprofit

To Build or Buy a Software Solution: Which is Right for You?

Association Analytics

JULY 20, 2021

While usage is a great data point to evaluate your product’s success, there’s so much more to consider when weighing the options to build an in-house solution or use an off-the-shelf product. Throughout the evaluation process, it’s important to keep your association’s unique goals and success metrics top-of-mind.

Software

Software Build Analytics Associations

Microsoft is testing a Windows 11 desktop watermark for unsupported hardware

The Verge

FEBRUARY 22, 2022

In the latest test builds of Windows 11, a new watermark has appeared on the desktop wallpaper, alongside a similar warning in the landing page of the settings app. The software maker revealed recently that it would test new additions to Windows 11 that might not make the final cut.

Test

Test Evaluation Method System

DARPA begins testing phase for in-orbit space construction

TechSpot

FEBRUARY 14, 2025

DARPA has announced a major shift in the final phase of its NOM4D program, transitioning from laboratory testing to small-scale orbital demonstrations. This move aims to evaluate novel materials and assembly techniques in space, marking a critical step toward the development of large-scale orbital structures. Read Entire Article

Phase

Phase Test Technique Evaluation

Massive Foundation Model for Biomolecular Sciences Now Available via NVIDIA BioNeMo

NVIDIA AI Blog

FEBRUARY 19, 2025

For example, researchers from Stanford and the Arc Institute found that in tests with BRCA1, a gene associated with breast cancer, Evo 2 could predict with 90% accuracy whether previously unrecognized mutations would affect gene function.

Foundation

Foundation Model San Francisco University

Meta will soon roll out Community Notes, a whitewashed version of Elon Musk’s fact-checking system

Fast Company Tech

MARCH 13, 2025

Images: Meta] Once a note is submitted, it’s evaluated by other Community Notes contributors. Meta says it will be monitoring the system, evaluating the latency, coverage, and the downstream effects of viewership and sharing utilizing those metrics to guide future work, refinements, testing, and iterations.

Community

Community System Headline Design

We tested one of Intel’s Tiger Lake processors — and Xe Graphics are the real deal

The Verge

SEPTEMBER 17, 2020

I haven’t gotten to try any of those models yet — Intel loaned me a generic pre-production reference design for these tests. Here’s the Tiger Lake reference design that Intel sent me to test. The pre-production test system also included 32GB of RAM.) MSI built this test system (but it’s Intel-branded).

Test

Test Laptop Benchmark Integration

With Evals, OpenAI hopes to crowdsource AI model testing

TechCrunch

MARCH 14, 2023

Alongside GPT-4 , OpenAI has open sourced a software framework to evaluate the performance of its AI models. It’s a sort of crowdsourcing approach to model testing, OpenAI explains in a blog post. ” OpenAI created Evals to develop and run benchmarks for evaluating models like GPT-4 while inspecting their performance. .

Model

Model Test Benchmark Open Source

In Miami, this 3D-printed seawall will help protect the coastline

Fast Company Tech

APRIL 18, 2025

Developed by our team of architects and marine biologists at Florida International University, the uniquely textured prototype tiles are designed to test a new approach for helping cities such as Miami adapt to rising sea levels while simultaneously restoring ecological balance along their shorelines.

Miami

Miami Help Florida University

16 Best Toiletry Bags, Tested and Reviewed (2025)

Wired

FEBRUARY 10, 2025

Our team evaluated a ton of toiletry bags to find the best storage, organization, and design options for all your essentials.

Test

Test Review Evaluation Design

TikTok’s Tunnel Girl is digging again—much to the internet’s delight

Fast Company Tech

MARCH 17, 2025

#engineering #minecraft #secrettunnel #construction #mining #diy Beautiful Paradise – Aga Alamsyah “They did give me a stop work order and are requiring an immediate evaluation by a professional engineer. If you’re inspired to dig your own tunnel, dont expect the process to be easy.

Virginia

Virginia Project Process Green

5 ways QA will evaluate the impact of new generative AI testing tools

InfoWorld

JANUARY 15, 2024

In a recent article about upgrading continuous testing for generative AI , I asked how code generation tools , copilots, and other generative AI capabilities would impact quality assurance (QA) and continuous testing.

Test

Test Generation Impact Evaluation

Building Resilient Funding Models: Essential Tips for Nonprofit Finance Professionals

sgEngage

NOVEMBER 20, 2024

By actively bringing together different departments and leading discussions around revenue diversification, you can set measurable goals, evaluate the ROI of each funding source, and make informed decisions about where to invest time and resources. How to Measure: Evaluate cost per dollar raised, donor acquisition costs, and conversion rates.

Professional

Professional Fund Model Build

Email Marketing for Nonprofits: The Ultimate How-To Guide

Bloomerang

DECEMBER 12, 2024

Evaluate if these metrics change when you adjust your email frequency. You can also conduct A/B testing with your emails to assess your metrics after making small changes to your strategy. For example, you can test your subject lines by sending half of your audience an email with one subject and half an email with a different subject.

email

email Marketing Guide Nonprofit

D-Wave says it achieved ‘quantum supremacy,’ using a computer to solve a problem that would otherwise take a million years

Fast Company Tech

MARCH 12, 2025

The quantum computing firm announced on Wednesday that, for the first time, it was able to successfully simulate the properties of magnetic materials using its Advantage2 annealing quantum computer, which allows us to invent and evaluate new materials without needing to build them in the lab, D-Wave CEO Dr. Alan Baratz tells Fast Company.

Problem

Problem Demonstration Journal Results

How to Run Facebook Fundraising Ads: 5 Simple Steps

Nonprofit Tech for Good

FEBRUARY 14, 2019

The guide below covers the key steps to running a Facebook fundraising ad campaign from start to finish, including set-up, monitoring and evaluating success after completion. The risk is low, and the potential rewards are great, so be sure to consider testing this channel as part of your next fundraising effort.

Facebook

Facebook Fundraising ROI Metrics

AI Factories, Built Smarter: New Omniverse Blueprint Advances AI Factory Design and Simulation

NVIDIA AI Blog

MARCH 18, 2025

Connected to leading simulation tools such as Cadence Reality Digital Twin Platform and ETAP, the engineering teams can test and optimize power, cooling and networking long before construction starts. Model real-world conditions Predict and test how different AI workloads will impact cooling, power stability and network congestion.

Design

Design Digital Test Library

OpenAIs o3 and o4-mini hallucinate way higher than previous models

Mashable Tech

APRIL 19, 2025

By OpenAI 's own testing, its newest reasoning models, o3 and o4 -mini, hallucinate significantly higher than o1. First reported by TechCrunch , OpenAI's system card detailed the PersonQA evaluation results, designed to test for hallucinations. ” Evaluation benchmarks are tricky. GPT-4o scored 1.5 percent, GPT-4.5

Model

Model Benchmark Evaluation Rate

Using Prompt Evaluation to Combat Bio-Weapon Research

The AI Alignment Forum

FEBRUARY 19, 2025

Published on February 19, 2025 12:39 PM GMT With many thanks to Sasha Frangulov for comments and editing Before publishing their o1-preview model system card on Sep 12, 2024, OpenAI tested the model on various safety benchmarks which they had constructed. In the past week, we've shown that prompt evaluation can be used to prevent jailbreaks.

Evaluation

Evaluation Research Benchmark Model

Marketing Automation: The Future of Fundraising for Your Nonprofit

Nonprofit Tech for Good

MAY 25, 2024

Automation Data and Insights Marketing automation platforms perform tests and collect data that can help you improve your outreach. Use benchmark data from past actions or other nonprofits to evaluate your campaigns, and then work to improve the metrics that matter most. 2) A/B Testing Not sure what subject line to go with?

Marketing

Marketing Fundraising Nonprofit Social Media

Challenging Conversations—Tackling Risk

.orgSource

FEBRUARY 13, 2023

Evaluating where the blind spots lie in your organization is one way to begin approaching these difficult conversations. They are activities that are frequently governed and controlled by policies and procedures, such as: Evaluating where the blind spots lie in your organization is one way to begin conversations about risk.

Conversation

Conversation Challenge Evaluation Associations

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

TechSpot

APRIL 21, 2025

However, internal testing and third-party evaluations now reveal that o3 and o4-mini, both classified as "reasoning models," are more prone to making things up than earlier reasoning models. Read Entire Article

Model

Model Evaluation Rate Test

The 10 most innovative data science companies of 2025

Fast Company Tech

MARCH 18, 2025

Nominal For building a data platform for mission-critical testing and evaluation Nominals data platform is purpose-built for testing and evaluation within mission-critical sectors such as aerospace, defense, and advanced technology. It also raised $27.5

Data

Data Companies Analysis Analytics

Examination of 18 antivirus programs shows Microsoft Defender has the highest system load

TechSpot

MAY 10, 2023

AV-TEST, an independent organization that evaluates and rates antivirus and security suite software, tested 18 antivirus packages for Windows 10. The examination involved testing the programs against around 12,000 malware samples mixed into 1.5 It also.

Program

Program System Sample Evaluation

TechCrunch+ roundup: Beyond the Turing test, 3 VCs on SVB, usage-based pricing tactics

TechCrunch

MARCH 14, 2023

Although generative AI is exceeding my expectations, the Turing test is mostly intact in my personal experience. To help teams shorten the “time to trust” interval, he asks several questions cybersecurity customers are likely to pose while evaluating vendors, along with action items that can help provide convincing answers.

Test

Test San Francisco Images Evaluation

The best air purifier for 2025

Engadget

MARCH 7, 2025

Weve tested over a dozen air purifiers that range from $150 to $1,200 but the most effective method for getting the green light from our air quality monitors is completely free: opening the windows. Unfortunately, it was the lowest performing unit during two separate burn tests and had repeated connectivity issues.

Test

Test Measure Design Cancer

Claude Sonnet 3.7 (often) knows when it’s in alignment evaluations

The AI Alignment Forum

MARCH 17, 2025

Published on March 17, 2025 7:11 PM GMT Note: this is a research note based on observations from evaluating Claude Sonnet 3.7. Were sharing the results of these work-in-progress investigations as we think they are timely and will be informative for other evaluators and decision-makers. Claude Sonnet 3.7 We find that Sonnet 3.7

Evaluation

Evaluation Awareness Model Test

Scott Resumes Giving Away Millions In Large Chunks

The NonProfit Times

NOVEMBER 18, 2024

In the Fall of 2023, organizations top-rated by their peers advanced to a second round of review by an external Evaluation Panel recruited for relevant experience to the cause and underwent a final round of due diligence. There were 6,353 applications for the Open Call. Secretary of the Treasury Robert E.

New York City

New York City Giving Award New York

50+ Year End Fundraiser Email Subject Lines

CauseVox

DECEMBER 9, 2024

Dont Guess – Test Your Fundraiser Email Subject Lines Testing your fundraiser email subject lines can help you learn what gets the best response from your audience. SubjectLine.com is similar by testing the overall effectiveness of your copy. You could try performing an A/B test, also known as a split test.

email

email Fundraising Spam Donor

Creating a Nonprofit Digital Strategy with ChatGPT: A Practical Guide

Nonprofit Tech for Good

SEPTEMBER 5, 2023

Implement and Test : Develop and implement your ChatGPT-powered initiatives. Be sure to continuously test and iterate to ensure optimal performance and user satisfaction. The Results: 6) Execution and Evaluation: Enhancing Implementation Executing a strategy is as important as formulating it.

Digital

Digital Strategy Create Guide

OpenAI’s o3: AI Benchmark Discrepancy Reveals Gaps in Performance Claims

TechRepublic

APRIL 22, 2025

The FrontierMath benchmark from Epoch AI tests generative models on difficult math problems. Find out how OpenAIs o3 and other AI models performed.

Benchmark

Benchmark Generation Test Model

A Budget Is Not a Spreadsheet: 7 Steps to Strategic FY26 Planning for Public Media

Allegiance Group

APRIL 14, 2025

Identify testing opportunities: With limited budgets, focus on offer and audience segmentation testing rather than creative tests, which can often be more costly. Now that youve reviewed campaign performance, take a step back and evaluate how each marketing channel contributes to your overall goals. Whats changing?

Public

Public Media Channel Mail

6 Innovative QR Code Strategies for Nonprofits

23andMe files for bankruptcy, CEO steps down

Webinars

Trending Sources

From Train-Test to Cross-Validation: Advancing Your Model’s Evaluation

Webinars

Tesla’s self-driving capabilities are now a Looney Tunes cartoon joke

Autism rate rises slightly; RFK Jr. claims he’ll “have answers by September“

The crucial role of software testing in your AMS implementation

The 5-star rating system is terrible for gig workers. Here’s a simple solution

Easy blood test can predict a woman's heart disease risk decades in advance

5 Things to do When Evaluating ELT/ETL Tools

Imagen Editor and EditBench: Advancing and evaluating text-guided image inpainting

Apple Mac Studio M4 Max review: A creative powerhouse

AI for good: How you can help Candid Labs empower nonprofits

Accessibility Widgets and Plugins: What Your Nonprofit Needs to Know

To Build or Buy a Software Solution: Which is Right for You?

Microsoft is testing a Windows 11 desktop watermark for unsupported hardware

DARPA begins testing phase for in-orbit space construction

Massive Foundation Model for Biomolecular Sciences Now Available via NVIDIA BioNeMo

Meta will soon roll out Community Notes, a whitewashed version of Elon Musk’s fact-checking system

We tested one of Intel’s Tiger Lake processors — and Xe Graphics are the real deal

With Evals, OpenAI hopes to crowdsource AI model testing

In Miami, this 3D-printed seawall will help protect the coastline

16 Best Toiletry Bags, Tested and Reviewed (2025)

TikTok’s Tunnel Girl is digging again—much to the internet’s delight

5 ways QA will evaluate the impact of new generative AI testing tools

Building Resilient Funding Models: Essential Tips for Nonprofit Finance Professionals

Email Marketing for Nonprofits: The Ultimate How-To Guide

D-Wave says it achieved ‘quantum supremacy,’ using a computer to solve a problem that would otherwise take a million years

How to Run Facebook Fundraising Ads: 5 Simple Steps

AI Factories, Built Smarter: New Omniverse Blueprint Advances AI Factory Design and Simulation

OpenAIs o3 and o4-mini hallucinate way higher than previous models

Using Prompt Evaluation to Combat Bio-Weapon Research

Marketing Automation: The Future of Fundraising for Your Nonprofit

Challenging Conversations—Tackling Risk

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

The 10 most innovative data science companies of 2025

Examination of 18 antivirus programs shows Microsoft Defender has the highest system load

TechCrunch+ roundup: Beyond the Turing test, 3 VCs on SVB, usage-based pricing tactics

The best air purifier for 2025

Claude Sonnet 3.7 (often) knows when it’s in alignment evaluations

Scott Resumes Giving Away Millions In Large Chunks

50+ Year End Fundraiser Email Subject Lines

Creating a Nonprofit Digital Strategy with ChatGPT: A Practical Guide

OpenAI’s o3: AI Benchmark Discrepancy Reveals Gaps in Performance Claims

A Budget Is Not a Spreadsheet: 7 Steps to Strategic FY26 Planning for Public Media

Stay Connected