
AI firms follow DeepSeek’s lead, create cheaper models with “distillation”

Ars Technica

The technique caught widespread attention after China's DeepSeek used it to build powerful and efficient AI models based on open-source systems released by competitors Meta and Alibaba. Through distillation, companies take a large language model, dubbed a "teacher" model, which generates the next likely word in a sentence.
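The excerpt stops before explaining how a "student" model learns from the teacher. As a rough, hypothetical sketch of the core idea (not DeepSeek's actual recipe): the student is trained to match the teacher's softened next-token probability distribution, typically via a KL-divergence objective with a temperature that smooths the teacher's outputs.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Softmax with a temperature; higher temperature smooths the distribution."""
    z = np.asarray(logits, dtype=float) / temperature
    z -= z.max()  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence from the teacher's softened next-token distribution
    to the student's: the standard distillation training signal."""
    p = softmax(teacher_logits, temperature)  # teacher's "soft labels"
    q = softmax(student_logits, temperature)
    return float(np.sum(p * (np.log(p) - np.log(q))))

# A student whose logits match the teacher's incurs zero loss;
# a mismatched student incurs a positive loss to minimize.
teacher = [2.0, 1.0, 0.1]
print(distillation_loss(teacher, [2.0, 1.0, 0.1]))  # 0.0
print(distillation_loss(teacher, [0.1, 1.0, 2.0]) > 0)  # True
```

In practice this per-token loss is averaged over a large corpus and often mixed with the ordinary next-token cross-entropy loss; the function names above are illustrative, not from any specific library.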

The Flan Collection: Advancing open source methods for instruction tuning

Google Research AI blog

The ability to reason on new tasks is mostly credited to training models on a wide variety of unique instructions, known as "instruction tuning", which was introduced by FLAN and extended in T0, Super-Natural Instructions, MetaICL, and InstructGPT. Counts for each are reported using task definitions from the respective works.
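The works cited above all train on (instruction, input, output) triples. As a minimal, hypothetical sketch of that data format (actual FLAN templates are more varied), each example is flattened into a prompt the model completes:

```python
def format_example(instruction, input_text, output_text):
    """Flatten an instruction-tuning triple into a (prompt, target) pair.
    This template is illustrative; real collections use many templates."""
    prompt = f"{instruction}\n\nInput: {input_text}\nOutput:"
    target = f" {output_text}"
    return prompt, target

prompt, target = format_example(
    "Translate the sentence to French.",
    "The cat sleeps.",
    "Le chat dort.",
)
print(prompt)
```

Training on thousands of such tasks, each phrased as a natural-language instruction, is what lets the model generalize to instructions it has never seen.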


What Are Foundation Models?

NVIDIA AI Blog

Foundation Models Defined: A foundation model is an AI neural network trained on mountains of raw data, generally with unsupervised learning, that can be adapted to accomplish a broad range of tasks. Google released BERT as open-source software, spawning a family of follow-ons and setting off a race to build ever larger, more powerful LLMs.

Making Brain Waves: AI Startup Speeds Disease Research With Lab in the Loop

NVIDIA AI Blog

BrainStorm is also collaborating with the NVIDIA BioNeMo team to help optimize open-source access to the Geneformer model. View of an organoid using a Fluorescence Imaging Plate Reader, or FLIPR, a technique used to study the effect of compounds on cells during drug screening.

Stability AI releases ChatGPT-like language models

TechCrunch

Stability AI, the startup behind the generative AI art tool Stable Diffusion, today open-sourced a suite of text-generating AI models intended to go head to head with systems like OpenAI's GPT-4. But Stability AI claims it created a custom training set that expands the size of the standard Pile by 3x.

German startup Kern AI nabs seed funding for modular NLP development platform

TechCrunch

Natural language processing (NLP), while hardly a new discipline, has catapulted into the public consciousness these past few months thanks in large part to the generative AI hype train that is ChatGPT. The company also says that its basic open source incarnation has been used by data scientists at companies such as Samsung and DocuSign.

Planning for AGI and beyond

OpenAI

Generally speaking, we think more usage of AI in the world will lead to good, and want to promote it (by putting models in our API, open-sourcing them, etc.). We will need to develop new alignment techniques as our models become more powerful (and tests to understand when our current techniques are failing).