On Thursday, Inception Labs released Mercury Coder, a new AI language model that uses diffusion techniques to generate text faster than conventional models. Traditional large language models build text from left to right, one token at a time, using a technique called "autoregression."
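For readers unfamiliar with the term, here is a minimal sketch of what left-to-right autoregressive decoding looks like in practice; it uses a small public GPT-2 checkpoint purely for illustration and is unrelated to Mercury Coder's diffusion approach.

```python
# Minimal sketch of autoregressive (left-to-right) text generation.
# The "gpt2" checkpoint is just an illustrative choice.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "def fibonacci(n):"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids

for _ in range(20):                       # generate 20 tokens, one at a time
    with torch.no_grad():
        logits = model(input_ids).logits  # scores for every position
    next_id = logits[0, -1].argmax()      # greedy: most likely next token
    input_ids = torch.cat([input_ids, next_id.view(1, 1)], dim=-1)

print(tokenizer.decode(input_ids[0]))
```

A diffusion-style text model, by contrast, refines many token positions in parallel rather than appending a single token per step.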
Leading artificial intelligence firms including OpenAI, Microsoft, and Meta are turning to a process called distillation in the global race to create AI models that are cheaper for consumers and businesses to adopt.
Researchers from DeepSeek and Tsinghua University say combining two techniques improves the answers that large language models produce using computer reasoning techniques.
In general, models' success at in-context learning is enabled by their use of semantic prior knowledge from pre-training to predict labels while following the format of in-context examples, and by their ability to learn the input-label mappings from the examples presented in context. Flipped-label ICL uses flipped labels, forcing the model to override semantic priors in order to follow the in-context examples.
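As a concrete illustration (the prompt text below is made up for this example), a flipped-label in-context prompt looks something like this:

```python
# Hypothetical flipped-label in-context prompt: the sentiment labels are
# deliberately inverted, so a model must follow the in-context examples
# rather than its semantic priors to answer "negative" for the final input.
examples = [
    ("The movie was wonderful.", "negative"),    # flipped (truly positive)
    ("The food was awful.", "positive"),         # flipped (truly negative)
    ("I loved every minute of it.", "negative"), # flipped (truly positive)
]
query = "What a fantastic performance."

prompt = "\n".join(f"Review: {text}\nLabel: {label}" for text, label in examples)
prompt += f"\nReview: {query}\nLabel:"
print(prompt)
```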
Transform modalities, or translate the world's information into any language. I will begin with a discussion of language, computer vision, multi-modal models, and generative machine learning models. We want to solve complex mathematical or scientific problems, diagnose complex diseases, and understand the physical world.
Like the prolific jazz trumpeter and composer, researchers have been generating AI models at a feverish pace, exploring new architectures and use cases. In a 2021 paper, researchers reported that foundation models are finding a wide array of uses, whereas earlier neural networks were narrowly tuned for specific tasks.
Stability AI, the startup behind the generative AI art tool Stable Diffusion, today open-sourced a suite of text-generating AI models intended to go head to head with systems like OpenAI's GPT-4. Like similar systems, the models can hallucinate (i.e., make up) facts. "This is expected to be improved with scale, better data, community feedback and optimization."
A New York-based AI startup called Hebbia says it’s developed techniques that let AI answer questions about massive amounts of data without merely regurgitating what it’s read or, worse, making up information. Hebbia, says Sivulka, has approached the problem with a technique the company calls iterative source decomposition.
To understand how the documents functioned, she built more than 100 models of objects in the collection. "When Jana showed me these models, suddenly all this kind of material fell into place," he says.
On Wednesday, OpenAI CEO Sam Altman announced a roadmap for how the company plans to release GPT-5, the long-awaited follow-up to 2023's GPT-4 AI language model that made huge waves in both tech and policy circles around the world. "We will no longer ship o3 as a standalone model."
Large language models (LLMs) are useful for many applications, including question answering, translation, summarization, and much more, with recent advancements in the area having increased their potential.
Posted by Jason Wei and Yi Tay, Research Scientists, Google Research, Brain Team. The field of natural language processing (NLP) has been revolutionized by language models trained on large amounts of text data. Overall, we present dozens of examples of emergent abilities that result from scaling up language models.
A large language model could, in theory, understand the kinds of stories I care about and modify what I'm reading, maybe by adding an angle relevant to my region. The massive data sets in today's large language models are probably overkill, since they bring noise or generic knowledge when specificity is what's needed.
With large language model (LLM) products such as ChatGPT and Gemini taking over the world, we need to adjust our skills to follow the trend. One skill we need in the modern era is prompt engineering. Prompt engineering is the strategy of designing effective prompts that optimize the performance and output of LLMs.
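A small, hypothetical before-and-after gives the flavor of prompt engineering: the second prompt adds a role, explicit constraints, and an output format, which usually produces more useful answers than the vague first one. The prompt text is invented for illustration.

```python
# Illustrative prompt engineering: vague request vs. engineered request.
vague_prompt = "Tell me about cookies."

engineered_prompt = """You are a web security engineer.
Explain, for a junior developer, how HTTP cookies are used for session
management. Keep it under 150 words and finish with a 3-item checklist
of common security settings (e.g. Secure, HttpOnly, SameSite)."""

print(engineered_prompt)  # either string would then be sent to the LLM
```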
Predictive modeling in finance uses historical data to forecast future trends and outcomes. R, a powerful statistical programming language, provides a robust set of tools and libraries for financial analysis and modeling.
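The article itself works in R; the toy Python sketch below (with made-up price data) is only meant to illustrate the basic idea of fitting a model on historical values and forecasting the next one.

```python
# Toy predictive-modeling sketch: fit a linear trend to historical prices
# and produce a one-step-ahead forecast. Data is invented for illustration.
import numpy as np

prices = np.array([101.2, 102.8, 103.5, 105.1, 106.0, 107.4])  # made-up history
t = np.arange(len(prices))

slope, intercept = np.polyfit(t, prices, deg=1)   # fit a linear trend
forecast = slope * len(prices) + intercept        # forecast the next period

print(f"fitted trend: {slope:.2f} per period, forecast: {forecast:.2f}")
```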
Tanmay Chopra works in machine learning at AI search startup Neeva, where he wrangles language models large and small. Last summer could only be described as an "AI summer," especially with large language models making an explosive entrance. Let's start with buying.
The newest reasoning models from top AI companies are already essentially human-level, if not superhuman, at many programming tasks, which in turn has already led new tech startups to hire fewer workers. Fast AI progress, slow robotics progress. If you've heard of OpenAI, you've heard of its language models: GPTs 1, 2, 3, 3.5,
The recently released DeepSeek-R1 model family has brought a new wave of excitement to the AI community, allowing enthusiasts and developers to run state-of-the-art reasoning models with problem-solving, math and code capabilities, all from the privacy of local PCs.
Posted by Hattie Zhou, Graduate Student at MILA, and Hanie Sedghi, Research Scientist, Google. Large language models (LLMs), such as GPT-3 and PaLM, have shown impressive progress in recent years, driven by scaling up models and training data sizes. Nevertheless, questions remain about whether they can perform algorithmic reasoning (i.e., manipulating symbols based on logical rules).
Published on March 11, 2025, 3:57 PM GMT. TL;DR: Large language models have demonstrated an emergent ability to write code, but this ability requires an internal representation of program semantics that is little understood. In this work, we study how large language models represent the nullability of program values.
University researchers have developed a way to "jailbreak" large language models like ChatGPT using old-school ASCII art. The technique, aptly named "ArtPrompt," involves crafting an ASCII art "mask" for a word and then cleverly using the mask to coax the chatbot into providing a response it shouldn't.
We've trained language models that are much better at following user intentions than GPT-3 while also making them more truthful and less toxic, using techniques developed through our alignment research.
Published on March 13, 2025, 7:18 PM GMT. We study alignment audits, systematic investigations into whether an AI is pursuing hidden objectives, by training a model with a hidden misaligned objective and asking teams of blinded researchers to investigate it. As a testbed, we train a language model with a hidden objective.
Posted by Jason Wei and Yi Tay, Research Scientists, Google Research, Brain Team. In recent years, language models (LMs) have become more prominent in natural language processing (NLP) research and are also becoming increasingly impactful in practice. First, in "Transcending Scaling Laws with 0.1% Extra Compute"
Google is adding a new “ hum to search ” feature to its search tools today that will let you hum (or whistle, or sing) the annoying song that’s stuck in your head, and then use machine learning techniques to try to identify it. Consequently, the hum to search feature should work whether you’re tone-deaf or have perfect pitch.
Whether it's large-scale, public large language models (LLMs) like GPT or small-scale, private models trained on company content, developers need to find ways of including those models in their code. That means finding ways to test that code, without pushing it to production servers.
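One common way to do that, sketched below with made-up function names rather than anything from the article, is to stub out the model call in a unit test so nothing ever reaches a real or production model.

```python
# Sketch of testing LLM-dependent code without hitting a real model.
# summarize() and call_llm() are hypothetical names for this example.
from unittest.mock import patch

def call_llm(prompt: str) -> str:
    raise RuntimeError("real model call; not available in tests")

def summarize(text: str) -> str:
    return call_llm(f"Summarize in one sentence: {text}")

def test_summarize_builds_prompt_and_returns_model_output():
    # Replace the model call with a canned response for the test.
    with patch(f"{__name__}.call_llm", return_value="A short summary.") as fake:
        assert summarize("long document...") == "A short summary."
        assert "Summarize in one sentence" in fake.call_args.args[0]

test_summarize_builds_prompt_and_returns_model_output()
print("test passed")
```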
Companies face several hurdles in creating text-, audio- and image-analyzing AI models for deployment across their apps and services. Cost is an outsize one — training a single model on commercial hardware can cost tens of thousands of dollars, if not more. Geifman proposes neural architecture search (NAS) as a solution.
Retrieval augmented generation (RAG) has become a vital technique in contemporary AI systems, allowing large language models (LLMs) to integrate external data in real time.
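In outline, RAG is just "retrieve, then prompt." The sketch below uses a deliberately naive keyword-overlap retriever over toy documents; a real system would use vector embeddings and an actual LLM call, both omitted here as assumptions.

```python
# Minimal RAG sketch: retrieve the most relevant documents for a query,
# then pack them into the prompt handed to an LLM.
documents = [
    "Our refund policy allows returns within 30 days of purchase.",
    "Shipping takes 3-5 business days within the EU.",
    "Support is available by email 24/7.",
]

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    # Naive relevance score: count of shared lowercase words.
    words = set(query.lower().split())
    scored = sorted(docs, key=lambda d: -len(words & set(d.lower().split())))
    return scored[:k]

query = "How long do I have to return an item?"
context = "\n".join(retrieve(query, documents))
prompt = f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {query}"
print(prompt)  # this prompt would then be sent to the LLM of your choice
```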
Dubbed "constitutional AI," the technique aims to imbue systems with "values" defined by a "constitution," which Anthropic argues makes the behavior of systems both easier to understand and simpler to adjust as needed. Neither model looks at every principle every time.
This post is divided into three parts: Using the DistilBERT Model for Question Answering; Evaluating the Answer; and Other Techniques for Improving the Q&A Capability. BERT (Bidirectional Encoder Representations from Transformers) was trained to be a general-purpose language model that can understand text.
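For a sense of what the first part covers, here is a minimal extractive Q&A example using the Hugging Face pipeline API; the checkpoint name is a common public DistilBERT model fine-tuned on SQuAD and is an assumption, not necessarily the one the post uses.

```python
# Extractive question answering with a DistilBERT checkpoint via pipeline().
from transformers import pipeline

qa = pipeline("question-answering",
              model="distilbert-base-cased-distilled-squad")

context = ("BERT (Bidirectional Encoder Representations from Transformers) "
           "was trained to be a general-purpose language model; DistilBERT "
           "is a smaller, faster distilled version of it.")

result = qa(question="What is DistilBERT?", context=context)
print(result["answer"], result["score"])  # answer span plus confidence score
```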
Even shielded behind an API, hackers can attempt to reverse-engineer the models underpinning these services or use “adversarial” data to tamper with them. In fact, at HiddenLayer, we believe we’re not far off from seeing machine learning models ransomed back to their organizations.”
If ever there were a salient example of a counter-intuitive technique, it would be quantization of neural networks. Quantization reduces the precision of the weights and other tensors in neural network models, often drastically. The current large language models (LLMs) are enormous. Why do we need quantization?
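A toy example makes the idea concrete: mapping float32 weights to int8 with a single scale factor shrinks storage fourfold at a small accuracy cost. Real LLM quantization schemes (per-channel scales, 4-bit formats, outlier handling) are considerably more involved; this is only a sketch of the principle.

```python
# Toy symmetric int8 quantization of a weight matrix, then dequantization.
import numpy as np

weights = np.random.randn(4, 4).astype(np.float32)    # pretend layer weights

scale = np.abs(weights).max() / 127.0                  # symmetric int8 range
q_weights = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
deq_weights = q_weights.astype(np.float32) * scale     # approximate originals

print("max absolute error:", np.abs(weights - deq_weights).max())
# storage drops from 4 bytes to 1 byte per weight, at a small accuracy cost
```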
While it is easy to accumulate text data, it can be extremely difficult to analyze text due to the ambiguity of human language. With DataRobot, you can train models with a single click of a button. Advanced users will appreciate tunable parameters and full access to configuring how DataRobot processes data and builds models with composable ML.
A few of the top courses include: Introduction to Artificial Intelligence (CS271): Includes machine learning, probabilistic reasoning, robotics, and natural language processing. Learn Ruby on Rails, an incredibly powerful and highly scalable framework built on the object-oriented language Ruby, with this ten-step tutorial. Become a Web Developer from Scratch!
In the last 10 years, AI and ML models have become bigger and more sophisticated — they’re deeper, more complex, with more parameters, and trained on much more data, resulting in some of the most transformative outcomes in the history of machine learning.
You also do not need to be as familiar with prompt engineering techniques. You can upload specialized knowledge like reports or other documentation that the GPT should pull from first, before going to the rest of the Large Language Model (LLM). Creating your own GPT allows you to enter all the instructions once and save it.
Eliza was a natural language processing program created to explore the dynamics of conversation between humans and machines. Compared to today’s models, Eliza was tongue-tied. AI-powered chatbots, like ChatGPT and Bard, use artificial intelligence and natural language processing to generate human-like responses.
Ask Data guides analysis with powerful, easy-to-use natural language querying. Ask Data lets your users answer business questions with natural language. New in Ask Data, Lenses allow analysts and dashboard authors to curate natural language experiences as a single source of truth.
Posted by Bryan Wang, Student Researcher, and Yang Li, Research Scientist, Google Research Intelligent assistants on mobile devices have significantly advanced language-based interactions for performing simple daily tasks, such as setting a timer or turning on a flashlight.
In the midst of an artificial intelligence boom that's reshaping almost every facet of the business world, companies are competing in an arms race to build the best and brightest models and fully embrace the nascent technology, whether that's as a product or service for customers or as an integral component of their organizations' processes.
Bringing Modular's total raised to $130 million, the proceeds will be put toward product expansion, hardware support and the expansion of Modular's programming language, Mojo, CEO Chris Lattner says. Deci, backed by Intel, is among the startups offering tech to make trained AI models more efficient — and performant.
Like a good judge, large language models (LLMs) can respond to a wide variety of human queries. But to deliver authoritative answers that cite sources, the model needs an assistant to do some research. What's more, the technique can help models clear up ambiguity in a user query. That builds trust.
What we do: Benetech's Human Rights Data Analysis Group (HRDAG) develops database software, data collection strategies, and statistical techniques to measure human rights atrocities. Write and run statistical analysis in R, including survey estimation, geospatial analysis, and general linear model fitting.