Learning Theory, Method and Practice

The Theoretical Reward Learning Research Agenda: Introduction and Motivation

The AI Alignment Forum

FEBRUARY 28, 2025

Concretely, this research agenda involves answering questions such as: What is the right method for expressing goals and instructions to AI systems? The next question is whether or not a given reward learning method is guaranteed to converge to a reward function that is sufficiently accurate in this sense.

Research

Research Learning Method Policy

SLT for AI Safety

The AI Alignment Forum

JUNE 30, 2025

As of 2025, there is essentially no difference between the methods we use to align models and the methods we use to make models more capable. Everything is based on deep learning, and the main distinguishing factor is the choice of training data. Learning) How does training data determine the algorithms that models learn?

Learning Theory

Learning Theory Structure Train Training

Paradigms for computation

The AI Alignment Forum

JUNE 29, 2025

I think one would have had to imagine the development of technology not only pushing computers towards the ideal of recursion theory (that is, allowing many things computable in principle to become computable in practice) but also to push computers beyond it, making the assumptions of recursion theory outdated.

Learning Theory

Learning Theory Learning Offline Statistics

Webinars

Interactive Webinar: Recurring Giving Events That Keep on Giving

MORE WEBINARS

A Comprehensive Guide to Social Learning Theory

Gyrus

MARCH 14, 2024

A Comprehensive Guide to Social Learning Theory GyrusAim LMS GyrusAim LMS - Social learning theory’s fundamental tenet is that people learn by watching, copying, and behaving like others in social situations. What Is Social Learning Theory?

Learning Theory

Learning Theory Learning Social Guide

A Comprehensive Guide to Social Learning Theory

Gyrus

MARCH 14, 2024

A Comprehensive Guide to Social Learning Theory GyrusAim LMS GyrusAim LMS - Social learning theory’s fundamental tenet is that people learn by watching, copying, and behaving like others in social situations. What Is Social Learning Theory?

Learning Theory

Learning Theory Learning Social Guide

A Comprehensive Guide to Social Learning Theory

Gyrus

MARCH 14, 2024

A Comprehensive Guide to Social Learning Theory Gyrus Systems Gyrus Systems - Best Online Learning Management Systems Social learning theory’s fundamental tenet is that people learn by watching, copying, and behaving like others in social situations. What Is Social Learning Theory?

Learning Theory

Learning Theory Learning Social Guide

Six Tips for Evaluating Your Nonprofit Training Session

Beth's Blog: How Nonprofits Can Use Social Media

FEBRUARY 18, 2014

There are two different methods to evaluate your training. Use Learning Theory. I have written a lot about how it is important to understand how the brain works, how people learn by using learning theories to guide the design of your workshops. to define the four levels of training evaluation.

Evaluation

Evaluation Train Training Tips

Moving from Red AI to Green AI, Part 1: How to Save the Environment and Reduce Your Hardware Costs

DataRobot

APRIL 21, 2022

They are used for different applications, but nonetheless they suggest that the development in infrastructure (access to GPUs and TPUs for computing) and the development in deep learning theory has led to very large models. To better quantify this, we have developed methods to measure efficiency.

Green

Green Environment Metrics Measure

How To Think Like An Instructional Designer for Your Nonprofit Trainings

Beth's Blog: How Nonprofits Can Use Social Media

JANUARY 27, 2014

If you want to get results, you need to think about instructional design and learning theory. And, there is no shortage of learning theories and research. As someone who has been designing and delivering training for nonprofits over the past twenty years, the most exciting part is apply theory to your practice.

Instructional Design

Instructional Design Instructional Instruction Train

Twittering and Forgetting

Beth's Blog: How Nonprofits Can Use Social Media

JANUARY 3, 2008

I would recommend technology resources and they would share books about learning. Smith's book explains the history of learning theories and practice. He establishes two ways of thinking about educational practice--the official view of learning and the classical view. and tag it. Many times I forget.

Twitter

Twitter Bookmarking Nptech RSS

Timaeus in 2024

The AI Alignment Forum

FEBRUARY 20, 2025

Published on February 20, 2025 11:54 PM GMT TLDR: We made substantial progress in 2024: We published a series of papers that verify key predictions of Singular Learning Theory (SLT) [ 1 , 2 , 3 , 4 , 5 , 6 ]. Local Learning Coefficient Estimation. In practice, the scaling is highly sublinear.

Technique

Technique Sample Structure Train

Guest Post: Community and Civic Engagement in Museum Programs

Museum 2.0

SEPTEMBER 26, 2012

The purpose of my thesis was two-fold: To research and analyze community and civic engagement practices, methods, theories and examples in other museum programs. This can be accomplished through a variety of feedback methods conducted both inside and outside the museum.

Museum

Museum Program Community Participatory

Research directions Open Phil wants to fund in technical AI safety

The AI Alignment Forum

FEBRUARY 7, 2025

Were interested in more research on this, and other stress tests of todays state-of-the-art alignment methods. We want to fund research that identifies the conditions under which these failure modes occur, and makes progress toward robust methods of mitigating or avoiding them. How problematic is it?

Research

Research Fund Open Technique

Other Papers About the Theory of Reward Learning

The AI Alignment Forum

FEBRUARY 28, 2025

While this is an informal principle, it does empirically seem to hold quite robustly in practice (see e.g. this paper ). The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret In this paper , we look at what happens when a learnt reward function is optimised.

Learning

Learning Discussion Classes Policy

Google at NeurIPS 2022

Google Research AI blog

NOVEMBER 28, 2022

Platt , Fernando Pereira , Dale Schuurmans Keynote Speakers The Data-Centric Era: How ML is Becoming an Experimental Science Isabelle Guyon The Forward-Forward Algorithm for Training Deep Neural Networks Geoffrey Hinton Outstanding Paper Award Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding Chitwan Saharia , William Chan (..)

Google

Google Language Tutorial Offline

UK AISI’s Alignment Team: Research Agenda

The AI Alignment Forum

MAY 7, 2025

Arriving at robust evidence that human-level AI systems are aligned requires complementary advances across empirical science, theory, and engineering. We need a theoretical argument for why our methods effectiveness, empirical data validating the theory, and engineering work on making the method low cost.

Research

Research Team Learning Theory Method

An alignment safety case sketch based on debate

The AI Alignment Forum

MAY 8, 2025

Executive summary AI safety via debate is a promising method for solving part of the alignment problem for ASI (artificial superintelligence). We think the methods described above are more likely to work within the bounds of this well-defined problem. Read the full paper here.

Train

Train Oracle Training Research

AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability

The AI Alignment Forum

MARCH 28, 2025

And if you look at what are the takeaways right now for people practicing mech interp, theyre more of the first kind: how can we ground our sense of what mechanistic explanations are? Or according to our singular learning theory friends, the local learning coefficients should be small and that implies this thing about this.

Network

Network Model Train Training

Nonprofit Technology

The Theoretical Reward Learning Research Agenda: Introduction and Motivation

SLT for AI Safety

Webinars

Trending Sources

Paradigms for computation

Webinars

A Comprehensive Guide to Social Learning Theory

A Comprehensive Guide to Social Learning Theory

A Comprehensive Guide to Social Learning Theory

Six Tips for Evaluating Your Nonprofit Training Session

Moving from Red AI to Green AI, Part 1: How to Save the Environment and Reduce Your Hardware Costs

How To Think Like An Instructional Designer for Your Nonprofit Trainings

Twittering and Forgetting

Timaeus in 2024

Guest Post: Community and Civic Engagement in Museum Programs

Research directions Open Phil wants to fund in technical AI safety

Other Papers About the Theory of Reward Learning

Google at NeurIPS 2022

UK AISI’s Alignment Team: Research Agenda

An alignment safety case sketch based on debate

AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability

Stay Connected