Language, Learning Theory and System

Google at ICLR 2023

Google Research AI blog

APRIL 30, 2023

If you’re registered for ICLR 2023, we hope you’ll visit the Google booth to learn more about the exciting work we’re doing across topics spanning representation and reinforcement learning, theory and optimization, social impact, safety and privacy, and applications from generative AI to speech and robotics.

Google Language Model Jing

The Theoretical Reward Learning Research Agenda: Introduction and Motivation

The AI Alignment Forum

FEBRUARY 28, 2025

Concretely, this research agenda involves answering questions such as: What is the right method for expressing goals and instructions to AI systems? Some relevant criteria for evaluating a specification language include: How expressive is the language? Are there things it cannot express? For details, see, e.g., this paper.

Research Learning Method Policy

Timaeus in 2024

The AI Alignment Forum

FEBRUARY 20, 2025

Published on February 20, 2025 11:54 PM GMT. TL;DR: We made substantial progress in 2024: we published a series of papers that verify key predictions of Singular Learning Theory (SLT) [1, 2, 3, 4, 5, 6]. ... The S4 correspondence in small language models. ... in funding for 2025. ... Alignment).

Technique Sample Structure Train

Stanford AI Lab Papers and Talks at NeurIPS 2021

Stanford AI Lab Blog

DECEMBER 6, 2021

The thirty-fifth Conference on Neural Information Processing Systems (NeurIPS) 2021 is being hosted virtually from Dec 6th to 14th. Some members of our SAIL community also serve as co-organizers of several exciting workshops that will take place on Dec 13-14, so we hope you will check them out! ... Smith, Scott W. ...

Contact Learning Theory Authoring Offline

Stanford AI Lab Papers and Talks at ICLR 2022

Stanford AI Lab Blog

APRIL 25, 2022

... Manning, Jure Leskovec
Contact: xikunz2@cs.stanford.edu
Award nominations: Spotlight
Links: Paper | Website
Keywords: knowledge graph, question answering, language model, commonsense reasoning, graph neural networks, biomedical QA

Fast Model Editing at Scale
Authors: Eric Mitchell, Charles Lin, Antoine Bosselut, Chelsea Finn, Christopher D. ...

Contact Award Authoring Language

Research directions Open Phil wants to fund in technical AI safety

The AI Alignment Forum

FEBRUARY 7, 2025

Adversarial machine learning: This cluster of research areas uses simulated red-team/blue-team exercises to expose the vulnerabilities of an LLM (or a system that incorporates LLMs). We think this adversarial style of evaluation and iteration is necessary to ensure an AI system has a low probability of catastrophic failure.

Research Fund Open Technique

Other Papers About the Theory of Reward Learning

The AI Alignment Forum

FEBRUARY 28, 2025

Goodhart's Law in Reinforcement Learning: As you probably know, "Goodhart's Law" is an informal principle which says that "if a proxy is used as a target, it will cease to be a good proxy". Moreover, this dynamic is often at the core of many stories of how we could get catastrophic risks from AI systems. For details, see the full paper.

Learning Discussion Classes Policy

Google at NeurIPS 2022

Google Research AI blog

NOVEMBER 28, 2022

Posted by Cat Armato, Program Manager, Google. This week marks the beginning of the 36th annual Conference on Neural Information Processing Systems (NeurIPS 2022), the biggest machine learning conference of the year.

Google Language Tutorial Offline

AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability

The AI Alignment Forum

MARCH 28, 2025

And a technical note: it needs to be in some first-order system or alternatively, you need to measure proof checking time as opposed to proof length. Daniel Filan (00:16:47): Well, it sounds like you could say that about any software system. And then you prove this, and the measure of compression is how long your proof is.

Model Network Train Training
