How To Think Like An Instructional Designer for Your Nonprofit Trainings

Beth's Blog: How Nonprofits Can Use Social Media

If you want to get results, you need to think about instructional design and learning theory. There is no shortage of learning theories and research. As someone who has been designing and delivering training for nonprofits over the past twenty years, I find the most exciting part is applying theory to your practice.

Google at ICLR 2023

Google Research AI blog

If you’re registered for ICLR 2023, we hope you’ll visit the Google booth to learn more about the exciting work we’re doing across topics spanning representation and reinforcement learning, theory and optimization, social impact, safety and privacy, and applications from generative AI to speech and robotics.

Other Papers About the Theory of Reward Learning

The AI Alignment Forum

The first of these is the preference structures given by multi-objective RL, where the agent is given multiple reward functions R_1, R_2, R_3, ..., and has to find a policy that achieves a good trade-off of those rewards according to some specified criterion. Alternatively, see the main paper.

Research directions Open Phil wants to fund in technical AI safety

The AI Alignment Forum

Alternatives to adversarial training: Adversarial training (and the rest of today's best alignment techniques) have failed to create LLM agents that reliably avoid misaligned goals. Alternative approaches to mitigating AI risks: These research areas lie outside the scope of the clusters above. Wen et al., Sheshadri et al.,

AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability

The AI Alignment Forum

And a technical note: it needs to be in some first-order system, or alternatively, you need to measure proof checking time as opposed to proof length. Daniel Filan (00:28:50): If people remember my singular learning theory episodes, they'll get mad at you for saying that quadratics are all there is, but it's a decent approximation. (00:28:56):
