This site uses cookies to improve your experience. To help us insure we adhere to various privacy regulations, please select your country/region of residence. If you do not select a country, we will assume you are from the United States. Select your Cookie Settings or view our Privacy Policy and Terms of Use.
Cookie Settings
Cookies and similar technologies are used on this website for proper function of the website, for tracking performance analytics and for marketing purposes. We and some of our third-party providers may use cookie data for various purposes. Please review the cookie settings below and choose your preference.
Used for the proper function of the website
Used for monitoring website traffic and interactions
Cookie Settings
Cookies and similar technologies are used on this website for proper function of the website, for tracking performance analytics and for marketing purposes. We and some of our third-party providers may use cookie data for various purposes. Please review the cookie settings below and choose your preference.
Strictly Necessary: Used for the proper function of the website
Performance/Analytics: Used for monitoring website traffic and interactions
If you’re registered for ICLR 2023, we hope you’ll visit the Google booth to learn more about the exciting work we’re doing across topics spanning representation and reinforcement learning, theory and optimization, social impact, safety and privacy, and applications from generative AI to speech and robotics.
Concretely, this research agenda involves answering questions such as: What is the right method for expressing goals and instructions to AI systems? Some relevant criteria for evaluating a specification language include: How expressive is the language? Are there things it cannot express? For details, see e.g. this paper.)
Published on February 20, 2025 11:54 PM GMT TLDR: We made substantial progress in 2024: We published a series of papers that verify key predictions of Singular LearningTheory (SLT) [ 1 , 2 , 3 , 4 , 5 , 6 ]. The S4 correspondence in small language models. in funding for 2025. Alignment).
The thirty-fifth Conference on Neural Information Processing Systems (NeurIPS) 2021 is being hosted virtually from Dec 6th - 14th. Some of the members in our SAIL community also serve as co-organizers of several exciting workshops that will take place on Dec 13-14, so we hope you will check them out! Smith, Scott W.
Manning, Jure Leskovec Contact : xikunz2@cs.stanford.edu Award nominations: Spotlight Links: Paper | Website Keywords : knowledge graph, question answering, language model, commonsense reasoning, graph neural networks, biomedical qa Fast Model Editing at Scale Authors : Eric Mitchell, Charles Lin, Antoine Bosselut, Chelsea Finn, Christopher D.
Adversarial machine learning This cluster of research areas uses simulated red-team/blue-team exercises to expose the vulnerabilities of an LLM (or a system that incorporates LLMs). We think this adversarial style of evaluation and iteration is necessary to ensure an AI system has a low probability of catastrophic failure.
Goodhart's Law in Reinforcement Learning As you probably know, "Goodhart's Law" is an informal principle which says that "if a proxy is used as a target, it will cease to be a good proxy". Moreover, this dynamic is often at the core of many stories of how we could get catastrophic risks from AI systems. For details, see the full paper.
Posted by Cat Armato, Program Manager, Google This week marks the beginning of the 36th annual Conference on Neural Information Processing Systems ( NeurIPS 2022 ), the biggest machine learning conference of the year.
And a technical note: it needs to be in some first-order system or alternatively, you need to measure proof checking time as opposed to proof length. Daniel Filan (00:16:47): Well, it sounds like you could say that about any software system. And then you prove this, and the measure of compression is how long your proof is.
We organize all of the trending information in your field so you don't have to. Join 12,000+ users and stay up to date on the latest articles your peers are reading.
You know about us, now we want to get to know you!
Let's personalize your content
Let's get even more personalized
We recognize your account from another site in our network, please click 'Send Email' below to continue with verifying your account and setting a password.
Let's personalize your content