Remove Oracle Remove Policy Remove Proposal
article thumbnail

SHIFT relies on token-level features to de-bias Bias in Bios probes

The AI Alignment Forum

An oracle probe trained on an unbiased training dataset, where gender is not correlated with profession, achieves an accuracy of 93%. Our oracle probe (i.e., For this specific seed, the de-biased probe achieved higher accuracy compared to the oracle probe.) Marks et al. on the unbiased dataset.

Train 52
article thumbnail

Saiga aims to succeed where Magic and other concierge apps failed

TechCrunch

The chat is enriched with interactive proposal cards to improve the experience and allow faster decision making on part of our customers,” Hermann said. “Our users interact with Saiga via an app as the primary interface, where they have a chat-like interface for each task. ” A path to success?

Germany 74
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Google at NeurIPS 2022

Google Research AI blog

Sajjadi , Daniel Duckworth , Aravindh Mahendran , Sjoerd van Steenkiste , Filip Pavetić , Mario Lučić , Leonidas J.

Google 52
article thumbnail

An overview of areas of control work

The AI Alignment Forum

In principle, we could even allow for an end-to-end game where competitors submit either a red team attack policy or a blue team protocol and then we see how the dynamics go. These settings can be interesting to study even if the blue team has access to a basically perfect but expensive oracle of whether the AI's actions are problematic.

Work 52
article thumbnail

US senator Ed Markey proposes TikTok ban deadline extension bill while TikTok plans to shut down on Sunday

TechNode

Why it matters: The proposed delay to the ban reflects the US’s conflicted stance on TikTok. It also requires TikTok’s cloud service provider, Oracle, to cease hosting its US user data. Massachusetts Senator Ed Markey proposed a delay to the TikTok ban deadline less than a week before it was set to take effect.

article thumbnail

How a fake “Real Oversight Board” is putting pressure on Facebook

The Verge

Facebook’s proposed solution to this is the Oversight Board , an independent group that will serve as a kind of Supreme Court for content moderation. A lesson of the 2020 campaign so far is that Facebook struggles to remove harmful speech even when it makes a policy of doing so. So what effect will any of this have?

article thumbnail

Trump’s latest attack on Section 230 is really about censoring speech

The Verge

Goodman, a law professor at Rutgers University specializing in information policy, approaches the problem from another angle. as part of that country’s debate over proposed online-harm legislation, would “require platform companies to ensure that their algorithms do not skew toward extreme and unreliable material to boost user engagement.”