Say we build a self-improving AI that's twice as smart as a human.
It's going to go off and try to improve itself, right? But why should we expect its improved version to share the goals and desires of the original? The AI still has to solve the control problem.
Then when the second-generation AI tries to build the third-generation AI, it has to solve the control problem again.
Since it's not clear that controlling something smarter than you gets easier as you yourself get smarter, and assuming each iteration has some nonzero chance of failing to pass on its values, you're eventually going to get value drift. Probably several times, or at least until you reach iterations that no longer feel like self-improving.
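To put the compounding in rough numbers, here's a minimal sketch. It assumes each generation independently fails to transmit its values with some fixed probability p; the value p = 0.05 and the independence assumption are illustrative, not estimates from anywhere.

```python
def prob_drift_by_generation(p_fail_per_step: float, n_steps: int) -> float:
    """Probability that at least one of n independent value-transmission
    steps has failed, given each fails with probability p_fail_per_step."""
    return 1.0 - (1.0 - p_fail_per_step) ** n_steps

if __name__ == "__main__":
    p = 0.05  # assumed per-generation chance of failing to pass on values
    for n in (1, 5, 10, 20, 50):
        print(f"after {n:2d} generations: P(drift) = {prob_drift_by_generation(p, n):.2f}")
```

Even with a modest 5% failure rate per step, the chance of drift passes 60% by generation 20 and 90% by generation 50, which is the point of the argument: small per-step risks compound.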