Learning from Little Human Feedback [R] [P]

Post Flair

Research

Author Summary

Sea-Collection-8844 is a redditor

Post Body

I have an environment where the input is an image that may or may not include a bounding box. I also have an agent that determines whether a human likes the bounding box, based on the provided image and the human's actions. Now I want to modify this setup so that the agent can adapt to any other human's preferences with minimal feedback. How can I achieve this?
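
To make the setup concrete, here is a minimal sketch of one possible framing, not a definitive answer: a frozen shared image encoder feeds a small logistic "preference head", and a copy of that head is fine-tuned for each new human on a handful of like/dislike labels. Everything named here is an assumption for illustration: `featurize`, `PreferenceHead`, `adapt`, the `(x, y, width, height)` box format, and the idea that an image embedding is already available. The human-action features mentioned in the post are left out for brevity.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

# Hypothetical box format: (x, y, width, height), each normalized to [0, 1].
Box = Tuple[float, float, float, float]
Example = Tuple[List[float], int]  # (feature vector, 1 = liked / 0 = disliked)


def featurize(image_embedding: Sequence[float], box: Optional[Box]) -> List[float]:
    """Build one feature vector from a frozen encoder's embedding and the box.

    The has-box flag covers images that contain no bounding box.
    Human-action features could be appended the same way (omitted here).
    """
    has_box = 0.0 if box is None else 1.0
    box_feats = [0.0] * 4 if box is None else list(box)
    return list(image_embedding) + [has_box] + box_feats + [1.0]  # last entry acts as bias


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


@dataclass
class PreferenceHead:
    """Logistic head predicting P(human likes the box | features)."""
    weights: List[float]

    def predict(self, x: Sequence[float]) -> float:
        return sigmoid(sum(w * xi for w, xi in zip(self.weights, x)))

    def adapt(self, feedback: Sequence[Example], lr: float = 0.5,
              epochs: int = 200) -> "PreferenceHead":
        """Return a per-user copy fine-tuned on a handful of labels (plain SGD)."""
        w = list(self.weights)
        for _ in range(epochs):
            for x, liked in feedback:
                err = sigmoid(sum(wi * xi for wi, xi in zip(w, x))) - liked
                w = [wi - lr * err * xi for wi, xi in zip(w, x)]
        return PreferenceHead(w)


# Toy usage: a new user who likes wide boxes and dislikes tall ones.
emb = [0.2, 0.1]  # stand-in for a real image embedding
# Zero weights stand in for a head pretrained on earlier users.
shared = PreferenceHead([0.0] * len(featurize(emb, None)))
feedback = [
    (featurize(emb, (0.1, 0.1, 0.8, 0.2)), 1),
    (featurize(emb, (0.1, 0.1, 0.2, 0.8)), 0),
    (featurize(emb, (0.2, 0.2, 0.7, 0.3)), 1),
    (featurize(emb, (0.2, 0.2, 0.3, 0.7)), 0),
]
user_head = shared.adapt(feedback)
```

Because `adapt` returns a new head and leaves the shared one untouched, each human gets an independent set of weights while the encoder stays fixed, which keeps the amount of feedback needed per person small. Whether a linear head is expressive enough depends on the quality of the embedding.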

Subreddit

r/MachineLearning

Post Details

Posted: 10 months ago
External URL: reddit.com/r/MachineLear...