Coming soon - Get a detailed view of why an account is flagged as spam!
view details

This post has been de-listed

It is no longer included in search results and normal feeds (front page, hot posts, subreddit posts, etc). It remains visible only via the author's post history.

13
Evaluating the quality of a dataset
Post Flair (click to view more posts with a particular flair)
Post Body

Hey,

over the course of the last month I created a dataset (images bounding box annotation) for an object detection task and it is working quite well. While evaluating classical metrics like precision and recall is easy, I wondered whether there are other useful metrics that solely rely on evaluating the dataset itself without any object detector involved. I think its general consensus that a dataset should be diverse in terms of scenes, angles etc. to mitigate any potential bias by the detector towards a specific situation. But I do not have a good idea how to evaluate the datasets "diversity", I have some additional tags for every image that I can use, but that feels really simple.

I am grateful for any suggestions towards other metrics :)

Author
Account Strength
90%
Account Age
11 years
Verified Email
Yes
Verified Flair
No
Total Karma
4,376
Link Karma
2,098
Comment Karma
2,218
Profile updated: 4 days ago
Posts updated: 1 day ago

Subreddit

Post Details

We try to extract some basic information from the post title. This is not always successful or accurate, please use your best judgement and compare these values to the post title and body for confirmation.
Posted
3 years ago