I am working on a project using part of the Netflix catalogue and some 15,000 names of actors and directors. I'm trying to estimate their ethnicities using datasets that map names to the ethnicity each name is most common in. I found the US Census data on surnames, which gives the percentage of people with each surname who identify as certain ethnicities, but it's riddled with "(S)" values amongst the legit floats. Unfortunately, neither the data dictionary from 531 nor the census methodology [PDF] clarified things.
Does anyone know the significance of that value?
https://www.census.gov/topics/population/genealogy/data/2010_surnames.html
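For context, here is a minimal sketch of how the "(S)" cells can be kept from breaking numeric parsing while the question stays open. It assumes, without confirmation from the documentation, that "(S)" is a placeholder for a withheld value rather than a number, so it is mapped to `None`. The column names and sample rows are made up for illustration and are not real census figures; `parse_pct` and `load_surnames` are hypothetical helper names.

```python
import csv
import io

# Made-up sample rows, NOT real census figures; column names are assumed.
SAMPLE = """name,pctwhite,pctblack,pcthispanic
EXAMPLEA,70.5,20.1,5.2
EXAMPLEB,(S),(S),96.3
"""


def parse_pct(cell):
    """Return the cell as a float, or None for the "(S)" placeholder."""
    cell = cell.strip()
    if cell == "(S)":
        # Assumption: "(S)" means the value is withheld, not zero.
        return None
    return float(cell)


def load_surnames(handle):
    """Map each surname to a dict of {column: float or None}."""
    table = {}
    for row in csv.DictReader(handle):
        name = row.pop("name")
        table[name] = {col: parse_pct(val) for col, val in row.items()}
    return table


surnames = load_surnames(io.StringIO(SAMPLE))
print(surnames["EXAMPLEB"])
```

Keeping the placeholder as `None` instead of 0 avoids treating an unknown share as "nobody with this surname identifies this way", which would skew any most-likely-ethnicity estimate.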