I mainly build my own datasets, either generating them with algorithms or paying people to write the data I use for my private models. The thing is, that gets expensive on a massive project, and my current project is an advanced form of text generation with chat capabilities. So my question here specifically is:
I found a dataset on Kaggle of 51 million Discord messages from anonymous users. The messages have context but still need a lot of refinement and work. Is it ethical to use this dataset, which was probably collected without the knowledge of the anonymous users in it, given that Discord's TOS is against such datasets and data collection?
- External URL
- reddit.com/r/artificial/...