Why all AI should be open source and openly available

This post has been de-listed

It is no longer included in search results and normal feeds (front page, hot posts, subreddit posts, etc). It remains visible only via the author's post history.

373

Post Flair (click to view more posts with a particular flair)

Discussion

Post Body

None, exactly zero, of the companies in AI, no matter who, created any of the training data themself. They harvested it from the internet. From D*scord, Reddit, Twitter, Youtube, from image sites, from fan-fiction sites, wikipedia, news, magazines and so on. Sure, they used money for the hardware and energy to train the models on, but a training can only be as good as the input and for that, their core business, the quality of the input, they paid literally nothing.

On top of that everything ran and runs on open source software.

Therefore they should be required to release the models and give everyone access to them in the same way they got access to the training data in the first place. They still can offer a service, after all running a model still needs skills: you need to finetune, use the right settings, provide the infrastructure and so on. That they can still sell if they want to, however harvesting the whole internet and then keeping the result private to make money off it is just theft.

Fight me.

Duplicate Posts

2 posts with the exact same title by 1 other authors

View Details

Author

Account Strength

100%

Account Age

6 years

Verified Email

Yes

Verified Flair

Total Karma

9,525

Link Karma

2,059

Comment Karma

7,466

Profile updated: 3 days ago

dreamyrhodes

Subreddit

r/LocalLLaMA

Post Details

We try to extract some basic information from the post title. This is not always successful or accurate, please use your best judgement and compare these values to the post title and body for confirmation.

Posted: 10 months ago
Reddit URL: View post on reddit.com
External URL: reddit.com/r/LocalLLaMA/...