[P] Advise on building Image Captioning Model in Minor Language

This post has been de-listed

It is no longer included in search results and normal feeds (front page, hot posts, subreddit posts, etc). It remains visible only via the author's post history.

Post Flair (click to view more posts with a particular flair)

Project

Post Body

Hello everyone!

I am a freshman at the university. Lately, I have been interested in ML and DL approaches to solving problems. I want to build an image labeling/captioning model in a minor language. I have found that the language I am interested in has no labeled dataset.

I have three approaches in mind:

Create dataset by myself - approximately 10000 images with manual captions - Decide the NN architecture train the model
Try to use the existing pre-trained model and use the dataset I prepared
Add Neural Machine Translation component in the architecture - For Multilingual captioning?

If possible, maybe I can cross-validate all these three options to see which one is potentially a better.

I am still learning and there are lots of unclear things I want to get some advice from the experts. Any insight or suggestion would mean the world to me!

Author

Account Strength

80%

Account Age

4 years

Verified Email

Yes

Verified Flair

Total Karma

Link Karma

Comment Karma

Profile updated: 2 days ago

Posts updated: 3 days ago

Witty-Satisfaction41

Subreddit

r/MachineLearning

Post Details

We try to extract some basic information from the post title. This is not always successful or accurate, please use your best judgement and compare these values to the post title and body for confirmation.

Posted: 1 year ago
Reddit URL: View post on reddit.com
External URL: reddit.com/r/MachineLear...