This post has been de-listed
It is no longer included in search results and normal feeds (front page, hot posts, subreddit posts, etc). It remains visible only via the author's post history.
What it is:
The Pipe is a tool for feeding visually complex data (pdf, urls, docx, pptx, csv, youtube videos, etc) into vision-language models such as GPT-4. It is open source and entirely written in Python, so hopefully I posted this in the right place lol. this
Why I made this:
I tried to make an application to chat with my documents and web pages. Traditional scrapers would just give me only text, and would fail to accurately scrape most of the text anyways. This tool allows you to give GPT high quality text AND visual data in an LLM-ready prompt format, so it can see the document just like you or me.
Cheers!
Post Details
- Posted
- 6 months ago
- Reddit URL
- View post on reddit.com
- External URL
- reddit.com/r/Python/comm...