LlamaIndex’s Post

🔥 Introducing GPT-4o + LlamaParse 🔥 GPT-4o is the state-of-the-art model for multimodal understanding, meaning it also has state-of-the-art document parsing capabilities. LlamaParse is the platform for enabling LLM-powered parsing - it uses LLMs to extract documents from any file type in a performant, reliable fashion, offering state-of-the-art response quality for advanced document RAG. We’re excited to offer GPT-4o as an explicit option in LlamaParse, which will use GPT-4o for extraction per page into markdown, instead of using our default parsers/models. Why:  - GPT-4o is very good at parsing very complex documents into well-formatted markdown. Oftentimes it outperforms our default approaches. - This means that it can turn documents with very complex tables / charts into clean, indexable data for your RAG pipeline - higher response quality, lower hallucinations 📈 Tradeoffs / Caveats ⚠️:  - It’s expensive 💵: Due to the cost of inference, using GPT-4o is currently $0.60 USD per page (while by default LlamaParse is $0.003 per page). This cost can spike quickly - beware!  - You can specify your OpenAI key, in which case the marginal cost per page goes down to 0.3c per page.  - This is a beta feature. Given the cost and latency, use this with caution! If you want to give this a shot, signup for an account and check out our UI: https://lnkd.in/gbkxQAQd Notebook: https://lnkd.in/grwUVr-G

  • No alternative text description for this image
Ranjeet Rustogi

co-f @awaydayai, cto @suketv | 5x exit tech nut | ai/web3 consultant | crypto-native | startup advisor, mentor, investor

7mo

Using own key should bring the cost down to 0.3c or $0.3? Because 0.3c is $0.003, which is then the same as the cost of the default LlamaParse 🤔 LlamaIndex

Ibrahim Akhtar

Data Scientist | AI Developer | Game Developer

7mo

Can't wait to experiment around with this model. Once the price goes down a bit ofc.

Jonny H.

Software Engineer

7mo

Could the two be used in combination to improve accuracy? For example, 4o reviews the llamaparsed output and input page image to check for errors.

Like
Reply

Interesting. Should give it a try.

Like
Reply
Matthew Combatti

Simulanics Technologies - AI & ML Systems Engineer - Master Software Developer & Systems Security Expert

7mo

You're welcome for the idea this morning 💡 🙂 🙏

  • No alternative text description for this image
Jhonnatan Betancourt

Data, AI & Software for Business ⚙️📊 Data engineering & Analytics

7mo

I'm waiting impatiently for GPT 4o🤓

Like
Reply
Piyush Sar

MLE I at infocusp innovations | NLP + Multimodal

7mo
See more comments

To view or add a comment, sign in

Explore topics