Maxim Maximov’s Post

View profile for Maxim Maximov, graphic

Innovation, Digital Product, & IT Operations @ Ingram Micro | MIE, Business Innovation

The enterprise search capability and the methods we used to store enterprise data are outdated. How can we re-imagined to create data and store it so AI systems can start consuming without pre-processing/grounding? What you, Pulse AI (YC S24), have is a great achievement!

View organization page for Y Combinator, graphic

1,117,595 followers

Pulse AI (YC S24) just launched an API for production-grade unstructured document extraction, turning complex information into LLM-ready inputs. No training required. Approximately 75% of enterprise data is unstructured, the majority of this is directly within PDF files. This makes it extremely difficult to build RAG applications with this data, and ingestion is often the bottleneck. The team tested every tool on the market and found they lacked accurate contextual understanding, multi-column PDFs, and multimodal documents. Most current technologies are simply wrappers on Textract or Gemini, which have their own inherent flaws. Pulse trained its own set of Vision Language Models (VLMs) and OCR techniques to bridge this gap. Pulse reached state-of-the-art (SOTA) performance on its vision model for documents and spreadsheets. The API processes all PDF types, including handwritten documents, foreign languages, and more. It seamlessly integrates into new or existing engineering workflows as well. They are also actively working on a novel reasoning tool on spreadsheets using this technology – stay tuned. Pulse's API saw initial success across supply chain teams in the three-way match process and is deployed in companies across hardware, healthcare, manufacturing industries, and more. Sign up at studio.trypulse.ai. Congrats on the launch Sid Manchkanti and Ritvik Pandey! 🚀 https://lnkd.in/gKW37rcH

  • No alternative text description for this image

To view or add a comment, sign in

Explore topics