Qantev’s Post

We're thrilled to announce the publication of our latest scientific paper, "Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation", by Filipe Lauar and Valentin LAURENT!

Optical Character Recognition (OCR) has transformed text extraction from images, and at Qantev we're pushing the boundaries of multilingual OCR, particularly for Visual Rich Documents (VRDs).

📖 About the Paper:
Our research covers the creation of a synthetic dataset in Spanish, designed to handle the unique challenges posed by VRDs. By fine-tuning the TrOCR model on this dataset, we've achieved remarkable results, making our Spanish OCR model a leading open-source solution.

⭐️ Key Highlights:
- Creation of a synthetic VRD dataset in Spanish
- Fine-tuning TrOCR with advanced data augmentation techniques
- Benchmarking against EasyOCR and the Microsoft Azure OCR API
- Significant improvements in Character Error Rate (CER) and Word Error Rate (WER)

You can explore the full paper here: https://lnkd.in/eS_C578s

💡 Read the Blog Post:
Dive deeper into our methodology and findings in our comprehensive blog post. Learn how we tackled the challenges of VRDs and fine-tuned the TrOCR model for superior performance in Spanish OCR.
🔗 Blog Post: https://lnkd.in/eaFePisF

🔍 Explore Our Resources:
- Spanish TrOCR models on Hugging Face: huggingface.co/qantev
- Dataset Generation Method on GitHub: https://lnkd.in/eEPZGrq2

Special thanks to our amazing authors, Filipe Lauar and Valentin LAURENT, for their invaluable contributions to this paper.

We're proud to contribute to the OCR community and excited to see how our work can help in a range of applications, from digitizing documents to extracting text from complex images.

#Qantev #OCR #TrOCR #MachineLearning #AI #Research #OpenSource #SpanishOCR #VRD

Feel free to reach out if you have any questions or feedback. Thank you for your support!
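For readers unfamiliar with the two metrics named in the highlights above, here is a minimal Python sketch of how CER and WER are commonly defined: the edit (Levenshtein) distance between the OCR output and the reference text, divided by the reference length in characters or words. This is only an illustration of the standard definitions, not the evaluation code used in the paper; the function names and the sample strings are made up for the example.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of words)."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference items into an empty hypothesis: i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution (or match when cost is 0)
            ))
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character Error Rate: character-level edits / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word Error Rate: word-level edits / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


# Hypothetical example: the OCR output drops the tilde in "niño".
reference = "el niño come"
prediction = "el nino come"
print(f"CER = {cer(reference, prediction):.3f}")
print(f"WER = {wer(reference, prediction):.3f}")
```

Lower is better for both metrics. A single wrong character costs far more in WER than in CER, which is one reason the two are usually reported together.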

Spanish TrOCR: Leveraging Transfer Learning for Language Adaptation

medium.com

Joel Farvault

AWS Principal SA Data & Analytics for Energy | Transforming business with Data Analytics & Cloud technology

4mo

Congrats 🎉
