


R. Manmatha
Person information
- affiliation: University of Massachusetts Amherst, USA
2020 – today
- 2024
- [j17]Yi Zhu, Zhongyue Zhang, Chongruo Wu, Zhi Zhang, Tong He, Hang Zhang, R. Manmatha, Mu Li, Alexander J. Smola:
Improving Semantic Segmentation via Efficient Self-Training. IEEE Trans. Pattern Anal. Mach. Intell. 46(3): 1589-1602 (2024) - [c91]Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha:
DocFormerv2: Local Features for Document Understanding. AAAI 2024: 709-718 - [c90]Tianyang Zhao, Kunwar Yashraj Singh, Srikar Appalaraju, Peng Tang, Vijay Mahadevan, R. Manmatha, Ying Nian Wu:
No Head Left Behind - Multi-Head Alignment Distillation for Transformers. AAAI 2024: 7514-7524 - [c89]Hao Li, Yang Zou, Ying Wang, Orchid Majumder, Yusheng Xie, R. Manmatha, Ashwin Swaminathan, Zhuowen Tu, Stefano Ermon, Stefano Soatto:
On the Scalability of Diffusion-based Text-to-Image Generation. CVPR 2024: 9400-9409 - [c88]Ofir Abramovich, Niv Nayman, Sharon Fogel, Inbal Lavi, Ron Litman, Shahar Tsiper, Royee Tichauer, Srikar Appalaraju, Shai Mazor, R. Manmatha:
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding. ECCV (8) 2024: 241-259 - [c87]Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto:
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models. EMNLP 2024: 3167-3193 - [c86]Ajoy Mondal, Vijay Mahadevan, R. Manmatha, C. V. Jawahar:
ICDAR 2024 Competition on Recognition and VQA on Handwritten Documents. ICDAR (6) 2024: 426-442 - [c85]Peng Tang, Srikar Appalaraju, R. Manmatha, Yusheng Xie, Vijay Mahadevan:
Multiple-Question Multiple-Answer Text-VQA. NAACL (Industry Track) 2024: 73-88 - [c84]Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha:
DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models. NAACL-HLT (Findings) 2024: 116-131 - [i31]Hao Li, Yang Zou, Ying Wang, Orchid Majumder, Yusheng Xie, R. Manmatha, Ashwin Swaminathan, Zhuowen Tu, Stefano Ermon, Stefano Soatto:
On the Scalability of Diffusion-based Text-to-Image Generation. CoRR abs/2404.02883 (2024) - [i30]Pei Wang, Zhaowei Cai, Hao Yang, Ashwin Swaminathan, R. Manmatha, Stefano Soatto:
Mixed-Query Transformer: A Unified Image Segmentation Architecture. CoRR abs/2404.04469 (2024) - [i29]Ofir Abramovich, Niv Nayman, Sharon Fogel, Inbal Lavi, Ron Litman, Shahar Tsiper, Royee Tichauer, Srikar Appalaraju, Shai Mazor, R. Manmatha:
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding. CoRR abs/2407.12594 (2024) - [i28]Chaofan Tao, Gukyeong Kwon, Varad Gunjal, Hao Yang, Zhaowei Cai, Yonatan Dukler, Ashwin Swaminathan, R. Manmatha, Colin J. Taylor, Stefano Soatto:
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality. CoRR abs/2408.09511 (2024) - [i27]Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto:
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models. CoRR abs/2410.03061 (2024) - [i26]Hao Li, Shamit Lal, Zhiheng Li, Yusheng Xie, Ying Wang, Yang Zou, Orchid Majumder, R. Manmatha, Zhuowen Tu, Stefano Ermon, Stefano Soatto, Ashwin Swaminathan:
Efficient Scaling of Diffusion Transformers for Text-to-Image Generation. CoRR abs/2412.12391 (2024) - 2023
- [c83]Jiang Liu, Hui Ding, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha:
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation. CVPR 2023: 18653-18663 - [c82]Haofu Liao, Aruni RoyChowdhury, Weijian Li, Ankan Bansal, Yuting Zhang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan:
DocTr: Document Transformer for Structured Information Extraction in Documents. ICCV 2023: 19527-19537 - [i25]Yash Patel, Yusheng Xie, Yi Zhu, Srikar Appalaraju, R. Manmatha:
SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation. CoRR abs/2302.03432 (2023) - [i24]Jiang Liu, Hui Ding, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha:
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation. CoRR abs/2302.07387 (2023) - [i23]Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha:
DocFormerv2: Local Features for Document Understanding. CoRR abs/2306.01733 (2023) - [i22]Haofu Liao, Aruni RoyChowdhury, Weijian Li, Ankan Bansal, Yuting Zhang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan:
DocTr: Document Transformer for Structured Information Extraction in Documents. CoRR abs/2307.07929 (2023) - [i21]Peng Tang, Srikar Appalaraju, R. Manmatha, Yusheng Xie, Vijay Mahadevan:
Multiple-Question Multiple-Answer Text-VQA. CoRR abs/2311.08622 (2023) - [i20]Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha:
DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models. CoRR abs/2311.08623 (2023) - 2022
- [c81]Hang Zhang, Chongruo Wu, Zhongyue Zhang, Yi Zhu, Haibin Lin, Zhi Zhang, Yue Sun, Tong He, Jonas Mueller, R. Manmatha, Mu Li, Alexander J. Smola:
ResNeSt: Split-Attention Networks. CVPR Workshops 2022: 2735-2745 - [c80]Yair Kittenplon, Inbal Lavi, Sharon Fogel, Yarin Bar, R. Manmatha, Pietro Perona:
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer. CVPR 2022: 4594-4603 - [c79]Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha:
LaTr: Layout-Aware Transformer for Scene-Text VQA. CVPR 2022: 16527-16537 - [c78]Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos:
YORO - Lightweight End to End Visual Grounding. ECCV Workshops (8) 2022: 3-23 - [c77]Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha:
GLASS: Global to Local Attention for Scene-Text Spotting. ECCV (28) 2022: 249-266 - [c76]Ron Slossberg, Oron Anschel, Amir Markovitz, Ron Litman, Aviad Aberdam, Shahar Tsiper, Shai Mazor, Jon Wu, R. Manmatha:
On Calibration of Scene-Text Recognition Models. ECCV Workshops (4) 2022: 263-279 - [i19]Yair Kittenplon, Inbal Lavi, Sharon Fogel, Yarin Bar, R. Manmatha, Pietro Perona:
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer. CoRR abs/2202.05508 (2022) - [i18]Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha:
GLASS: Global to Local Attention for Scene-Text Spotting. CoRR abs/2208.03364 (2022) - [i17]Chih-Hui Ho, Srikar Appalaraju, Bhavan Jasani, R. Manmatha, Nuno Vasconcelos:
YORO - Lightweight End to End Visual Grounding. CoRR abs/2211.07912 (2022) - 2021
- [c75]Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona:
Sequence-to-Sequence Contrastive Learning for Text Recognition. CVPR 2021: 15302-15312 - [c74]Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha:
DocFormer: End-to-End Transformer for Document Understanding. ICCV 2021: 973-983 - [c73]Yash Patel, Srikar Appalaraju, R. Manmatha:
Saliency Driven Perceptual Image Compression. WACV 2021: 227-236 - [i16]Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha:
DocFormer: End-to-End Transformer for Document Understanding. CoRR abs/2106.11539 (2021) - [i15]Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha:
LaTr: Layout-Aware Transformer for Scene-Text VQA. CoRR abs/2112.12494 (2021) - 2020
- [c72]Ron Litman, Oron Anschel, Shahar Tsiper, Roee Litman, Shai Mazor, R. Manmatha:
SCATTER: Selective Context Attentional Scene Text Recognizer. CVPR 2020: 11959-11969 - [i14]Yash Patel, Srikar Appalaraju, R. Manmatha:
Hierarchical Auto-Regressive Model for Image Compression Incorporating Object Saliency and a Deep Perceptual Loss. CoRR abs/2002.04988 (2020) - [i13]Ron Litman, Oron Anschel, Shahar Tsiper, Roee Litman, Shai Mazor, R. Manmatha:
SCATTER: Selective Context Attentional Scene Text Recognizer. CoRR abs/2003.11288 (2020) - [i12]Hang Zhang, Chongruo Wu, Zhongyue Zhang, Yi Zhu, Zhi Zhang, Haibin Lin, Yue Sun, Tong He, Jonas Mueller, R. Manmatha, Mu Li, Alexander J. Smola:
ResNeSt: Split-Attention Networks. CoRR abs/2004.08955 (2020) - [i11]Yi Zhu, Zhongyue Zhang, Chongruo Wu, Zhi Zhang, Tong He, Hang Zhang, R. Manmatha, Mu Li, Alexander J. Smola:
Improving Semantic Segmentation via Self-Training. CoRR abs/2004.14960 (2020) - [i10]Minesh Mathew, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar:
DocVQA: A Dataset for VQA on Document Images. CoRR abs/2007.00398 (2020) - [i9]Minesh Mathew, Rubèn Tito, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar:
Document Visual Question Answering Challenge 2020. CoRR abs/2008.08899 (2020) - [i8]Yi Zhu, Xinyu Li, Chunhui Liu, Mohammadreza Zolfaghari, Yuanjun Xiong, Chongruo Wu, Zhi Zhang, Joseph Tighe, R. Manmatha, Mu Li:
A Comprehensive Study of Deep Video Action Recognition. CoRR abs/2012.06567 (2020) - [i7]Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona:
Sequence-to-Sequence Contrastive Learning for Text Recognition. CoRR abs/2012.10873 (2020) - [i6]Ron Slossberg, Oron Anschel, Amir Markovitz, Ron Litman, Aviad Aberdam, Shahar Tsiper, Shai Mazor, Jon Wu, R. Manmatha:
On Calibration of Scene-Text Recognition Models. CoRR abs/2012.12643 (2020)
2010 – 2019
- 2019
- [j16]Ismet Zeki Yalniz, R. Manmatha:
Dependence Models for Searching Text in Document Images. IEEE Trans. Pattern Anal. Mach. Intell. 41(1): 49-63 (2019) - [i5]Son Tran, Ming Du, Sampath Chanda, R. Manmatha, Cj Taylor:
Searching for Apparel Products from Images in the Wild. CoRR abs/1907.02244 (2019) - [i4]Yash Patel, Srikar Appalaraju, R. Manmatha:
Deep Perceptual Compression. CoRR abs/1907.08310 (2019) - [i3]Yash Patel, Srikar Appalaraju, R. Manmatha:
Human Perceptual Evaluations for Image Compression. CoRR abs/1908.04187 (2019) - 2018
- [c71]Chao-Yuan Wu, Manzil Zaheer, Hexiang Hu, R. Manmatha, Alexander J. Smola, Philipp Krähenbühl:
Compressed Video Action Recognition. CVPR 2018: 6026-6035 - 2017
- [c70]R. Manmatha, Chao-Yuan Wu, Alexander J. Smola, Philipp Krähenbühl:
Sampling Matters in Deep Embedding Learning. ICCV 2017: 2859-2867 - [i2]Chao-Yuan Wu, R. Manmatha, Alexander J. Smola, Philipp Krähenbühl:
Sampling Matters in Deep Embedding Learning. CoRR abs/1706.07567 (2017) - [i1]Chao-Yuan Wu, Manzil Zaheer, Hexiang Hu, R. Manmatha, Alexander J. Smola, Philipp Krähenbühl:
Compressed Video Action Recognition. CoRR abs/1712.00636 (2017) - 2016
- [c69]Venkatesh N. Murthy, Vivek K. Singh, Terrence Chen, R. Manmatha, Dorin Comaniciu:
Deep Decision Network for Multi-class Image Classification. CVPR 2016: 2240-2248 - [c68]Ismet Zeki Yalniz, Douglas Gray, R. Manmatha:
Efficient Exploration of Text Regions in Natural Scene Images Using Adaptive Image Sampling. ECCV Workshops (1) 2016: 427-439 - [c67]Venkatesh N. Murthy, Avinash Sharma, Visesh Chari, R. Manmatha:
Image Annotation using Multi-scale Hypergraph Heat Diffusion Framework. ICMR 2016: 299-303 - 2015
- [j15]Martin Halvey, Philip J. McParlane, Joemon M. Jose, Keith van Rijsbergen, Stefan M. Rüger, R. Manmatha, Mohan S. Kankanhalli:
ICMR 2014: 4th ACM International Conference on Multimedia Retrieval. SIGIR Forum 49(1): 10-15 (2015) - [c66]Venkatesh N. Murthy, Subhransu Maji, R. Manmatha:
Automatic Image Annotation using Deep Learning Representations. ICMR 2015: 603-606 - 2014
- [j14]K. Pramod Sankar, R. Manmatha, C. V. Jawahar:
Large scale document image retrieval by automatic word annotation. Int. J. Document Anal. Recognit. 17(1): 1-17 (2014) - [j13]Thomas B. Moeslund, Omar Javed, Yu-Gang Jiang, R. Manmatha:
Special issue on Multimedia Event Detection. Mach. Vis. Appl. 25(1): 1-4 (2014) - [c65]David Fernández Mota, R. Manmatha, Alicia Fornés, Josep Lladós:
Sequential Word Spotting in Historical Handwritten Documents. Document Analysis Systems 2014: 101-105 - [c64]Ethem F. Can, R. Manmatha:
Modeling Concept Dependencies for Event Detection. ICMR 2014: 289 - [c63]Venkatesh N. Murthy, Ethem F. Can, R. Manmatha:
A Hybrid Model for Automatic Image Annotation. ICMR 2014: 369 - [c62]Ethem F. Can, W. Bruce Croft, R. Manmatha:
Incorporating query-specific feedback into learning-to-rank models. SIGIR 2014: 1035-1038 - [e3]Mohan S. Kankanhalli, Stefan M. Rüger, R. Manmatha, Joemon M. Jose, Keith van Rijsbergen:
International Conference on Multimedia Retrieval, ICMR '14, Glasgow, United Kingdom - April 01 - 04, 2014. ACM 2014, ISBN 978-1-4503-2782-4 - 2013
- [c61]Ethem F. Can, Hüseyin Oktay, R. Manmatha:
Predicting retweet count using visual cues. CIKM 2013: 1481-1484 - [c60]Ethem F. Can, R. Manmatha:
Formulating Action Recognition as a Ranking Problem. CVPR Workshops 2013: 251-256 - [c59]David Wemhoener, Ismet Zeki Yalniz, R. Manmatha:
Creating an Improved Version Using Noisy OCR from Multiple Editions. ICDAR 2013: 160-164 - [c58]James Allan, Jeff Dalton, John Foley, R. Manmatha, Venkatesh N. Murthy, David Wemhoener:
Short Text Queries for Video Retrieval Multimedia event Detection at TRECVID 2013. TRECVID 2013 - [c57]Jingen Liu, Hui Cheng, Omar Javed, Qian Yu, Ishani Chakraborty, Weiyu Zhang, Ajay Divakaran, Harpreet S. Sawhney, James Allan, R. Manmatha, John Foley, Mubarak Shah, Afshin Dehghan, Michael Witbrock, Jon Curtis, Gerald Friedland:
SRI-Sarnoff AURORA System at TRECVID 2013 Multimedia Event Detection and Recounting. TRECVID 2013 - [e2]Volkmar Frinken, Bill Barrett, R. Manmatha, Volker Märgner:
Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, HIP@ICDAR 2013, Washington, DC, USA, August 24, 2013. ACM 2013, ISBN 978-1-4503-2115-0 - 2012
- [j12]Volkmar Frinken, Andreas Fischer, R. Manmatha, Horst Bunke:
A Novel Word Spotting Method Based on Recurrent Neural Networks. IEEE Trans. Pattern Anal. Mach. Intell. 34(2): 211-224 (2012) - [c56]Ismet Zeki Yalniz, R. Manmatha:
An Efficient Framework for Searching Text in Noisy Document Images. Document Analysis Systems 2012: 48-52 - [c55]David Fernández, Josep Lladós, Alicia Fornés, R. Manmatha:
On Influence of Line Segmentation in Efficient Word Segmentation in Old Manuscripts. ICFHR 2012: 763-768 - [c54]Ismet Zeki Yalniz, R. Manmatha:
Finding translations in scanned book collections. SIGIR 2012: 465-474 - [c53]Marc-Allen Cartright, Ethem F. Can, William Dabney, Jeff Dalton, Logan Giorda, Kriste Krstovski, Xiaoye Wu, Ismet Zeki Yalniz, James Allan, R. Manmatha, David A. Smith:
A framework for manipulating and searching multiple retrieval types. SIGIR 2012: 1001 - [c52]Hui Cheng, Jingen Liu, Saad Ali, Omar Javed, Qian Yu, Amir Tamrakar, Ajay Divakaran, Harpreet S. Sawhney, R. Manmatha, James Allan, Alexander G. Hauptmann, Mubarak Shah, Subhabrata Bhattacharya, Afshin Dehghan, Gerald Friedland, Benjamin Elizalde, Trevor Darrell, Michael Witbrock, Jon Curtis:
SRI-Sarnoff AURORA System at TRECVID 2012 Multimedia Event Detection and Recounting. TRECVID 2012 - 2011
- [c51]David A. Smith, R. Manmatha, James Allan:
Mining relational structure from millions of books: position paper. BooksOnline 2011: 49-54 - [c50]Ismet Zeki Yalniz, Ethem F. Can, R. Manmatha:
Partial duplicate detection for large book collections. CIKM 2011: 469-474 - [c49]Raman Jain, Volkmar Frinken, C. V. Jawahar, Raghavan Manmatha:
BLSTM Neural Network Based Word Retrieval for Hindi Documents. ICDAR 2011: 83-87 - [c48]Ismet Zeki Yalniz, Raghavan Manmatha:
A Fast Alignment Scheme for Automatic OCR Evaluation of Books. ICDAR 2011: 754-758 - [c47]Hui Cheng, Amir Tamrakar, Saad Ali, Qian Yu, Omar Javed, Jingen Liu, Ajay Divakaran, Harpreet S. Sawhney, Alexander G. Hauptmann, Mubarak Shah, Subhabrata Bhattacharya, Michael Witbrock, Jon Curtis, Gerald Friedland, Robert Mertens, Trevor Darrell, R. Manmatha, James Allan:
Team SRI-Sarnoff's AURORA System @ TRECVID 2011. TRECVID 2011 - [e1]Bill Barrett, Michael S. Brown, R. Manmatha, Jake Gehring:
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, HIP@ICDAR 2011, Beijing, China, September 16-17, 2011. ACM 2011, ISBN 978-1-4503-0916-5 - 2010
- [c46]Ainhoa Llorente, Raghavan Manmatha, Stefan M. Rüger:
Image retrieval using Markov Random Fields and global image features. CIVR 2010: 243-250 - [c45]K. Pramod Sankar, C. V. Jawahar, Raghavan Manmatha:
Nearest neighbor based collection OCR. Document Analysis Systems 2010: 207-214 - [c44]Volkmar Frinken, Andreas Fischer, Horst Bunke, R. Manmatha:
Adapting BLSTM Neural Network Based Keyword Spotting Trained on Modern Data to Historical Documents. ICFHR 2010: 352-357
2000 – 2009
- 2009
- [j11]Nicholas R. Howe, Shaolei Feng, R. Manmatha:
Finding words in alphabet soup: Inference on freeform character recognition for historical scripts. Pattern Recognit. 42(12): 3338-3347 (2009) - [c43]Venkat Rasagna, Anand Kumar, C. V. Jawahar, Raghavan Manmatha:
Robust Recognition of Documents by Fusing Results of Word Clusters. ICDAR 2009: 566-570 - 2008
- [c42]Shaolei Feng, Raghavan Manmatha:
A discrete direct retrieval model for image and video retrieval. CIVR 2008: 427-436 - [c41]Tingxin Yan, Deepak Ganesan, R. Manmatha:
Distributed image search in camera sensor networks. SenSys 2008: 155-168 - [p1]R. Manmatha:
Document Image Analysis and Recognition. Wiley Encyclopedia of Computer Science and Engineering 2008 - 2007
- [j10]Toni M. Rath, R. Manmatha:
Word spotting for historical documents. Int. J. Document Anal. Recognit. 9(2-4): 139-152 (2007) - [j9]Toni M. Rath, R. Manmatha:
Word spotting for historical documents. Int. J. Document Anal. Recognit. 9(2-4): 299 (2007) - [j8]E. Micah Kornfield, R. Manmatha, James Allan:
Further explorations in text alignment with handwritten documents. Int. J. Document Anal. Recognit. 10(1): 39-52 (2007) - [c40]Anand Kumar, C. V. Jawahar, R. Manmatha:
Efficient Search in Document Image Collections. ACCV (1) 2007: 586-595 - 2006
- [c39]Jamie L. Rothfeder, R. Manmatha, Toni M. Rath:
Aligning Transcripts to Automatically Segmented Handwritten Manuscripts. Document Analysis Systems 2006: 84-95 - [c38]Shaolei Feng, R. Manmatha, Andrew McCallum:
Exploring the Use of Conditional Random Field Models and HMMs for Historical Handwritten Document Recognition. DIAL 2006: 30-37 - [c37]Shaolei Feng, R. Manmatha:
A hierarchical, HMM-based automatic evaluation of OCR accuracy for a digital library of books. JCDL 2006: 109-118 - 2005
- [j7]R. Manmatha, Jamie L. Rothfeder:
A Scale Space Approach for Automatically Segmenting Words from Historical Handwritten Documents. IEEE Trans. Pattern Anal. Mach. Intell. 27(8): 1212-1225 (2005) - [j6]Raghavan Manmatha, Stefan M. Rüger, Alexander G. Hauptmann:
Multimedia information retrieval: workshop report. SIGIR Forum 39(2): 40-41 (2005) - [c36]Natasha Mohanty, Toni M. Rath, Audrey Lee, R. Manmatha:
Learning Shapes for Image Classification and Retrieval. CIVR 2005: 589-598 - [c35]Shih-Fu Chang, R. Manmatha, Tat-Seng Chua:
Combining text and audio-visual features in video indexing. ICASSP (5) 2005: 1005-1008 - [c34]Shaolei Feng, Raghavan Manmatha:
Classification Models for Historical Manuscript Recognition. ICDAR 2005: 528-532 - [c33]Giridharan Iyengar, Pinar Duygulu, Shaolei Feng, Pavel Ircing, Sanjeev Khudanpur, Dietrich Klakow, M. R. Krause, Raghavan Manmatha, Harriet J. Nock, D. Petkova, Brock Pytlik, Paola Virga:
Joint visual-text modeling for automatic retrieval of multimedia documents. ACM Multimedia 2005: 21-30 - [c32]Nicholas R. Howe, Toni M. Rath, R. Manmatha:
Boosted decision trees for word recognition in handwritten document retrieval. SIGIR 2005: 377-383 - 2004
- [c31]Jiwoon Jeon, R. Manmatha:
Using Maximum Entropy for Automatic Image Annotation. CIVR 2004: 24-32 - [c30]Donald Metzler, R. Manmatha:
An Inference Network Approach to Image Retrieval. CIVR 2004: 42-50 - [c29]Shaolei Feng, Raghavan Manmatha, Victor Lavrenko:
Multiple Bernoulli Relevance Models for Image and Video Annotation. CVPR (2) 2004: 1002-1009 - [c28]E. Micah Kornfield, R. Manmatha, James Allan:
Text Alignment with Handwritten Documents. DIAL 2004: 195-211 - [c27]Victor Lavrenko, Toni M. Rath, R. Manmatha:
Holistic Word Recognition for Handwritten Historical Documents. DIAL 2004: 278-287 - [c26]Victor Lavrenko, Shaolei Feng, Raghavan Manmatha:
Statistical models for automatic video annotation and retrieval. ICASSP (3) 2004: 1044-1047 - [c25]Toni M. Rath, R. Manmatha, Victor Lavrenko:
A search engine for historical manuscript images. SIGIR 2004: 369-376 - 2003
- [j5]James Allan, Jay Aslam, Nicholas J. Belkin, Chris Buckley, James P. Callan, W. Bruce Croft, Susan T. Dumais, Norbert Fuhr, Donna Harman, David J. Harper, Djoerd Hiemstra, Thomas Hofmann, Eduard H. Hovy, Wessel Kraaij, John D. Lafferty, Victor Lavrenko, David D. Lewis, Liz Liddy, R. Manmatha, Andrew McCallum, Jay M. Ponte, John M. Prager, Dragomir R. Radev, Philip Resnik, Stephen E. Robertson, Ronald Rosenfeld, Salim Roukos, Mark Sanderson, Richard M. Schwartz, Amit Singhal, Alan F. Smeaton, Howard R. Turtle, Ellen M. Voorhees, Ralph M. Weischedel, Jinxi Xu, ChengXiang Zhai:
Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002. SIGIR Forum 37(1): 31-47 (2003) - [c24]Toni M. Rath, R. Manmatha:
Word Image Matching Using Dynamic Time Warping. CVPR (2) 2003: 521-527 - [c23]Toni M. Rath, R. Manmatha:
Features for Word Spotting in Historical Manuscripts. ICDAR 2003: 218-222 - [c22]Katrina M. Hanna, Brian Neil Levine, R. Manmatha:
Mobile Distributed Information Retrieval for Highly-Partitioned Networks. ICNP 2003: 38- - [c21]Victor Lavrenko, R. Manmatha, Jiwoon Jeon:
A Model for Learning the Semantics of Pictures. NIPS 2003: 553-560 - [c20]Jiwoon Jeon, Victor Lavrenko, R. Manmatha:
Automatic image annotation and retrieval using cross-media relevance models. SIGIR 2003: 119-126 - 2002
- [c19]R. Manmatha, Ao Feng, James Allan:
A critical examination of TDT's cost function. SIGIR 2002: 403-404 - 2001
- [c18]Madirakshi Das, R. Manmatha:
Automatic Segmentation and Indexing in a Database of Bird Images. ICCV 2001: 351-358 - [c17]R. Manmatha, Toni M. Rath, Fangfang Feng:
Modeling Score Distributions for Combining the Outputs of Search Engines. SIGIR 2001: 267-275
1990 – 1999
- 1999
- [j4]Madirakshi Das, R. Manmatha, Edward M. Riseman:
Indexing Flower Patent Images Using Domain Knowledge. IEEE Intell. Syst. 14(5): 24-33 (1999) - [j3]Victor Wu, R. Manmatha, Edward M. Riseman:
TextFinder: An Automatic System to Detect and Recognize Text In Images. IEEE Trans. Pattern Anal. Mach. Intell. 21(11): 1224-1229 (1999) - [j2]Rohini K. Srihari, Zhongfei Zhang, R. Manmatha, Chandu Ravela:
Indexing and Retrieval, SIGIR'99 Workshop Summary. SIGIR Forum 33(1): 34-35 (1999) - [c16]R. Manmatha, Nitin Srimal:
Scale Space Technique for Word Segmentation in Handwritten Documents. Scale-Space 1999: 22-33 - 1998
- [j1]Rohini K. Srihari, Zhongfei Zhang, R. Manmatha, Chandu Ravela:
Multimedia Indexing and Retrieval, Summary Report. SIGIR Forum 32(2): 29-30 (1998) - [c15]Victor Wu, Raghavan Manmatha:
Document image cleanup and binarization. Document Recognition 1998: 263- - [c14]Raghavan Manmatha, S. Chandu Ravela, Y. Chitti:
Computing local and global similarity in images. Human Vision and Electronic Imaging 1998: 540-551 - [c13]Srinivas Ravela, R. Manmatha:
Retrieving Images by Appearance. ICCV 1998: 608-613 - [c12]Srinivas Ravela, Raghavan Manmatha:
On computing global similarity in images. WACV 1998: 82-87 - [c11]Madirakshi Das, Raghavan Manmatha, Edward M. Riseman:
Indexing flowers by color names using domain knowledge-driven segmentation. WACV 1998: 94-99 - 1997
- [c10]Victor Wu, R. Manmatha, Edward M. Riseman:
Finding Text in Images. ACM DL 1997: 3-12 - [c9]Raghavan Manmatha, S. Chandu Ravela:
Syntactic characterization of appearance and its application to image retrieval. Human Vision and Electronic Imaging 1997: 484-495 - [c8]Srinivas Ravela, R. Manmatha:
Image Retrieval by Appearance. SIGIR 1997: 278-285 - 1996
- [c7]R. Manmatha, Chengfeng Han, Edward M. Riseman:
Word Spotting: A New Approach to Indexing Handwriting. CVPR 1996: 631-637 - [c6]R. Manmatha, Chengfeng Han, Edward M. Riseman, W. Bruce Croft:
Indexing Handwriting Using Word Matching. Digital Libraries 1996: 151-159 - [c5]Srinivas Ravela, R. Manmatha, Edward M. Riseman:
Image Retrieval Using Scale-Space Matching. ECCV (1) 1996: 273-282 - 1994
- [c4]R. Manmatha:
A framework for recovering affine transforms using points, lines or image brightnesses. CVPR 1994: 141-146 - [c3]R. Manmatha:
Measuring the Affine Transform Using Gaussian Filters. ECCV (2) 1994: 159-164 - 1993
- [c2]Raghavan Manmatha, John Oliensis:
Extracting affine deformations from image patches. I. Finding scale and rotation. CVPR 1993: 754-755
1980 – 1989
- 1989
- [c1]Rabindranath Dutta, R. Manmatha, Lance R. Williams, Edward M. Riseman:
A data set for quantitative motion analysis. CVPR 1989: 159-164
last updated on 2025-03-27 00:07 CET by the dblp team
all metadata released as open data under CC0 1.0 license