BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild
Xixi Xu, Zhongang Qi, Jianqi Ma, Honglun Zhang, Ying Shan, Xiaohu Qie
[CVPR][text-image-preprocessing]
A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-Resolution
Jianqi Ma, Zhetong Liang, Lei Zhang
[CVPR][text-image-preprocessing]
Fourier Document Restoration for Robust Document Dewarping and Recognition
Chuhui Xue, Zichen Tian, Fangneng Zhan, Shijian Lu, Song Bai
[CVPR][text-image-preprocessing]
Revisiting Document Image Dewarping by Grid Regularization
Xiangwei Jiang, Rujiao Long, Nan Xue, Zhibo Yang, Cong Yao, Gui-Song Xia
[CVPR][text-image-preprocessing]
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection
Jingqun Tang, Wenqing Zhang, Hongye Liu, MingKun Yang, Bo Jiang, Guanglong Hu, Xiang Bai
[CVPR][text-detection]
Vision-Language Pre-Training for Boosting Scene Text Detectors
Sibo Song, Jianqiang Wan, Zhibo Yang, Jun Tang, Wenqing Cheng, Xiang Bai, Cong Yao
[CVPR][text-detection]
Towards End-to-End Unified Scene Text Detection and Layout Analysis
Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis
[CVPR][text-detection]
Open-Set Text Recognition via Character-Context Decoupling
Chang Liu, Chun Yang, Xu-Cheng Yin
[CVPR][text-recognition]
Pushing the Performance Limit of Scene Text Recognizer Without Human Annotation
Caiyuan Zheng, Hui Li, Seon-Min Rhee, Seungju Han, Jae-Joon Han, Peng Wang
[CVPR][text-recognition]
SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition
Mingxin Huang, Yuliang Liu, Zhenghao Peng, Chongyu Liu, Dahua Lin, Shenggao Zhu, Nicholas Yuan, Kai Ding, Lianwen Jin
[CVPR][end-to-end-ocr]
Text Spotting Transformers
Xiang Zhang, Yongwen Su, Subarna Tripathi, Zhuowen Tu
[CVPR][end-to-end-ocr]
Towards Weakly-Supervised Text Spotting Using a Multi-Task Transformer
Yair Kittenplon, Inbal Lavi, Sharon Fogel, Yarin Bar, R. Manmatha, Pietro Perona
[CVPR][end-to-end-ocr]
XYLayoutLM: Towards Layout-Aware Multimodal Networks for Visually-Rich Document Understanding
Zhangxuan Gu, Changhua Meng, Ke Wang, Jun Lan, Weiqiang Wang, Ming Gu, Liqing Zhang
[CVPR][document-image-understanding]
LaTr: Layout-Aware Transformer for Scene-Text VQA
Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha
[CVPR][document-image-understanding]
V-Doc: Visual Questions Answers With Documents
Yihao Ding, Zhe Huang, Runlin Wang, YanHang Zhang, Xianru Chen, Yuzhong Ma, Hyunsuk Chung, Soyeon Caren Han
[CVPR][document-image-understanding]
PubTables-1M: Towards Comprehensive Table Extraction From Unstructured Documents
Brandon Smock, Rohith Pesala, Robin Abraham
[CVPR][document-image-understanding]
Neural Collaborative Graph Machines for Table Structure Recognition
Hao Liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren
[CVPR][document-image-understanding]
TableFormer: Table Structure Understanding With Transformers
Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar
[CVPR][document-image-understanding]
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
Ye Yuan, Xiao Liu, Wondimu Dikubab, Hui Liu, Zhilong Ji, Zhongqin Wu, Xiang Bai
[CVPR][document-image-understanding]
XMP-Font: Self-Supervised Cross-Modality Pre-Training for Few-Shot Font Generation
Wei Liu, Fangyue Liu, Fei Ding, Qian He, Zili Yi
[CVPR][others]
Look Closer To Supervise Better: One-Shot Font Generation via Component-Based Discriminator
Yuxin Kong, Canjie Luo, Weihong Ma, Qiyuan Zhu, Shenggao Zhu, Nicholas Yuan, Lianwen Jin
[CVPR][others]
Few-Shot Font Generation by Learning Fine-Grained Local Styles
Licheng Tang, Yiyang Cai, Jiaming Liu, Zhibin Hong, Mingming Gong, Minhu Fan, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang
[CVPR][others]
Aesthetic Text Logo Synthesis via Content-Aware Layout Inferring
Yizhi Wang, Guo Pu, Wenhan Luo, Yexin Wang, Pengfei Xiong, Hongwen Kang, Zhouhui Lian
[CVPR][others]
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization
Canjie Luo, Lianwen Jin, Jingdong Chen
[CVPR][others]
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
Mengjun Cheng, Yipeng Sun, Longchao Wang, Xiongwei Zhu, Kun Yao, Jie Chen, Guoli Song, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang
[CVPR][others]
Knowledge Mining With Scene Text for Fine-Grained Recognition
Hao Wang, Junchao Liao, Tianheng Cheng, Zewen Gao, Hao Liu, Bo Ren, Xiang Bai, Wenyu Liu
[CVPR][others]
Scene Text Telescope: Text-Focused Scene Image Super-Resolution
Jingye Chen, Bin Li, Xiangyang Xue
[CVPR][text-image-preprocessing]
Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach
Xingqian Xu, Zhifei Zhang, Zhaowen Wang, Brian Price, Zhonghao Wang, Humphrey Shi
[CVPR][text-image-preprocessing]
Variational Transformer Networks for Layout Generation
Diego Martin Arroyo, Janis Postels, Federico Tombari
[CVPR][text-image-preprocessing]
Semantic-Aware Video Text Detection
Wei Feng, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu
[CVPR][text-detection]
Fourier Contour Embedding for Arbitrary-Shaped Text Detection
Yiqin Zhu, Jianyong Chen, Lingyu Liang, Zhanghui Kuang, Lianwen Jin, Wayne Zhang
[CVPR][text-detection]
Self-Attention Based Text Knowledge Mining for Text Detection
Qi Wan, Haoqin Ji, Linlin Shen
[CVPR][text-detection]
Progressive Contour Regression for Arbitrary-Shape Scene Text Detection
Pengwen Dai, Sanyi Zhang, Hua Zhang, Xiaochun Cao
[CVPR][text-detection]
MOST: A Multi-Oriented Scene Text Detector With Localization Refinement
Minghang He, Minghui Liao, Zhibo Yang, Humen Zhong, Jun Tang, Wenqing Cheng, Cong Yao, Yongpan Wang, Xiang Bai
[CVPR][text-detection]
Primitive Representation Learning for Scene Text Recognition
Ruijie Yan, Liangrui Peng, Shanyu Xiao, Gang Yao
[CVPR][text-recognition]
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
Shancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang
[CVPR][text-recognition]
Sequence-to-Sequence Contrastive Learning for Text Recognition
Aviad Aberdam, Ron Litman, Shahar Tsiper, Oron Anschel, Ron Slossberg, Shai Mazor, R. Manmatha, Pietro Perona
[CVPR][text-recognition]
Dictionary-Guided Scene Text Recognition
Nguyen Nguyen, Thu Nguyen, Vinh Tran, Minh-Triet Tran, Thanh Duc Ngo, Thien Huu Nguyen, Minh Hoai
[CVPR][text-recognition]
What if We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels
Jeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa
[CVPR][text-recognition]
MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition
Ayan Kumar Bhunia, Shuvozit Ghose, Amandeep Kumar, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song
[CVPR][text-recognition]
Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting
Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
[CVPR][text-recognition]
Implicit Feature Alignment: Learn To Convert Text Recognizer to Text Spotter
Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo
[CVPR][end-to-end-ocr]
A Multiplexed Network for End-to-End, Multilingual OCR
Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J. Liang, Praveen Krishnan, Xi Yin, Tal Hassner
[CVPR][end-to-end-ocr]
SelfDoc: Self-Supervised Document Representation Learning
Peizhao Li, Jiuxiang Gu, Jason Kuen, Vlad I. Morariu, Handong Zhao, Rajiv Jain, Varun Manjunatha, Hongfu Liu
[CVPR][document-image-understanding]
TextOCR: Towards Large-Scale End-to-End Reasoning for Arbitrary-Shaped Scene Text
Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner
[CVPR][document-image-understanding]
Scene Text Retrieval via Joint Text Detection and Similarity Learning
Hao Wang, Xiang Bai, Mingkun Yang, Shenggao Zhu, Jing Wang, Wenyu Liu
[CVPR][document-image-understanding]
TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption
Zhengyuan Yang, Yijuan Lu, Jianfeng Wang, Xi Yin, Dinei Florencio, Lijuan Wang, Cha Zhang, Lei Zhang, Jiebo Luo
[CVPR][document-image-understanding]
Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship
Jing Wang, Jinhui Tang, Mingkun Yang, Xiang Bai, Jiebo Luo
[CVPR][document-image-understanding]
BEDSR-Net: A Deep Shadow Removal Network From a Single Document Image
Yun-Hsuan Lin, Wen-Chin Chen, Yung-Yu Chuang
[CVPR][text-image-preprocessing]
Cross-Domain Document Object Detection: Benchmark Suite and Method
Kai Li, Curtis Wigington, Chris Tensmeyer, Handong Zhao, Nikolaos Barmpalios, Vlad I. Morariu, Varun Manjunatha, Tong Sun, Yun Fu
[CVPR][text-image-preprocessing]
Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection
Shi-Xue Zhang, Xiaobin Zhu, Jie-Bo Hou, Chang Liu, Chun Yang, Hongfa Wang, Xu-Cheng Yin
[CVPR][text-detection]
ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection
Yuxin Wang, Hongtao Xie, Zheng-Jun Zha, Mengting Xing, Zilong Fu, Yongdong Zhang
[CVPR][text-detection]
SCATTER: Selective Context Attentional Scene Text Recognizer
Ron Litman, Oron Anschel, Shahar Tsiper, Roee Litman, Shai Mazor, R. Manmatha
[CVPR][text-recognition]
Towards Accurate Scene Text Recognition With Semantic Reasoning Networks
Deli Yu, Xuan Li, Chengquan Zhang, Tao Liu, Junyu Han, Jingtuo Liu, Errui Ding
[CVPR][text-recognition]
SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition
Zhi Qiao, Yu Zhou, Dongbao Yang, Yucan Zhou, Weiping Wang
[CVPR][text-recognition]
On Vocabulary Reliance in Scene Text Recognition
Zhaoyi Wan, Jielei Zhang, Liang Zhang, Jiebo Luo, Cong Yao
[CVPR][text-recognition]
What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images
Xing Xu, Jiefu Chen, Jinhui Xiao, Lianli Gao, Fumin Shen, Heng Tao Shen
[CVPR][text-recognition]
Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition
Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang
[CVPR][text-recognition]
OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by Learning to Unfold
Mohamed Yousef, Tom E. Bishop
[CVPR][text-recognition]
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network
Yuliang Liu, Hao Chen, Chunhua Shen, Tong He, Lianwen Jin, Liangwei Wang
[CVPR][end-to-end-ocr]
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
Xinyu Wang, Yuliang Liu, Chunhua Shen, Chun Chet Ng, Canjie Luo, Lianwen Jin, Chee Seng Chan, Anton van den Hengel, Liangwei Wang
[CVPR][document-image-understanding]
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
Difei Gao, Ke Li, Ruiping Wang, Shiguang Shan, Xilin Chen
[CVPR][document-image-understanding]
Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA
Ronghang Hu, Amanpreet Singh, Trevor Darrell, Marcus Rohrbach
[CVPR][document-image-understanding]
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation
Sharon Fogel, Hadar Averbuch-Elor, Sarel Cohen, Shai Mazor, Roee Litman
[CVPR][others]
SwapText: Image Based Texts Transfer in Scenes
Qiangpeng Yang, Jun Huang, Wei Lin
[CVPR][others]
STEFANN: Scene Text Editor Using Font Adaptive Neural Network
Prasun Roy, Saumik Bhattacharya, Subhankar Ghosh, Umapada Pal
[CVPR][others]
Fast(er) Reconstruction of Shredded Text Documents via Self-Supervised Deep Asymmetric Metric Learning
Thiago M. Paixao, Rodrigo F. Berriel, Maria C. S. Boeres, Alessandro L. Koerich, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos
[CVPR][others]
Sequential Motif Profiles and Topological Plots for Offline Signature Verification
Elias N. Zois, Evangelos Zervas, Dimitrios Tsourounis, George Economou
[CVPR][others]
Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes
Chengquan Zhang, Borong Liang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding, Xinghao Ding
[CVPR][arXiv][text-detection][text-segmentation]
Character Region Awareness for Text Detection
Youngmin Baek, Bado Lee, Dongyoon Han, Sangdoo Yun, Hwalsuk Lee
[CVPR][arXiv][text-detection]
Pyramid Mask Text Detector
Jingchao Liu, Xuebo Liu, Jie Sheng, Ding Liang, Xin Li, Qingjie Liu
[CVPR][arXiv][code][text-detection][text-segmentation]
Towards Robust Curve Text Detection with Conditional Spatial Expansion
Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang Ling Goh
[CVPR][arXiv][text-detection][text-segmentation]
Scene Text Detection with Supervised Pyramid Context Network
Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li
[AAAI][arXiv][text-detection]
Shape Robust Text Detection with Progressive Scale Expansion Network
Wenhai Wang, Enze Xie, Xiang Li, Wenbo Hou, Tong Lu, Gang Yu, Shuai Shao
[CVPR][arXiv][code][text-detection][text-segmentation]
Learning Shape-Aware Embedding for Scene Text Detection
Zhuotao Tian, Michelle Shu, Pengyuan Lyu, Ruiyu Li, Chao Zhou, Xiaoyong Shen, Jiaya Jia
[CVPR][text-detection]