Acknowledgement
본 논문은 2022년도 BB21+ 사업으로 지원되었으며, 또한 과학기술정보통신부 및 정보통신기획평가원의 지역지능화혁신 인재양성(Grand ICT연구센터) 사업의 연구결과로 수행되었음(IITP-2023-2020-0-01791).
References
- Cai, Z. and Vasconcelos, N. (2018). Cascade R-CNN: Delving Into High Quality Object Detection, Preceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, Salt Lake City, UT, USA, pp. 6154-6162, 2018.
- Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., and Zagoruyko, S. (2020). End-to-End Object Detection with Transformers, Proceedings of the European Conference on Computer Vision, Aug. 23-28, pp. 213-229.
- Feng, H., Wang, Y., Zhou, W., Deng, J., and Li, H. (2021). DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction, Proceeding of ACM International Conference on Multimedia, Oct. 20-24, Chengdu, China, pp. 273-281.
- Harley, A. W., Ufkes, A., and Derpanis, K. G. (2015). Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval. Proceedings of the International Conference on Document Analysis and Recognition, Aug. 23-26, Tunis, Tunisia, pp. 991-995.
- He, T., Tian, Z., Huang, W., Shen, C., Qiao Y., and Sun, C. (2018). An End-to-End TextSpotter with Explicit Alignment and Attention, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, Salt Lake City, UT, USA, pp. 5020-5029.
- Kozlowski, M. and Weichbroth, P. (2021). Samples of Electronic Invoices, Mendeley Data. https://doi.org/10.17632/tnj49gpmtz.2.
- Lee, D. S. and Kwon, S. K. (2022). Structure Recognition Method in Various Table Types for Document Processing Automation. Journal of Korea Multimedia Society, 25(5), 695-702. https://doi.org/10.9717/kmms.2022.25.5.69
- Liao, M., Wan, Z., Yao, C., Chen, K., and Bai, X. (2020). Real-time Scene Text Detection with Differentiable Binarization. Proceedings of the AAAI conference on artificial intelligence, Feb. 7-12, New York, NY, USA, pp. 11474-11481
- Prasad, D., Gadpal, A., Kapadni, K., Visave, M., and Sultanpure, K. (2020). CascadeTabNet: An Approach for End to End Table Detection and Structure Recognition from Image-based Documents, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, June 14-19, Seattle, Wa, USA, pp. 2439-2447.
- Shaoqing, R., Kaiming, H., Girshick, R., and Sun, J. (2017). Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks, IEEE Transection on Pattern Analysis and Machine Intelligence, 39(6), 1137-1149. https://doi.org/10.1109/TPAMI.2016.2577031.
- Shi, B., Bai, X., and Yao, C. (2016). An End-to-end Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(11), 2298-2304. https://doi.org/10.1109/TPAMI.2016.2646371.
- Shi, B., Yang, M., Wang. X., Lyu, P., Yao, C., and Bai, X. (2019). ASTER: An Attentional Scene Text Recognizer with Flexible Rectification, IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(9), 2035-2048. https://doi.org/10.1109/TPAMI.2018.2848939.
- Smock, B., Pesala R., and Abraham, R. (2022). PubTables-1M: Towards Comprehensive Table Extraction from Unstructured Documents, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 19-20, New Orleans, LA, USA, pp. 4624-4632.
- Zhong, X., Bavani, E. S., and Yepes, A. J. (2020). Image-Based Table Recognition: Data, Model, and Evaluation, Proceedings of the European Conference on Computer Vison, Aug. 23-28, pp. 564-580.