Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-30744
Publication type: Article in scientific journal
Type of review: Peer review (publication)
Title: MathNet : a data-centric approach for printed mathematical expression recognition
Authors: Schmitt-Koopmann, Felix
Huang, Elaine M.
Hutter, Hans-Peter
Stadelmann, Thilo
Darvishy, Alireza
et. al: No
DOI: 10.1109/ACCESS.2024.3404834
10.21256/zhaw-30744
Published in: IEEE Access
Volume(Issue): 12
Page(s): 76963
Pages to: 76974
Issue Date: 23-May-2024
Publisher / Ed. Institution: IEEE
ISSN: 2169-3536
Language: English
Subjects: Data-centric AI; Deep learning; Labeling; Document analysis; Mathematical expression recognition; Pattern recognition
Subject (DDC): 006: Special computer methods
Abstract: Printed mathematical expression recognition (MER) models are usually trained and tested using LaTeX-generated mathematical expressions (MEs) as input and the LaTeX source code as ground truth. As the same ME can be generated by various different LaTeX source codes, this leads to unwanted variations in the ground truth data that bias test performance results and hinder efficient learning. In addition, the use of only one font to generate the MEs heavily limits the generalization of the reported results to realistic scenarios. We propose a data-centric approach to overcome this problem, and present convincing experimental results: Our main contribution is an enhanced LaTeX normalization to map any LaTeX ME to a canonical form. Based on this process, we developed an improved version of the benchmark dataset im2latex-100k, featuring 30 fonts instead of one. Second, we introduce the real-world dataset realFormula, with MEs extracted from papers. Third, we developed a MER model, MathNet, based on a convolutional vision transformer, with superior results on all four test sets (im2latex-100k, im2latexv2, realFormula, and InftyMDB-1), outperforming the previous state of the art by up to 88.3%.
URI: https://digitalcollection.zhaw.ch/handle/11475/30744
Related research data: https://github.com/felix-schmitt/MathNet
Fulltext version: Published version
License (according to publishing contract): CC BY 4.0: Attribution 4.0 International
Departement: School of Engineering
Organisational Unit: Centre for Artificial Intelligence (CAI)
Institute of Computer Science (InIT)
Published as part of the ZHAW project: Accessible Scientific PDFs for All
Appears in collections:Publikationen School of Engineering

Files in This Item:
File Description SizeFormat 
2024_SchmittKoopmann-etal_MathNet-Printed-Mathematical-Expression-Recognition.pdf1.63 MBAdobe PDFThumbnail
View/Open
Show full item record
Schmitt-Koopmann, F., Huang, E. M., Hutter, H.-P., Stadelmann, T., & Darvishy, A. (2024). MathNet : a data-centric approach for printed mathematical expression recognition. IEEE Access, 12, 76963–76974. https://doi.org/10.1109/ACCESS.2024.3404834
Schmitt-Koopmann, F. et al. (2024) ‘MathNet : a data-centric approach for printed mathematical expression recognition’, IEEE Access, 12, pp. 76963–76974. Available at: https://doi.org/10.1109/ACCESS.2024.3404834.
F. Schmitt-Koopmann, E. M. Huang, H.-P. Hutter, T. Stadelmann, and A. Darvishy, “MathNet : a data-centric approach for printed mathematical expression recognition,” IEEE Access, vol. 12, pp. 76963–76974, May 2024, doi: 10.1109/ACCESS.2024.3404834.
SCHMITT-KOOPMANN, Felix, Elaine M. HUANG, Hans-Peter HUTTER, Thilo STADELMANN und Alireza DARVISHY, 2024. MathNet : a data-centric approach for printed mathematical expression recognition. IEEE Access. 23 Mai 2024. Bd. 12, S. 76963–76974. DOI 10.1109/ACCESS.2024.3404834
Schmitt-Koopmann, Felix, Elaine M. Huang, Hans-Peter Hutter, Thilo Stadelmann, and Alireza Darvishy. 2024. “MathNet : A Data-Centric Approach for Printed Mathematical Expression Recognition.” IEEE Access 12 (May): 76963–74. https://doi.org/10.1109/ACCESS.2024.3404834.
Schmitt-Koopmann, Felix, et al. “MathNet : A Data-Centric Approach for Printed Mathematical Expression Recognition.” IEEE Access, vol. 12, May 2024, pp. 76963–74, https://doi.org/10.1109/ACCESS.2024.3404834.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.