Previous |  Up |  Next

Article

Keywords:
DML-CZ; metadata generation; XML; MathML; PDF; copy-math; metadata generation; Tralics
Summary:
The quality of digital mathematical library depends on the formats and quality of data it offers. We show several enhancements of (meta)data of the Czech Digital Mathematics Library DML-CZ. We discuss possible minimalist modification of regular LaTeX documents that would simplify generating basic metadata that describes the article in an XML/MathML format. We also show a proof of concept of a method that enables us to include LaTeX source code of mathematical expressions into pdfTeX-generated PDFs in such a way that the reader can Copy & Paste the code from his PDF viewer. This code, hidden in the PDF file, can also be used for LaTeX math indexing.
References:
1. Archivum Mathematicum. [online], http://www.emis.de/journals/AM/, Masaryk University, Brno, Czech Republic. Last modified December 18, 2009. [cit. 2010-04-25].
2. Centre de diffusion de revues académiques mathématiques. [online], http://www.cedram.org/, [Center for diffusion of mathematic journals]. [cit. 2008-05-25].
3. Czech Digital Mathematics Library. [online], http://dml.cz/, [cit. 2010-04-24]. Zbl 1170.68487
4. EuDML: The European Digital Mathematics Library. [online], http://www.eudml.eu/, This page was last modified on 20 January 2010, at 08:09. [cit. 2010-04-25].
5. Hatlapatka, R., Sojka, P.: PDF Enhancements Tools for a Digital Library. In: Sojka, P. (ed.) Proceedings of DML 2010, pp. 69–76. Masaryk University Press, Paris, France (Jul 2010).
6. Infty Project: Research Project on Mathematical Information Processing. [online], http://www.inftyproject.org/en/, [cit. 2010-06-02].
7. Tralics: a LaTeX to XML translator. http://www-sop.inria.fr/apics/tralics/, Last modified $Date: 2009/11/24 17:17:03 $ [cit. 2010-04-24].
8. Bouche, T.: A pdfLaTeX-based automated journal production system. TUGboat 27(1), 45–50 (2006), In Proceedings of EuroTeX 2006.
9. Grimm, J.: Tralics, a LaTeX to XML Translator. TUGboat 24(3), 377–388 (2003), In Proceedings of EuroTeX.
10. Růžička, M.: Automated Processing of TeX-Typeset Articles for a Digital Library. In: Sojka, P. (ed.) DML 2008 – Towards Digital Mathematics Library. pp. 167–176 (2008), Birmingham, UK, July 27th, 2008.
11. Suzuki, M., Kanahori, T., Ohtake, N., Yamaguchi, K.: An Integrated OCR Software for mathematical Documents and Its Output with Accessibility. In: Computers Helping people with Special Needs. Lecture Notes in Computer Sciences, vol. 3119, pp. 648–655. Springer (2004), 9th International Conference ICCHP 2004, Paris, July 2004.
Partner of
EuDML logo