AV Aho, JD Ullman, 1972.
The Theory of Parsing, Translation, and Compiling.
Prentice Hall
Mohammad Abolhassani, Norbert Fuhr and Norbert Gövert,
Information extraction and automatic markup for XML documents,
In Blanken et al, 2003, 159--174,
Springer. doi: 10.1007/978-3-540-45194-5_11
Backus, J.W.,
The Syntax and Semantics of the Proposed International Algebraic Language of Zürich ACM-GAMM Conference,
Proceedings of the International Conference on Information Processing, UNESCO,
1959, pp.125-132.
Brzozowski, J. A. 1964. Derivatives of Regular Expressions. J. ACM 11,
4 (Oct. 1964), 481-494.
doi: 10.1145/321239.321249
Chomsky, Noam (1956). "Three models for the description of language".
IRE Transactions on Information Theory 2: 113–124.
doi: 10.1109/TIT.1956.1056813
Peter Fankhauser and Yi Xu,
MarkItUp! - An incremental approach to document structure recognition,
Electronic Publishing, 1993, pages 447-456
Eila Kuikka and Martti Penttonen,
Transformation of Structured Documents,
Electronic Publishing Origination, Dissemination and Design, 8(4), 1995.
Bertin Klein and Peter Fankhauser,
Error tolerant Document Structure Analysis,
International Journal on Digital Libraries,
1997, volume 1, pages 344-357.
doi: 10.1007/s007990050028
Lewis, P. M. and Stearns, R. E. 1968. Syntax-Directed Transduction.
J. ACM 15, 3 (Jul. 1968), 465-488.
doi: 10.1145/321466.321477
Greger Lindén,
Structured Document Transformations,
1997
Moore, E. F., [1956]. Gedanken experiments on sequential machines,
Automata Studies, Princeton Univ. Press,
Princeton, New Jersey, pp. 129-153.
Makoto Murata,
Transformation of Documents and Schemas by Patterns and Contextual Conditions,
Proceedings of the Third International Workshop on Principles of Document Processing (PODP 96),
1997, pages 153-169, Springer-Verlag. doi: 10.1007/3-540-63620-X_61
Sperberg-McQueen, C. M. Applications of Brzozowski derivatives to XML schema processing.
In Extreme Markup Languages 2005, page 26, Internet, 2005. IDEAlliance.
Kazem Taghva, Allen Condit, and Julie Borsack,
Autotag: A tool for creating structured document collections from printed materials,
Electronic Publishing, Artistic Imaging, and Digital Typography, Proc. of the EP ’98 and RIDT
’98 Conferences, 1998, pages 420-431,
Springer-Verlag
Tang, X. 2003 A High-Level Specification Language for Structured Document Transformation. Doctoral Thesis.
UMI Order Number: AAINQ84932., University of Waterloo.
Dénes Vargha,
Schema method: a framework for correcting grammatically ill-formed input
Proceedings of the 12th conference on Computational linguistics - Volume 1
Computer and Automation Institute, Hungarian Academy of Sciences
Pages 341 - 347
Association for Computational Linguistics Morristown, NJ, USA ©1988
ISBN: 9638431563. doi: 10.3115/991635.991705
Standard Generalized Markup Language (SGML)
International Organization for Standardization ISO 8879:1986
Grammar-driven Markup Generation
Mario Blažević
Senior software developer
Stilo International plc.
Abstract
This paper defines the concept of grammar-driven normalization of incomplete instances, sketches its implementation
for RELAX NG schema and XML documents, and presents an example of its practical use for automated document
conversion.
Grammar-driven Markup Generation
Balisage: The Markup Conference 2010
August 3 - 6, 2010
The materials listed below were provided by the speaker as supplements to a
presentation at Balisage. These materials may include the slides or visuals used in the
presentation; supplementary material, such as code samples or a demonstration application;
and/or the paper accompanying the presentation (if it has not been provided in XML). These
materials have been zipped for easy download and are identified by a brief description of
the contents. The materials themselves are untouched
, that is, they
have not been tested or edited by Balisage: The Markup Conference or by Mulberry
Technologies, Inc. As such, they are included on this website AS IS
,
i.e., as provided by the speaker, with no warranties, express or otherwise, made by Balisage
or Mulberry.
Slides and Materials