Balisage Paper: A Virtualization-Based Retrieval and Update API for XML-Encoded Corpora^[1]

Balisage: The Markup Conference 2010
August 3 - 6, 2010

The materials listed below were provided by the speaker as supplements to a presentation at Balisage. These materials may include the slides or visuals used in the presentation; supplementary material, such as code samples or a demonstration application; and/or the paper accompanying the presentation (if it has not been provided in XML). These materials have been zipped for easy download and are identified by a brief description of the contents. The materials themselves are untouched, that is, they have not been tested or edited by Balisage: The Markup Conference or by Mulberry Technologies, Inc. As such, they are included on this website AS IS, i.e., as provided by the speaker, with no warranties, express or otherwise, made by Balisage or Mulberry.

Slides and Materials

Bal2010briq1111-slides.zip: Presentation slides in Adobe PDF.

Cyril Briquet and Pascale Renders. «Une approche reposante (RESTful) des aspects opérationnels de la rétroconversion du Französisches Etymologisches Wörterbuch (FEW)». Proc. Liège Day in Processing of Gallo-Roman Sources (TraSoGal), May 2009.

Éva Büchi. «Les Structures du /Französisches Etymologisches Wörterbuch/. Recherches métalexicographiques et métalexicologiques», Niemeyer, Tübingen, 1996.

David Carmel, Yoelle S. Maarek, Matan Mandelbrod, Yosi Mass and Aya Soffer. «Searching XML documents via XML fragments». Proc. SIGIR, Toronto, ON, 2003.

Jacques Dendien and Jean-Marie Pierrel. «Le Trésor de la Langue Française informatisé. Un exemple d’informatisation d’un dictionnaire de langue de référence». In Traitement Automatique des Langues 43 (2), 2003.

Steven DeRose. «Markup Overlap: A Review and a Horse». In Proc. Extreme Markup Languages, Montréal, Québec, August 2004.

Document Object Model (DOM). [online] [cited April 15, 2010] http://www.w3.org/DOM/

General Architecture for Text Engineering. [online] [cited June 25, 2010] http://gate.ac.uk/

Information Extraction. [online] [cited June 25, 2010] http://en.wikipedia.org/wiki/Information_extraction

IBM Trainable Information Extraction Systems. [online] [cited June 25, 2010] http://www.research.ibm.com/IE/

Julia Imhof. «Evaluation Strategies for XQuery Full-Text». M.S. Thesis, ETH Zurich, September 2008.

Jaap Kamps, Maarten Marx, Maarten de Rijke and Börkur Sigurbjörnsson. «Articulating Information Needs in XML Query Languages». In ACM Transactions on Information Systems 24 (4), October 2006. doi:https://doi.org/10.1145/1185877.1185879

Luca Lini, Daniella Lombardini, Michele Paoli, Dario Colazzo and Carlo Sartiani. «XTReSy: A Text Retrieval System for XML documents». In D. Buzzetti, H. Short, and G. Pancalddella, editors, Augmenting Comprehension: Digital Tools for the History of Ideas. Office for Humanities Communication Publications, King's College, London, 2001.

Oxford English Dictionary. [online] [cited June 25, 2010] http://www.oed.com/

Oxford English Dictionary. [online] [cited June 25, 2010] http://en.wikipedia.org/wiki/Oxford_English_Dictionary

Französisches Etymologisches Wörterbuch. [online] [cited April 15, 2010] http://www.atilf.fr/few

William Pugh. «Skip lists: a probabilistic alternative to balanced trees». Communications of the ACM 33 (6), June 1990. doi:https://doi.org/10.1145/78973.78977

Liam R. E. Quin. «Text Retrieval for XML-Encoded Corpora: A Lexical Approach». Proc. Balisage, August 2008. doi:https://doi.org/10.4242/BalisageVol1.Quin01

Pascale Renders. «L’informatisation du Französisches Etymologisches Wörterbuch : quels objectifs, quelles possibilités ?». Proc. Congrès International de Linguistique et de Philologie Romanes, Innsbruck, Austria, September 2007.

Pascale Renders and Cyril Briquet. «Conception d’algorithmes de rétroconversion». Proc. Liège Day in Processing of Gallo-Roman Sources (TraSoGal), May 2009.

Regular Fragmentations [online] [cited April 15, 2010] http://www.simonstl.com/projects/fragment/

Neil Savage. «New Search Challenges and Opportunities». Communications of the ACM 53 (1), January 2010. doi:https://doi.org/10.1145/1629175.1629183

Simon St.Laurent. «Treating Complex Textual Content as Markup». Proc. Extreme Markup Languages, Montréal, Québec, 2001.

String Projection [online] [cited April 15, 2010] http://en.wikipedia.org/wiki/String_projection

«Trésor de la Langue Française informatisé» (TLFi) CD-ROM, CNRS Editions, Paris, 2004.

Trésor de la Langue Française informatisé [online] [cited April 14, 2010] http://atilf.atilf.fr/tlf.htm

Xavier Tannier, Jean-Jacques Girardot and Mihaela Mathieu. «Classifying XML Tags through Reading Contexts». Proc. ACM Symposium on Document Engineering, Bristol, UK, 2005. doi:https://doi.org/10.1145/1096601.1096638

Xavier Tannier. «Traiter les documents XML avec les contextes de lecture». Traitement Automatique des Langues 47 (1), 2006.

Xavier Tannier. «Extraction et recherche d'information en langage naturel dans les documents semi-structurés». PhD Dissertation, Ecole Nationale Supérieure des Mines, Saint-Etienne, September 2006.

John van der Voort van der Kleij. «Reverse Lemmatizing of the Dictionary of Middle Dutch (1885-1929) Using Pattern Matching». Proc. Conf. Computational Lexicography and Text Research, Budapest, Hungary, 2005.

Walther von Wartburg et al. «Französisches Etymologisches Wörterbuch. Eine darstellung des galloromanischen sprachschatzes», 25 volumes, Bonn/Heidelberg/Leipzig-Berlin/Bâle, Klopp/Winter/Teubner/Zbinden, 1922-2002.

XPath [online] [cited April 15, 2010] http://en.wikipedia.org/wiki/XPath

XQuery [online] [cited April 15, 2010] http://en.wikipedia.org/wiki/XQuery

XQuery and XPath Full Text 1.0 [online] [cited June 25, 2010] http://www.w3.org/TR/xpath-full-text-10/

Xavier Franc. XQuery Full-Text for the impatient [online] [cited June 25, 2010] http://www.xmlmind.com/_tutorials/XQueryFullText/index.html

Xavier Franc. XQuery Update for the impatient: A quick introduction to the XQuery Update Facility [online] [cited April 15, 2010] http://www.xmlmind.com/_tutorials/XQueryUpdate/index.html

XQuery Update Facility 1.0 [online] [cited April 15, 2010] http://www.w3.org/TR/xquery-update-10/

Author's keywords for this paper:

XML; corpus; API; text; retrieval; update; algorithm; virtual; virtualization; string; context

BalisageThe Markup Conference2010

Balisage Paper: A Virtualization-Based Retrieval and Update API for XML-Encoded Corpora^[1]

Slides and Materials

Author's keywords for this paper:

Balisage Series on Markup Technologies

Balisage Paper: A Virtualization-Based Retrieval and Update API for XML-Encoded Corpora[1]

Slides and Materials

Author's keywords for this paper:

Balisage Series on Markup Technologies

Balisage Paper: A Virtualization-Based Retrieval and Update API for XML-Encoded Corpora^[1]