Leventhal, Michael and
Eric Lemoine 2009. The XML chip at 6 years. Proceedings of International Symposium on
Processing XML Efficiently 2009, Montréal. doi: 10.4242/BalisageVol4.Leventhal01.
Salz, Richard,
Heather Achilles, and David Maze. 2009. Hardware and software trade-offs in the IBM
DataPower XML XG4 processor card. Proceedings of International Symposium on Processing XML
Efficiently 2009, Montréal. doi: 10.4242/BalisageVol4.Salz01.
Cameron, Robert D. 2007. A Case Study
in SIMD Text Processing with Parallel Bit Streams UTF-8 to UTF-16 Transcoding. Proceedings
of 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming 2008, Salt
Lake City, Utah. On the Web at http://research.ihost.com/ppopp08/. doi: 10.1145/1345206.1345222.
Cameron, Robert D.,
Kenneth S Herdy, and Dan Lin. 2008. High Performance XML Parsing Using Parallel Bit Stream
Technology. Proceedings of CASCON 2008. 13th ACM SIGPLAN Symposium on Principles and
Practice of Parallel Programming 2008, Toronto. doi: 10.1145/1463788.1463811.
Cameron, Robert D. and Dan
Lin. 2009. Architectural Support for SWAR Text Processing with Parallel Bit Streams: The
Inductive Doubling Principle. Proceedings of ASPLOS 2009, Washington, DC. doi: 10.1145/1508244.1508283.
u8u16 - A High-Speed UTF-8 to UTF-16
Transcoder Using Parallel Bit Streams Technical Report 2007-18. 2007. School of Computing
Science Simon Fraser University, June 21 2007.
Hilewitz, Y. and Ruby B. Lee.
2006. Fast Bit Compression and Expansion with Parallel Extract and Parallel Deposit
Instructions. Proceedings of the IEEE 17th International Conference on Application-Specific
Systems, Architectures and Processors (ASAP), pp. 65-72, September 11-13, 2006. doi: 10.1109/ASAP.2006.33.
Kay, Michael Y. 2008. Ten Reasons Why Saxon
XQuery is Fast, IEEE Data Engineering Bulletin, December 2008.
Hitchens, Ron. Java NIO. O'Reilly, 2002.
Parallel Bit Stream Technology as a Foundation for XML Parsing Performance
Rob Cameron
Professor of Computing Science
Simon Fraser University
Ken Herdy
Graduate Student, School of Computing Science
Simon Fraser University
Ehsan Amiri
Graduate Student, School of Computing Science
Simon Fraser University
Abstract
By first transforming the octets (bytes) of XML texts into eight parallel bit
streams, the SIMD features of commodity processors can be exploited for parallel
processing of blocks of 128 input bytes at a time. Established transcoding and parsing
techniques are reviewed followed by new techniques including parsing with bitstream
addition. Further opportunities are discussed in light of expected advances in CPU
architecture and compiler technology. Implications for various APIs and information
models are presented as well opportunities for collaborative open-source
development.
Parallel Bit Stream Technology as a Foundation for XML Parsing Performance
International Symposium on Processing XML Efficiently: Overcoming Limits on Space,
Time, or Bandwidth
August 10, 2009
The materials listed below were provided by the speaker as supplements to a
presentation at Balisage. These materials may include the slides or visuals used in the
presentation; supplementary material, such as code samples or a demonstration application;
and/or the paper accompanying the presentation (if it has not been provided in XML). These
materials have been zipped for easy download and are identified by a brief description of
the contents. The materials themselves are untouched
, that is, they
have not been tested or edited by Balisage: The Markup Conference or by Mulberry
Technologies, Inc. As such, they are included on this website AS IS
,
i.e., as provided by the speaker, with no warranties, express or otherwise, made by Balisage
or Mulberry.
Slides and Materials