Bit0
stream is the stream
of bits consisting of bit 0 of each byte in the input byte stream, Bit1
is
the bit stream consisting of bit 1 of each byte in the input stream and so on. The set
of streams Bit0
through Bit7
are known as the Input Data |
<
|
t
|
a
|
g
|
/
|
>
|
ASCII |
00111100
|
01110100
|
01100001
|
01100111
|
00101111
|
00111110
|
Bit0 |
0
|
0
|
0
|
0
|
0
|
0
|
Bit1 |
0
|
1
|
1
|
1
|
0
|
0
|
Bit2 |
1
|
1
|
1
|
1
|
1
|
1
|
Bit3 |
1
|
1
|
0
|
0
|
0
|
1
|
Bit4 |
1
|
0
|
0
|
0
|
1
|
1
|
Bit5 |
1
|
1
|
0
|
1
|
1
|
1
|
Bit6 |
0
|
0
|
0
|
1
|
1
|
1
|
Bit7 |
0
|
0
|
1
|
1
|
1
|
0
|
Bit0/Bit1
, Bit2/Bit3
, Bit4/Bit5
, and
Bit6/Bit7
pairs. In the final stage, the streams are further subdivided
in the individual bit streams. Input Data |
> 

|
GenRefs |
_11______________
|
DecRefs |
_______11________
|
HexRefs |
______________11_
|
DelMask |
111__1111__11111_
|
ErrorFlag |
_________________
|
Input Data |
> &#, &#x;
|
GenRefs |
_11___________
|
DecRefs |
______________
|
HexRefs |
______________
|
DelMask |
111__11__111__
|
ErrorFlag |
_______1____1_
|
LAngle | Marks the position of any left angle bracket character. |
RAngle | Marks the position of any right angle bracket character. |
LBracket | Marks the position of any left square bracker character. |
RBracket | Marks the position of any right square bracket character. |
Exclam | Marks the position of any exclamation mark character. |
QMark | Marks the position of any question mark character. |
Hyphen | Marks the position of any hyphen character. |
Equals | Marks the position of any equal sign character. |
SQuote | Marks the position of any single quote character. |
DQuote | Marks the position of any double quote character. |
Slash | Marks the position of any forward slash character |
NameScan | Marks the position of any XML name character. |
WS | Marks the position of any XML 1.0 whitespace character. |
PI_start | Marks the position of the start of any processing instruction at the '?' character position. |
PI_end | Marks the position of any end of any processing instruction at the '>' character position. |
CtCD_start | Marks the position of the start of any comment or CDATA section at the '!' character position. |
EndTag_start | Marks the position of any end tag at the '/' character position. |
CD_end | Marks the position of the end of any CDATA section at the '>' character position. |
DoubleHyphen | Marks the position of any double hyphen character. |
RefStart | Marks the position of any ampersand character. |
Hash | Marks the position of any hash character. |
x | Marks the position of any 'x' character. |
Digit | Marks the position of any digit. |
Hex | Marks the position of any hexidecimal character. |
Semicolon | Marks the position of any semicolon character. |
Input Data |
<tag><tag> text <
> </tag></tag>
|
LAngle |
1____1______________________1_____1_____
|
RAngle |
____1____1_______________________1_____1
|
WS |
__________1____1____1______1____________
|
RefStart |
________________1____1__________________
|
Hex |
__1____1____1___________11_____1_____1__
|
Semicolon |
___________________1______1_____________
|
Slash |
_____________________________1_____1____
|
Input Data | A Text in Farsi: ى ك م ت ن ف ا ر س ى |
High Nybbles |
42567726624677632D8DBDBDAD82D8DAD82D8D8
|
Low Nybbles |
10458409E061239A099838187910968A9509399
|
u8Unibyte |
11111111111111111__________1______1____
|
u8Prefix |
_________________1_1_1_1_1__1_1_1__1_1_
|
u8Suffix |
__________________1_1_1_1_1__1_1_1__1_1
|
u8Prefix2 |
_________________1_1_1_1_1__1_1_1__1_1_
|
u8Scope22 |
__________________1_1_1_1_1__1_1_1__1_1
|
ErrorFlag |
_______________________________________
|
Input Data |
first line C second line CL third line L one more C nothing
left
|
CR |
-----------1-------------1------------------------1-------------
|
LF |
--------------------------1------------1------------------------
|
DelMask |
--------------------------1-------------------------------------
|
Input Data | <?php?> <!-- example --> <![CDATA[ shift: a<<1 ]]> |
CD_Span | ___________________________1111111111111111111111_ |
Ct_Span | ___________111111111111___________________________ |
PI_Span | _11111____________________________________________ |
CtCDPI_Mask | _111111__111111111111111__111111111111111111111111 |
ErrorFlag | __________________________________________________ |
Input Data |
<root><t1>text</t1><t2
a1='foo' a2 =
'fie'>more</t2><tag3
att3='b'/></root>
|
ElemNames |
_1111__11___________11_______________________________1111__________________
|
AttNames |
_______________________11_______11________________________1111_____________
|
AttrVals |
__________________________11111______11111_____________________111_________
|
EmptyTagEnds |
___________________________________________________________________1_______
|
EndTags |
_______________111______________________________111__________________11111_
|
Start/EmptyTags |
_1111__11___________1111111111111111111111___________11111111111111________
|
ErrorFlag |
___________________________________________________________________________
|