Moodle PHP Documentation 4.3
Moodle 4.3.5 (Build: 20240610) (7dcfaa79f78)
HTMLPurifier_Lexer_DirectLex Class Reference
Inheritance: HTMLPurifier_Lexer_DirectLex extends HTMLPurifier_Lexer.

Public Member Functions

 extractBody ($html)
 Takes a string of HTML (fragment or document) and returns the content.
 
 normalize ($html, $config, $context)
 Takes a piece of HTML and normalizes it by converting entities, fixing encoding, extracting bits, and other good stuff.
 
 parseAttr ($string, $config)
 
 parseAttributeString ($string, $config, $context)
 Takes the inside of an HTML tag and makes an assoc array of attributes.
 
 parseData ($string, $is_attr, $config)
 Parses special entities into the proper characters.
 
 parseText ($string, $config)
 
 tokenizeHTML ($html, $config, $context)
 

Static Public Member Functions

static create ($config)
 Retrieves or sets the default Lexer as a Prototype Factory.
 

Public Attributes

 $tracksLineNumbers = true
 @type bool
 

Protected Member Functions

 scriptCallback ($matches)
 Callback function for script CDATA fudge.
 
 substrCount ($haystack, $needle, $offset, $length)
 PHP 5.0.x compatible substr_count that implements offset and length.
 

Static Protected Member Functions

static CDATACallback ($matches)
 Callback function for escapeCDATA() that does the work.
 
static escapeCDATA ($string)
 Translates CDATA sections into regular sections (through escaping).
 
static escapeCommentedCDATA ($string)
 Special CDATA case that is especially convoluted for <script>
 
static removeIEConditional ($string)
 Special Internet Explorer conditional comments should be removed.
 

Protected Attributes

 $_special_entity2str
 Most common entity to raw value conversion table for special entities.
 
 $_whitespace = "\x20\x09\x0D\x0A"
 Whitespace characters for str(c)spn.
 

Member Function Documentation

◆ CDATACallback()

static HTMLPurifier_Lexer::CDATACallback ( $matches)
static protected inherited

Callback function for escapeCDATA() that does the work.

Warning
Though this is public in order to let the callback happen, calling it directly is not recommended.
Parameters
array $matches PCRE matches array, with index 0 the entire match and 1 the inside of the CDATA section.
Return values
string Escaped internals of the CDATA section.

◆ create()

static HTMLPurifier_Lexer::create ( $config)
static inherited

Retrieves or sets the default Lexer as a Prototype Factory.

By default HTMLPurifier_Lexer_DOMLex will be returned. There are a few exceptions involving special features that only DirectLex implements.

Note
The behavior of this class has changed: rather than accepting a prototype object, it now accepts a configuration object. To specify your own prototype, set Core.LexerImpl to it. This change in behavior de-singletonizes the lexer object.
Parameters
HTMLPurifier_Config $config
Return values
HTMLPurifier_Lexer
Exceptions
HTMLPurifier_Exception
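
As a usage sketch of the note above, DirectLex can be requested through Core.LexerImpl before calling create(). The snippet assumes HTML Purifier has already been loaded by the host application (how that happens is not covered here) and returns null when it has not. The helper name sketch_tokenize_with_directlex is hypothetical.

```php
<?php
// Hypothetical helper: tokenizes $html with DirectLex if HTML Purifier is available.
function sketch_tokenize_with_directlex(string $html): ?array
{
    // Assumption: the library is loaded or autoloadable; otherwise bail out.
    if (!class_exists('HTMLPurifier_Config')) {
        return null;
    }
    $config = HTMLPurifier_Config::createDefault();
    // Ask the factory for DirectLex instead of the default DOMLex.
    $config->set('Core.LexerImpl', 'DirectLex');
    $context = new HTMLPurifier_Context();

    $lexer = HTMLPurifier_Lexer::create($config);
    return $lexer->tokenizeHTML($html, $config, $context);
}
```

When the library is present, the return value should be an array of HTMLPurifier_Token objects, as listed under tokenizeHTML().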

◆ escapeCDATA()

static HTMLPurifier_Lexer::escapeCDATA ( $string)
static protected inherited

Translates CDATA sections into regular sections (through escaping).

Parameters
string $string HTML string to process.
Return values
string HTML with CDATA sections escaped.
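
The escaping step can be illustrated with a regex callback. This is a minimal standalone sketch, not the class's own code: the function name sketch_escape_cdata and the exact pattern are assumptions chosen to match the description (group 1 is the inside of the CDATA section).

```php
<?php
// Hypothetical standalone version: replaces each CDATA section with its
// HTML-escaped contents, so the result contains only regular markup text.
function sketch_escape_cdata(string $html): string
{
    return preg_replace_callback(
        '/<!\[CDATA\[(.+?)\]\]>/s',
        // $matches[0] is the whole section, $matches[1] its inside.
        function (array $matches): string {
            return htmlspecialchars($matches[1], ENT_COMPAT, 'UTF-8');
        },
        $html
    );
}
```

For example, the input a<![CDATA[<b>]]>z would come back as a&lt;b&gt;z.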

◆ escapeCommentedCDATA()

static HTMLPurifier_Lexer::escapeCommentedCDATA ( $string)
static protected inherited

Special CDATA case that is especially convoluted for <script>

Parameters
string $string HTML string to process.
Return values
string HTML with CDATA sections escaped.

◆ extractBody()

HTMLPurifier_Lexer::extractBody ( $html)
inherited

Takes a string of HTML (fragment or document) and returns the content.

Todo
Consider making protected
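
A rough sketch of the idea follows: if the input is a full document, return what sits between the body tags; if it is a fragment, return it unchanged. The function name sketch_extract_body and the pattern are assumptions for illustration, and the library's own method may apply further checks.

```php
<?php
// Hypothetical standalone version of "return the content of a fragment or document".
function sketch_extract_body(string $html): string
{
    // Case-insensitive, dot matches newlines; greedy so the last </body> wins.
    if (preg_match('!<body[^>]*>(.*)</body>!is', $html, $matches)) {
        return $matches[1];
    }
    // No body element found: treat the input as a fragment.
    return $html;
}
```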

◆ normalize()

HTMLPurifier_Lexer::normalize ( $html, $config, $context )
inherited

Takes a piece of HTML and normalizes it by converting entities, fixing encoding, extracting bits, and other good stuff.

Parameters
string $html HTML.
HTMLPurifier_Config $config
HTMLPurifier_Context $context
Return values
string
Todo
Consider making protected

◆ parseAttributeString()

HTMLPurifier_Lexer_DirectLex::parseAttributeString ( $string, $config, $context )

Takes the inside of an HTML tag and makes an assoc array of attributes.

Parameters
string $string Inside of tag excluding name.
HTMLPurifier_Config $config
HTMLPurifier_Context $context
Return values
array Assoc array of attributes.
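
To make the input and output shapes concrete, here is a simplified regex-based sketch. It is not the algorithm DirectLex uses, it does not decode entities in values, and the name sketch_parse_attributes is hypothetical. In this sketch a valueless attribute maps to its own name.

```php
<?php
// Hypothetical helper: turns the inside of a tag (without the tag name)
// into an associative array of attribute name => value.
function sketch_parse_attributes(string $inside): array
{
    // name, then optionally = followed by a "double", 'single', or unquoted value.
    $pattern = '/([^\s=\/>"\']+)(?:\s*=\s*(?:"([^"]*)"|\'([^\']*)\'|([^\s"\'>]+)))?/';
    preg_match_all($pattern, $inside, $sets, PREG_SET_ORDER | PREG_UNMATCHED_AS_NULL);

    $attributes = [];
    foreach ($sets as $m) {
        $name = strtolower($m[1]);
        // Take whichever value group matched; fall back to the name itself.
        $attributes[$name] = $m[2] ?? $m[3] ?? $m[4] ?? $m[1];
    }
    return $attributes;
}
```

Calling it with ' href="a.html" width=10 disabled' would give the keys href, width, and disabled.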

◆ parseData()

HTMLPurifier_Lexer::parseData ( $string, $is_attr, $config )
inherited

Parses special entities into the proper characters.

This method translates escaped versions of the special characters into the correct ones.

Parameters
string $string String character data to be parsed.
Return values
string Parsed character data.
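
The translation of special entities can be sketched with a lookup table and strtr(). The table below copies the documented initial value of $_special_entity2str; the function name sketch_decode_special_entities is hypothetical, and the real method may handle more cases (other entities, the $is_attr flag) than this simplification does.

```php
<?php
// Hypothetical helper: decodes only the "special" entities from the table.
function sketch_decode_special_entities(string $text): string
{
    $specialEntityToString = [
        '&quot;' => '"',
        '&amp;'  => '&',
        '&lt;'   => '<',
        '&gt;'   => '>',
        '&#39;'  => "'",
        '&#039;' => "'",
        '&#x27;' => "'",
    ];
    // strtr() never rescans replaced text, so "&amp;lt;" becomes "&lt;", not "<".
    return strtr($text, $specialEntityToString);
}
```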

◆ removeIEConditional()

static HTMLPurifier_Lexer::removeIEConditional ( $string)
static protected inherited

Special Internet Explorer conditional comments should be removed.

Parameters
string $string HTML string to process.
Return values
string HTML with conditional comments removed.
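
As an illustration, conditional comments of the form <!--[if ...]> ... <![endif]--> can be stripped with a single regex. The pattern and the name sketch_remove_ie_conditional are assumptions for this sketch and may differ from what the library uses.

```php
<?php
// Hypothetical helper: removes "downlevel-hidden" IE conditional comments,
// including everything between the opening and closing markers.
function sketch_remove_ie_conditional(string $html): string
{
    // s: dot matches newlines; i: "if"/"endif" matched case-insensitively.
    return preg_replace('#<!--\[if [^>]+\]>.*?<!\[endif\]-->#si', '', $html);
}
```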

◆ scriptCallback()

HTMLPurifier_Lexer_DirectLex::scriptCallback ( $matches)
protected

Callback function for script CDATA fudge.

Parameters
array $matches, in the form array(opening tag, contents, closing tag)
Return values
string
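
The parameter shape above suggests a callback that keeps the tags and escapes only the script contents. The sketch below shows that idea under stated assumptions: the surrounding regex, the choice of htmlspecialchars(), and the name sketch_escape_script_contents are illustrative and not taken from this page.

```php
<?php
// Hypothetical helper: escapes the contents of <script> elements so that
// characters such as "<" inside the script no longer look like markup.
function sketch_escape_script_contents(string $html): string
{
    return preg_replace_callback(
        '#(<script[^>]*>)(.+?)(</script>)#si',
        // $matches = [whole match, opening tag, contents, closing tag]
        function (array $matches): string {
            return $matches[1]
                . htmlspecialchars($matches[2], ENT_COMPAT, 'UTF-8')
                . $matches[3];
        },
        $html
    );
}
```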

◆ substrCount()

HTMLPurifier_Lexer_DirectLex::substrCount ( $haystack, $needle, $offset, $length )
protected

PHP 5.0.x compatible substr_count that implements offset and length.

Parameters
string $haystack
string $needle
int $offset
int $length
Return values
int
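
A compatible count can be emulated by slicing the haystack first and counting in the slice. This is a minimal sketch with the hypothetical name sketch_substr_count. On current PHP versions the native substr_count() accepts offset and length directly. A lexer that tracks line numbers could plausibly use such a helper to count newlines in a consumed segment, though that use is an assumption here.

```php
<?php
// Hypothetical helper: counts occurrences of $needle within the window
// [$offset, $offset + $length) of $haystack.
function sketch_substr_count(string $haystack, string $needle, int $offset, int $length): int
{
    // Slice first, then count: equivalent to substr_count() with offset/length.
    return substr_count(substr($haystack, $offset, $length), $needle);
}
```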

◆ tokenizeHTML()

HTMLPurifier_Lexer_DirectLex::tokenizeHTML ( $html, $config, $context )
Parameters
string $html
HTMLPurifier_Config $config
HTMLPurifier_Context $context
Return values
array|HTMLPurifier_Token[]

Reimplemented from HTMLPurifier_Lexer.

Member Data Documentation

◆ $_special_entity2str

HTMLPurifier_Lexer::$_special_entity2str
protected inherited
Initial value:
= array(
    '&quot;' => '"',
    '&amp;' => '&',
    '&lt;' => '<',
    '&gt;' => '>',
    '&#39;' => "'",
    '&#039;' => "'",
    '&#x27;' => "'"
)

Most common entity to raw value conversion table for special entities.

@type array

◆ $_whitespace

HTMLPurifier_Lexer_DirectLex::$_whitespace = "\x20\x09\x0D\x0A"
protected

Whitespace characters for str(c)spn.

@type string
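
The "str(c)spn" remark refers to PHP's strspn() and strcspn(), which measure runs of characters inside or outside a given set. The sketch below shows how a whitespace mask like this one can be used to skip whitespace and then read the next word. The function name sketch_next_word is hypothetical and the routine is an illustration, not code from the class.

```php
<?php
// Hypothetical helper: starting at $offset, skips whitespace and returns
// [next word, offset just past that word].
function sketch_next_word(string $text, int $offset, string $whitespace): array
{
    // strspn(): length of the run made up only of whitespace characters.
    $start = $offset + strspn($text, $whitespace, $offset);
    // strcspn(): length of the run containing no whitespace characters.
    $length = strcspn($text, $whitespace, $start);
    return [substr($text, $start, $length), $start + $length];
}

// Same four characters as $_whitespace: space, tab, CR, LF.
$whitespace = "\x20\x09\x0D\x0A";
[$word, $next] = sketch_next_word(" \t\nclass id", 0, $whitespace);
```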


The documentation for this class was generated from the following file: