HTML Parser class for the Finder indexer package.
- Depuis
- 2.5
◆ parse()
Method to parse input and extract the plain text. Because this method is called from both inside and outside the indexer, it needs to be able to batch out its parsing functionality to deal with the inefficiencies of regular expressions. We will parse recursively in 2KB chunks.
- Paramètres
-
string | $input | The input to parse. |
- Renvoie
- string The plain text input.
- Depuis
- 2.5
Références $input, et ENT_QUOTES.
◆ process()
Method to process HTML input and extract the plain text.
- Paramètres
-
string | $input | The input to process. |
- Renvoie
- string The plain text input.
- Depuis
- 2.5
Références $input.
◆ removeBlocks()
removeBlocks |
( |
|
$input, |
|
|
|
$startTag, |
|
|
|
$endTag |
|
) |
| |
|
private |
Method to remove blocks of text between a start and an end tag. Each block removed is effectively replaced by a single space.
Note: The start tag and the end tag must be different. Note: Blocks must not be nested. Note: This method will function correctly with multi-byte strings.
- Paramètres
-
string | $input | String to be processed. |
string | $startTag | String representing the start tag. |
string | $endTag | String representing the end tag. |
- Renvoie
- string with blocks removed.
- Depuis
- 3.4
Références $input, Joomla\Database\Query\$offset, $return, et $start.
La documentation de cette classe a été générée à partir du fichier suivant :