Zeta Components - high quality PHP components

Class ezcDocumentWikiCreoleTokenizer

Tokenizer for Creole wiki documents.

The Creole wiki syntax is an ongoing effort to unify wiki markup languages. Its documentation can be found at:

http://www.wikicreole.org/

For the basic workings of the tokenizer, see the class-level documentation of the ezcDocumentWikiTokenizer class.

Source for this file: /Document/src/document/wiki/tokenizer/creole.php

ezcDocumentWikiTokenizer
   |
   --ezcDocumentWikiCreoleTokenizer

Constants

SPECIAL_CHARS = '/*^,#_~\\\\\\[\\]{}|=-' Special characters that carry special meaning and thus may not have been matched otherwise.
TEXT_END_CHARS = '/*^,#_~\\\\\\[\\]{}|=\\r\\n\\t\\x20-' Characters ending a pure text section.
WHITESPACE_CHARS = '[\\x20\\t]' Common whitespace characters. The vertical tab is excluded, because it causes strange problems with PCRE.
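
A character list such as TEXT_END_CHARS is suited for use inside a negated PCRE character class. The following is a minimal sketch of that idea, assuming the value shown above is the single-quoted PHP source literal. The pattern and the variable names are hypothetical and chosen for illustration only; the expressions the tokenizer actually builds may differ.

```php
<?php
// Copy of the documented TEXT_END_CHARS value (assumption: this is the
// single-quoted PHP literal, so each "\\" is one backslash in the regex).
$textEndChars = '/*^,#_~\\\\\\[\\]{}|=\\r\\n\\t\\x20-';

// Hypothetical pattern: match a run of characters that do not end a pure
// text section. Parentheses serve as delimiters, so "/" and "#" inside the
// character class need no extra escaping.
$pattern = '(\\G[^' . $textEndChars . ']+)';

preg_match($pattern, 'Hello **world**', $match);

// The match stops at the first space, which is listed as \x20.
echo $match[0], "\n";
```

The trailing "-" in the list stays literal because it is the last character before the closing bracket of the class.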

Inherited Member Variables

From ezcDocumentWikiTokenizer
protected ezcDocumentWikiTokenizer::$tokens

Method Summary

public void __construct( )
Construct tokenizer
protected array filterTokens( $tokens )
Filter tokens
protected void parsePluginContents( $plugin )
Parse plugin contents

Inherited Methods

From ezcDocumentWikiTokenizer
public abstract void ezcDocumentWikiTokenizer::__construct()
Construct tokenizer
protected void ezcDocumentWikiTokenizer::convertTabs()
Convert tabs to spaces
protected abstract array ezcDocumentWikiTokenizer::filterTokens()
Filter tokens
public array ezcDocumentWikiTokenizer::tokenizeFile()
Tokenize the given file
public array ezcDocumentWikiTokenizer::tokenizeString()
Tokenize the given string

Methods

__construct

void __construct( )

Construct tokenizer

Create the token array with regular expressions matching the respective tokens.

Redefinition of:
Method Description
ezcDocumentWikiTokenizer::__construct() Construct tokenizer

filterTokens

array filterTokens( $tokens )

Filter tokens

Method to filter tokens after the input string has been tokenized. The filter should extract additional information from the tokens that is not yet available, such as the depth of a title, which depends on the title markup.

Parameters:
Name Type Description
$tokens array
Redefinition of:
Method Description
ezcDocumentWikiTokenizer::filterTokens() Filter tokens
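
The title depth mentioned in the filterTokens description can be derived from the markup itself. The sketch below is not the component's implementation; creoleTitleLevel() is a hypothetical helper that assumes the title markup arrives as a plain string and that Creole headings use one to six equals signs.

```php
<?php
/**
 * Hypothetical helper, not part of the Document component: derive the
 * heading level from Creole title markup such as "== ".
 */
function creoleTitleLevel(string $markup): int
{
    // Count the leading equals signs, ignoring surrounding whitespace.
    $level = strspn(trim($markup), '=');

    // Creole defines six heading levels, so longer runs are clamped.
    // Markup without any equals sign yields 0.
    return min(6, $level);
}

echo creoleTitleLevel('== '), "\n";
echo creoleTitleLevel('========'), "\n";
```

A filter following this approach would store the computed level on the title token so that later processing does not need to inspect the raw markup again.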

parsePluginContents

void parsePluginContents( ezcDocumentWikiPluginToken $plugin )

Parse plugin contents

Plugins differ completely between wiki dialects, and their contents should not be passed through the normal wiki parser. The plugin contents are therefore fetched as a whole, and each tokenizer extracts the name and parameters from the complete token itself.

Parameters:
Name Type Description
$plugin ezcDocumentWikiPluginToken
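
As an illustration of extracting a name and parameters from a complete plugin token, the following sketch parses text of the assumed form <<name key="value" ...>>. The function parseCreolePlugin() and the returned array layout are hypothetical; the properties that parsePluginContents() actually sets on ezcDocumentWikiPluginToken are not described here and may differ.

```php
<?php
/**
 * Hypothetical helper, not part of the Document component: split plugin
 * text of the assumed form <<name key="value" ...>> into its parts.
 */
function parseCreolePlugin(string $text): ?array
{
    // Capture the plugin name and the raw parameter string.
    $outer = '(^<<\\s*(?P<name>[\\w-]+)(?P<params>.*?)\\s*>>$)s';
    if (!preg_match($outer, $text, $m)) {
        return null; // Not a plugin token in the assumed form.
    }

    // Collect every key="value" pair from the parameter string.
    $inner = '((?P<key>[\\w-]+)="(?P<value>[^"]*)")';
    preg_match_all($inner, $m['params'], $pairs, PREG_SET_ORDER);

    $parameters = [];
    foreach ($pairs as $pair) {
        $parameters[$pair['key']] = $pair['value'];
    }

    return ['name' => $m['name'], 'parameters' => $parameters];
}

$plugin = parseCreolePlugin('<<toc depth="2" style="plain">>');
echo $plugin['name'], "\n";
```

Keeping this extraction inside the tokenizer matches the reasoning above: the plugin body never reaches the regular wiki token rules.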
Documentation generated by phpDocumentor 1.4.3